Text Generation Models — Page 334

41,545
prithivMLmodsWarmTools3B32K

Qwen2.5-3B-Tamil-Exp

2
·
18
·
Feb 2025
UmezakiWarmTools4B32K

dpo-qwen-cot-merged

0
·
18
·
Feb 2026
duong942001WarmTools4B32K

dpo-qwen-cot-merged1

0
·
18
·
Feb 2026
SillyWumpusWarmTools4B32K

dpo-qwen-cot-merged

0
·
18
·
Feb 2026
aobu04WarmTools4B32K

dpo-qwen-cot-merged

0
·
18
·
Feb 2026
prithivMLmodsWarmTools800M32K

rStar-Coder-Qwen3-0.6B

8
·
18
·
Aug 2025
AshleyQu0311WarmTools4B32K

dpo-qwen-cot-merged

0
·
18
·
Feb 2026
ferrazzipietroWarmTools1B32K

unsup-Llama-3.2-1B-Instruct-lora

0
·
18
·
Feb 2026
ZhiqiEliWangWarmTools2B32K

ds_r1_1.5b_psyscam_romance_ephishllm

0
·
18
·
Feb 2026
kazuyamaaWarmTools4B32K

dpo-qwen-cot-merged

0
·
18
·
Feb 2026
thangvipWarmTools2B32K

qwen3-1.7b-dspo-no-sft-sgd-linear-6500

0
·
18
·
Feb 2026
hikahikaWarmTools4B32K

dpo-qwen-cot-merged

0
·
18
·
Feb 2026
mark-22WarmTools4B32K

dpo-qwen-cot-merged-dataclearn3

0
·
18
·
Feb 2026
koutchWarmTools4B32K

qwen_falcon_qwen3-instruct-4b_train_sft_2.json

0
·
18
·
Feb 2026
SasanoHideoWarmTools4B32K

qwen3-4b-dpo-qwen-cot-merged-rev.01

0
·
18
·
Feb 2026
deepkickWarmTools4B32K

qwen3-4b-struct-dpo-v14-b0.10-L2048-merged

0
·
18
·
Feb 2026
dnotitiaWarmTools4B32K

Qwen3-4B-Thinking-2507

0
·
18
·
Jan 2026
koutchWarmTools4B32K

qwen_falcon_6.json_train_dpo_v1_2.json

0
·
18
·
Feb 2026
Taiko56WarmTools4B32K

dpo-qwen-cot-merged

0
·
18
·
Feb 2026
ryzaxWarmTools800M32K

xxx

0
·
18
·
Jan 2026
NovacianoWarm1B32K

Heretic.Erudite_v2-1B

0
·
18
·
Feb 2026
reiwa7WarmTools4B32K

dpo-qwen-cot-merged-s250

0
·
18
·
Feb 2026
KawausoHiroKawausoWarmTools4B32K

qwen3-4b-structeval-lora-39

0
·
18
·
Feb 2026
abcorreaWarmTools4B32K

sched-v4

0
·
18
·
Feb 2026
open-unlearningWarmTools1B32K

unlearn_tofu_Llama-3.2-1B-Instruct_forget10_RMU_lr5e-05_layer10_scoeff10_epoch5

0
·
18
·
May 2025
open-unlearningWarmTools1B32K

unlearn_tofu_Llama-3.2-1B-Instruct_forget10_GradDiff_lr5e-05_alpha5_epoch5

0
·
18
·
May 2025
jinkami07WarmTools4B32K

dpo-qwen-cot-merged

0
·
18
·
Feb 2026
ferrazzipietroWarmTools1B32K

Llama-3.2-1B-Instruct-unsup-crf-full-weight-merged

0
·
18
·
Feb 2026
adpretkoWarmTools2B32K

train-riscv-O2_epoch1and2

0
·
18
·
Oct 2025
HaicaochiWarmTools500M32K

Qwen_05_txtt_V2

0
·
18
·
Nov 2025
Aname-TommyWarmTools2B32K

slm-ft-test

0
·
18
·
Feb 2026
NeoMihRamWarm3B8K

RHAM_ID_DeepForge_V1_1

1
·
18
·
Jan 2026
FlameF0XWarmTools500M32K

ruvltra-claude-code-safetensors

1
·
18
·
Feb 2026
Guilherme34WarmTools3B32K

Firefly-V2.5

3
·
18
·
Feb 2026
ferrazzipietroWarmTools1B32K

Llama-3.1-8B-Instruct-unsup-crf-lora-lowlr-merged

0
·
18
·
Feb 2026
PhonsiriWarm3B8K

gemma-2-2b-CoT-sft-thing-format-moredataset-sft2-fix

0
·
18
·
Feb 2026
nyanntoWarmTools4B32K

dpo-qwen-cot-merged11

0
·
18
·
Feb 2026
KYoshimWarmTools4B32K

dpo-qwen-cot-merged

0
·
18
·
Feb 2026
LorenaYannnnnWarmTools800M32K

20260217-Qwen3-0.6B_grpo_sycophancy_warmup_baseline_192000_episodes_seed_42

0
·
18
·
Feb 2026
shotalabWarmTools4B32K

Qwen3-4B-Instruct-SFT-03-Merged-DPO-01

0
·
18
·
Feb 2026
mehuldamaniWarmTools8B32K

sft-base-half-tranches-v1-global-step-394

0
·
18
·
Dec 2025
kedumerikugameWarmTools4B32K

dpo-qwen-cot-merged

0
·
18
·
Feb 2026