Text Generation Models — Page 349
42,728
morningtea006 Warm Tools 4B 32K
affine-horse-5Hg1K2prUdnvSnG7m3mZBmF9hyo8zu8Z4miJSYsfe9Hpvgcu
hariharanv04 Warm Tools 4B 32K
qwen3-4b-instruct-meta-GRPO-2
nakotsuko13 Warm Tools 4B 32K
qwen3-4b-nako13-dpo-qwen-cot-merged
viamr-project Warm Tools 2B 32K
amr-parsing-grpo-single-single-turn-20260203-0853-global-step-622
TSerizawa Warm Tools 4B 32K
llm-lecture-2025_dpo-qwen-cot-merged_base_model
g-assismoraes Warm Tools 2B 32K
Qwen3-1.7B-CCC-merged-cp6-LR1e-4-irm
prithivMLmods Warm Tools 3B 32K
toenobu Warm Tools 4B 32K
utokyo-llm-advance-main-dpo
SillyWumpus Warm Tools 4B 32K
mihsato Warm Tools 4B 32K
dpo-qwen-cot-merged-mihsato-v1
stemask2985 Warm Tools 4B 32K
koutch Warm Tools 4B 32K
qwen_falcon_qwen3-instruct-4b_train_sft_0.json
koutch Warm Tools 4B 32K
qwen_qwen3-instruct-4b_train_grpo_v1_train_code
kamaboko2007 Warm Tools 4B 32K
thangvip Warm Tools 2B 32K
qwen3-1.7b-dspo-no-sft-sgd-linear-6500
mark-22 Warm Tools 4B 32K
dpo-qwen-cot-merged-dataclearn3
0xShyron Warm Tools 500M 32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-invisible_endangered_kangaroo
deepkick Warm Tools 4B 32K
qwen3-4b-struct-dpo-v11-merged
SasanoHideo Warm Tools 4B 32K
qwen3-4b-dpo-qwen-cot-merged-rev.01
gyorgy-ruzicska Warm Tools 3B 32K
lingua-news-llama-3-spanish-simplifier
irfan0858 Warm Tools 500M 32K
cdomingoenrich Warm Tools 1B 32K
Llama-3.2-1B-random-weights
CEIA-POSITIVO Warm Tools 2B 32K
Asib1 Warm Tools 500M 32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-pensive_leggy_ant
open-unlearning Warm Tools 1B 32K
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_RMU_lr5e-05_layer10_scoeff10_epoch5
open-unlearning Warm Tools 1B 32K
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_AltPO_lr5e-05_beta0.1_alpha5_epoch5
open-unlearning Warm Tools 1B 32K
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_AltPO_lr2e-05_beta0.1_alpha5_epoch5
Aname-Tommy Warm Tools 2B 32K
FlameF0X Warm Tools 500M 32K
ruvltra-claude-code-safetensors