Text Generation Models — Page 333
41,532
sychonix · Warm · Tools · 500M · 32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-foxy_squeaky_llama
SunshineAndRain · Warm · Tools · 3B · 32K
Clinical-R1-3B-Cold-Start
nmnmnagi88 · Warm · Tools · 500M · 32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-dextrous_unseen_shrimp
Baon2024 · Warm · Tools · 500M · 32K
Qwen2.5-0.5B-SFT-training3
0xHanta · Warm · Tools · 500M · 32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-small_playful_komodo
mohitskaushal · Warm · Tools · 500M · 32K
qwen2-0.5B-geo-merged-lora-ft
tommymir4444 · Warm · Tools · 500M · 32K
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-gentle_vigilant_capybara
WebScraper991923 · Warm · Tools · 4B · 32K
Affine-S1-5F73918k99jZF2qzmyzrKGPsDkKQGTyzBzXrw2WihXb57HJB
jhn9803 · Warm · Tools · 2B · 32K
Qwen2.5-MATH-1.5B-Instruct-DAPO-G8
asingh15 · Warm · Tools · 4B · 32K
arc-abs-sft-oracle-lr5e-6-ep1-0104
koutch · Warm · Tools · 4B · 32K
short_paper_qwent_qwen3-thinking-4b_train_sft_all_train_no_think
koutch · Warm · Tools · 4B · 32K
short_paper_qwen_0.json_train_dpo_v1_dev
gjyotin305 · Warm · Tools · 3B · 32K
Qwen2.5-3B-Instruct_old_sft_alpaca_007
azherali · Warm · Tools · 2B · 32K
Qwen2.5-1.5B-Instruct-dpo
akseljoonas · Warm · Tools · 4B · 32K
qwen3-4b-dpo-hh-rlhf-reversed
gjyotin305 · Warm · Tools · 3B · 32K
Llama-3.2-3B-Instruct_old_sft_alpaca_005
koutch · Warm · Tools · 4B · 32K
short_paper_qwen_qwen3-instruct-4b_train_sft_train_think
gjyotin305 · Warm · Tools · 3B · 32K
Llama-3.2-3B-Instruct_old_sft_alpaca_003
giovannidemuri · Warm · Tools · 3B · 32K
llama-3.2-3b-distilled-vpi
koutch · Warm · Tools · 4B · 32K
paper_qwen_qwen3-instruct-4b_train_sft_train_no_think
souradeepmukhopadhyay99 · Warm · Tools · 4B · 32K
qwen3-4b-apigenmt-5k-trl-fullft
colsonlen · Warm · Tools · 500M · 32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-sturdy_fleecy_chinchilla
sagnikM · Warm · Tools · 2B · 32K
grpo_rmsprop_qwen3_1p7b_3k_seqlen_1e-5
moo3030 · Warm · Tools · 1B · 32K
Llama-3.2-1B-Summarizer-merged
yusufcelebi · Warm · Tools · 8B · 32K
qwen3-8B-Base-orca_math-sparse-LoRA-step180-merged
gjyotin305 · Warm · Tools · 3B · 32K
Llama-3.2-3B-Instruct_new_alpaca_009
koutch · Warm · Tools · 4B · 32K
short_paper_qwen_2.json_train_dpo_v2_train_no_think
ksuchoi216 · Warm · Tools · 800M · 32K
asingh15 · Warm · Tools · 4B · 32K
qwen-arc-abs-gpt5.2-sft-1epoch-icmlpaper-0125
jwkirchenbauer · Warm · Tools · 4B · 32K
daint_prod_ift_q3-4b_1N4n_16cdce0f_step-00100160
morningtea006 · Warm · Tools · 4B · 32K
affine-horse-5Hg1K2prUdnvSnG7m3mZBmF9hyo8zu8Z4miJSYsfe9Hpvgcu
nakamuratoshiya · Warm · Tools · 4B · 32K
viamr-project · Warm · Tools · 2B · 32K
amr-parsing-grpo-single-single-turn-20260203-0853-global-step-622
prithivMLmods · Warm · Tools · 3B · 32K