Text Generation Models — Page 319
41,391
TermsofML | Warm | Tools | 500M | 32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-gilded_aquatic_sparrow
keijiban3 | Warm | Tools | 500M | 32K
thangvip | Warm | Tools | 2B | 32K
qwen2.5-1.5b-seq-dspo-sgd-linear
ferrazzipietro | Warm | Tools | 1B | 32K
unsup-Llama-3.2-1B-Instruct-datav2
ferrazzipietro | Warm | Tools | 2B | 32K
tatsuji1962 | Warm | Tools | 4B | 32K
sampluralis | Warm | Tools | 1B | 32K
mhmsadegh | Warm | Tools | 3B | 32K
Llama-3.2-3B-Instruct-3-sfand-cause-effect-model-lora
sonodd | Warm | Tools | 4B | 32K
qwen3-4b-structeval-sft-v4-lr2e5-merged
arif-butt | Warm | Tools | 1B | 32K
finetuned-llama-3.2-1b-it-merged
Hi-Satoh | Warm | Tools | 4B | 32K
adv_sft_dpo_final_3_merged
hiro7ka | Warm | Tools | 4B | 32K
dpo-qwen-cot-merged-ver3a
sxsaa | Warm | Tools | 3B | 32K
Qwen2.5-3B-Math-Verifier-FullData-v2.0
Corianas | Warm | Tools | 800M | 32K
Qwen3-0.6b_dataclaw_mallet
dgambettaphd | Warm | Tools | 800M | 32K
M_qw306_run0_gen0_WXS_doc1000_synt64_lr1e-04_acm_MPP
thangvip | Warm | Tools | 2B | 32K
qwen2.5-1.5b-gspo-sgd-linear
Takashi-0000 | Warm | Tools | 4B | 32K
D-AT2025 | Warm | Tools | 4B | 32K
dpo-qwen-cot-merged_120steps
sfutenma | Warm | Tools | 4B | 32K
dpo-qwen3_4b-cot-merged_v260302-093614
moushi21 | Warm | Tools | 4B | 32K
agent-bench-alfworld-merged3
moushi21 | Warm | Tools | 4B | 32K
agent-bench-dbbench-merged4
mrAxiomcartographer | Warm | Tools | 500M | 32K
weizhepei | Warm | Tools | 3B | 32K
Qwen2.5-3B-WebArena-Lite-SFT-CoT-QwQ-32B-epoch-10
megabytes | Warm | Tools | 500M | 32K
Qwen2.5-0.5B-Instruct-heretic
laion | Warm | Tools | 32B | 32K
sft_GLM-4-7-swesmith-sandboxes-with_tests-oracle_verified_120s-maxeps-131k_Qwen3-32B
rediska0123 | Warm | Tools | 2B | 32K
qwen2.5-math-1.5b-dpo-gsm8k-v2
dgambettaphd | Warm | Tools | 4B | 32K
M_qw34_run0_gen0_WXS_doc1000_synt64_lr1e-04_acm_MPP
LorenaYannnnn | Warm | Tools | 800M | 32K
20260306-confidence_only-Qwen3-0.6B_grpo_baseline_192000_episodes_seed_42
Fedir-Ilina | Warm | Tools | 1B | 32K
finetuned_llama3.1_1b_ollama_safe
sampluralis | Warm | Tools | 1B | 32K
llama-sft-proj-layers-shmid
nimabod | Warm | Tools | 500M | 32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-soaring_sprightly_antelope
Dario213 | Warm | Tools | 4B | 32K
Qwen3-4B-medical-reasoning