Text Generation Models — Page 317
41,391
sagnikM | Warm | Tools | 2B | 32K
grpo_sgd_qwen3_1p7b_3k-seqlen_momentum_0p9_1e-2
AlignmentResearch | Warm | Tools | 70B | 32K
hr_hand_crafted_Llama-3.3-70B_medium_parity_15_epochs_merged_v1
ahmadmakk | Warm | Tools | 500M | 32K
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-subtle_shrewd_grouse
canoplos | Warm | Tools | 500M | 32K
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-soft_gilded_alligator
sleeepeer | Warm | Tools | 8B | 32K
meta-llama-Llama-3.1-8B-Instruct-cold_start-dolly_exclude_0114-42-202601142342
Norrawee | Warm | Tools | 4B | 32K
Qwen3-4B-Thinking-2507-exp04
ali-elganzory | Warm | Tools | 2B | 32K
Qwen3-1.7B-Base-SFT-Tulu3-decontaminated
teetone | Warm | Tools | 2B | 32K
OpenR1-Distill-Qwen3-1.7B-Math
cdomingoenrich | Warm | Tools | 2B | 32K
qwen15_code200tok_step1750
Norrawee | Warm | Tools | 4B | 32K
Qwen3-4B-Thinking-2507-exp06
AdrianReiter | Warm | Tools | 800M | 32K
Qwen3-Compliance-Medical-v1
0xHanta | Warm | Tools | 500M | 32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-small_playful_komodo
DheepLearning | Warm | Tools | 4B | 32K
iflow-metadata-qwen3-4b-sft-128k
sagarchapara | Warm | Tools | 4B | 32K
qwen3-4b-thinking-aimo-numina-cot-sft
Guilherme34 | Warm | Tools | 3B | 32K
gjyotin305 | Warm | Tools | 3B | 32K
Llama-3.2-3B-Instruct_old_sft_alpaca_009
gjyotin305 | Warm | Tools | 3B | 32K
Llama-3.2-3B-Instruct_old_sft_alpaca_005
koutch | Warm | Tools | 4B | 32K
short_paper_qwen_1.json_train_dpo_v4_train_no_think
giovannidemuri | Warm | Tools | 3B | 32K
llama-3.2-3b-distilled-ctba
giovannidemuri | Warm | Tools | 3B | 32K
llama-3.2-3b-distilled-mtba
GreatGoose | Warm | Tools | 3B | 32K
Qwen2.5-3B-Instruct-full-loglm
Raziel1234 | Warm | Tools | 500M | 32K
sjelassi | Warm | Tools | 2B | 32K
qwen_25_1_5b_swallow_code_unstructured
g-assismoraes | Warm | Tools | 4B | 32K
colsonlen | Warm | Tools | 500M | 32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-sturdy_fleecy_chinchilla
tommymir4444 | Warm | Tools | 800M | 32K
Qwen3-0.6B-Gensyn-Swarm-lively_darting_penguin
akshayballal | Warm | Tools | 4B | 32K
Qwen3-4B-Pubmed-16bit-GRPO
Lambent | Warm | Tools | 4B | 32K
Qwen3-4B-Base-Continued-GRPO-Merge
ksuchoi216 | Warm | Tools | 800M | 32K
asingh15 | Warm | Tools | 4B | 32K
qwen-arc-abs-gpt5.2-sft-1epoch-icmlpaper-0125