Text Generation Models — Page 336
41,570CEIA-POSITIVOWarmTools2B32K
kamaboko2007WarmTools4B32K
AdanatoWarmTools3B32K
qwen25_3b_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_2
mohtani777WarmTools4B32K
Qwen3_4B_SFT_DPOv3_agent_v0_LR5E7
Hi-SatohWarmTools4B32K
adv_sft_dpo_final_12_merged
hiro7kaWarmTools4B32K
dpo-qwen-cot-merged-ver3d
Rofex404WarmTools800M32K
lyraix-guard-qwen3-0.6b-vllm
Fedir-IlinaWarmTools1B32K
finetuned_llama3.1_1b_ollama_safe
madiwarPtasannaWarmTools800M32K
j05hr3dWarmTools1B32K
Llama-3.2-1B-Instruct-C_M
zkaediWarm9B16K
gemma-2-9b-solidity-merged
Dario213WarmTools4B32K
Qwen3-4B-medical-reasoning
yonioz123WarmTools3B32K
Llama-3.2-3B-Hebrew-Master
rookshanksWarmTools800M32K
iproskurinaWarmTools500M32K
qwen-hf-fewshot-iter-iter1
shaohongwuWarmTools500M32K
Qwen2.5-0.5B-Preweb-special-tokens
HyeongwonWarmTools4B32K
P2-split3_prob_Qwen3-4B-Base_0312-01
ramyaa1113Warm3B8K
gemma2b-webxr-showroom-v2
channelableWarmTools4B32K
akseljoonasWarmTools2B32K
Qwen3-1.7B-SFT-s1K-lr2eneg05
LorenaYannnnnWarmTools800M32K
sycophancy-Qwen3-0.6B-baseline_all_tokens-seed_0
LorenaYannnnnWarmTools800M32K
sycophancy-Qwen3-0.6B-baseline_all_tokens-seed_2
RyanYrWarmTools2B32K
slf-dstl_Q2.5-1.5B-It_science_SFT
Devcavi19WarmTools800M32K
xw1234ganWarmTools2B32K
Merging_Qwen2.5-1.5B-Instruct_MedQA_lr1e-05_mb2_ga128_n2048_seed42
xw1234ganWarmTools3B32K
SFT_Qwen2.5-3B-Instruct_MedQA
xw1234ganWarmTools3B32K
GRPO_KL_Qwen2.5-3B-Instruct_MedQA_beta0.01_lr1e-05_mb2_ga128_n2048_seed42
LorenaYannnnnWarmTools800M32K
sycophancy-Qwen3-0.6B-OURS_self-seed_2
LorenaYannnnnWarmTools800M32K
general_reward-Qwen3-0.6B-baseline_all_tokens-seed_1
osieosieWarmTools4B32K
tmax-qwen3-4b-sft-20260316-100k-asst-loss
LorenaYannnnnWarmTools800M32K
confidence-Qwen3-0.6B-baseline_all_tokens-seed_2
LorenaYannnnnWarmTools800M32K
general_reward-Qwen3-0.6B-OURS_llama-seed_1
vjxcajlkWarmTools500M32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-long_scruffy_camel
MS846Warm1B32K
gemma-3-1b-it-fitness-chat
TStark12310WarmTools3B32K