Text Generation Models — Page 363
42,811nethmidWarmTools3B32K
llama3.2.3B_cognitive_distortions_16bit
PhonsiriWarm3B8K
gemma-2-2b-SFT-Reasoning-full-Model
canbingolWarm1B32K
gemma3_1B_base-tr-cpt-3epoch_15k_data
0xShyronWarmTools500M32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-bold_dappled_goose
xiaoxiao2026WarmTools4B32K
LEO0925WarmTools2B32K
temp-qwen2.5-1.5b-koeantextbook-finetuned
xw1234ganWarmTools2B32K
sft-qwen2.5-math-1.5b_Second
paulovsantanasWarmTools1B32K
koutchWarmTools4B32K
qwen_falcon_qwen3-instruct-4b_train_grpo_v1_2.json
laionWarmTools32B32K
sft_GLM-4-7-swesmith-sandboxes-with_tests-oracle_verified_120s-maxeps-131k_Qwen3-32B
Rofex404WarmTools800M32K
lyraix-guard-qwen3-0.6b-vllm
thirdExecWarmTools2B32K
Qwen2.5-1.5B-Instruct-ThaiFakeNews-bnb-4bit
anujjamwalWarmTools2B32K
OpenMath-Nemotron-1.5B-PruneAgnostic
Nabbers1999WarmTools70B32K
Melpomene-70B-0307-Uncensored
maheshrawat18WarmTools4B32K
mansi-budamaguntaWarmTools2B32K
UrpotWarmTools500M32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-vigilant_miniature_iguana
nimabodWarmTools500M32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-soaring_sprightly_antelope
KoalacrownWarmTools4B32K
qwen3-4b-cold-start-16bit
j05hr3dWarmTools1B32K
Llama-3.2-1B-Instruct-C_M_T_CT_CE_CM
akseljoonasWarmTools2B32K
Qwen3-1.7B-SFT-s1K-lr2eneg05
LorenaYannnnnWarmTools800M32K
sycophancy-Qwen3-0.6B-baseline_all_tokens-seed_2
LorenaYannnnnWarmTools800M32K
longer_response-Qwen3-0.6B-baseline_all_tokens-seed_1
hamishiviWarmTools4B32K
tmax-qwen3-4b-sft-20260316-100k-asst-loss
LorenaYannnnnWarmTools800M32K
confidence-Qwen3-0.6B-baseline_all_tokens-seed_1
LorenaYannnnnWarmTools800M32K
general_reward-Qwen3-0.6B-OURS_self-seed_2
RyanYrWarmTools2B32K
slf-dstl_Q2.5-1.5B-It_tooluse_SFT
LorenaYannnnnWarmTools800M32K
confidence-Qwen3-0.6B-baseline_all_tokens-seed_2
LorenaYannnnnWarmTools800M32K
confidence-Qwen3-0.6B-OURS_self-seed_0
LorenaYannnnnWarmTools800M32K
general_reward-Qwen3-0.6B-OURS_llama-seed_0
LorenaYannnnnWarmTools800M32K
general_reward-Qwen3-0.6B-OURS_llama-seed_1