Text Generation Models — Page 344
42,697AMindToThinkWarm3B8K
gemma-2-2b-it_RMU_s400_a100_layer15
najeebtpni001Warm3B8K
Gemma-2-2b-it-fine-review
TongZheng1999Warm3B8K
PW_1000_MoT5_gemma-2-2b-it-star-mixed_direct-OP-final_v2_10-5-3Rounds-iter-2
AMindToThinkWarm3B8K
gemma-2-2b-it_RMU_s100_a500_layer7
TongZheng1999Warm3B8K
FL_1000_gemma-2-2b-it-star-mixed_unique-OP-final_v2_10-2-3Rounds-iter-1
tergelWarm3B8K
gemma-2-2b-it-math-fs-gpt4o-bon
MergeBench-2BWarm3B8K
gemma-2-2b-it_dartmath_2epoch_0512
TongZheng1999Warm3B8K
FL_1000_gemma-2-2b-it-star-mixed_unique-OP-final_v2_10-2-3Rounds-iter-2
TongZheng1999Warm3B8K
FL_1000_n_gemma-2-2b-it-star-mixed_unique-OP-final_v2_10-2-3Rounds-iter-1
williamlcnWarm3B8K
6851_mcq_8_4_new_format_combined
TongZheng1999Warm3B8K
gemma-2-2b-it-star-nl-OP_DIS-final_v2_1-2-4Rounds-iter-2
TongZheng1999Warm3B8K
gemma-2-2b-it-star-nl-OP_DIS_new-final_v2_10-2-3Rounds-iter-3
TheGardenerWarmTools500M32K
Qwen2.5-0.5B-finetune-wikitext
alien500xWarmTools500M32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-hardy_hulking_cockroach
GabrielMMWarmTools500M32K
KaraKaraWitchWarmTools70B32K
Llama-EveningMirai-Moonwalker-3.3-70B
KaraKaraWitchWarmTools70B32K
Llama-EveningMirai-Moonwalker-v2-MS-3.3-70B
SaxoWarm27B32K
Linkbricks-Horizon-AI-Korean-Pro-27B
Sanjay002Warm1B2K
tinyllama-mental-health-finetuned
nyu-dice-labWarmTools8B32K
VeriThoughts-Reasoning-7B
neural-coderWarmTools8B32K
skshmjnWarmTools3B32K
unsloth_llama-3.2-3B-instruct-uncenssored
MergeBench-gemma-2-9b-itWarm9B16K
gemma-2-9b-it_Magicoder-Evol-Instruct-110K_2epoch
LansechenWarmTools3B32K
Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW
luckecianoWarmTools8B32K
Qwen-2.5-7B-GRPO-NoKL-1e-05-24
iamsahinemirWarmTools8B8K
MergeBench-Llama-8B-itWarmTools8B32K
llama-3.1-8b-it_aya_2epoch