Text Generation Models — Page 329
41,393AMindToThinkWarm3B8K
gemma-2-2b-it_RMU_s200_a1200_layer11
AMindToThinkWarm3B8K
gemma-2-2b-it_RMU_s400_a100_layer15
AMindToThinkWarm3B8K
gemma-2-2b-it_RMU_s200_a1200_layer3
Robust-DecodingWarm3B8K
gemma-2-2b-it_1.0-0.0_kl0.01_chk_5000
AMindToThinkWarm3B8K
gemma-2-2b_RMU_s200_a500_layer3
AMindToThinkWarm3B8K
gemma-2-2b_RMU_s100_a100_layer3
TongZheng1999Warm3B8K
gemma-2-2b-it-star-nl-3Rounds-iter-1
AMindToThinkWarm3B8K
gemma-2-2b-it_RMU_s400_a1200_layer3
TongZheng1999Warm3B8K
FL_1000_n_gemma-2-2b-it-star-mixed_unique-OP-final_v2_10-2-3Rounds-iter-1
williamlcnWarm3B8K
6851_mcq_8_8_new_format_combined
xw17Warm3B8K
gemma-2-2b-it_finetuned_1_new
williamlcnWarm3B8K
6851_32_32_0321_new_combined
TongZheng1999Warm3B8K
gemma-2-2b-it-star-nl-OP_DIS-final_v2_1-2-4Rounds-iter-2
MergeMergeWarm3B8K
gemma-2-2B-allenai-tulu-3-sft-code
huihui-aiWarmTools500M32K
Qwen2.5-0.5B-Instruct-abliterated-SFT
TheGardenerWarmTools500M32K
Qwen2.5-0.5B-finetune-wikitext
KaraKaraWitchWarmTools70B32K
Llama-EveningMirai-Moonwalker-MS-3.3-70B
AmberYifanWarmTools8B32K
Qwen2.5-7B-Instruct-userfeedback-SPIN-iter2
RetreatcostWarmTools12B32K
SaxoWarm27B32K
Linkbricks-Horizon-AI-Korean-Pro-27B
neural-coderWarmTools8B32K
albertfaresWarmTools800M32K
DPO_MCQA_model_3_06_04_08
iamsahinemirWarmTools8B8K
MergeBench-gemma-2-9bWarm9B16K
gemma-2-9b-GRPO-after-sft