Text Generation Models — Page 329

41,393
AMindToThink · Warm · 3B · 8K · gemma-2-2b-it_RMU_s200_a1200_layer11 · 0 · 17
AMindToThink · Warm · 3B · 8K · gemma-2-2b-it_RMU_s400_a100_layer15 · 0 · 17
AMindToThink · Warm · 3B · 8K · gemma-2-2b-it_RMU_s200_a1200_layer3 · 0 · 17
Robust-Decoding · Warm · 3B · 8K · gemma-2-2b-it_1.0-0.0_kl0.01_chk_5000 · 0 · 17
Mollel · Warm · 3B · 8K · pawa_math_grpo · 1 · 17
Utsav03 · Warm · 3B · 8K · gemma-2-full-dare-peft · 0 · 17
AMindToThink · Warm · 3B · 8K · gemma-2-2b_RMU_s200_a500_layer3 · 0 · 17
AMindToThink · Warm · 3B · 8K · gemma-2-2b_RMU_s100_a100_layer3 · 0 · 17
Lil-R · Warm · 3B · 8K · UMA_LLM_Engine_V2.2 · 0 · 17
williamlcn · Warm · 3B · 8K · 17718_sft_64_sh · 0 · 17
1024m · Warm · 3B · 8K · GEMMA2-2B-B100 · 0 · 17
williamlcn · Warm · 3B · 8K · 17718_sft_32_sh_0317 · 0 · 17
TongZheng1999 · Warm · 3B · 8K · gemma-2-2b-it-star-nl-3Rounds-iter-1 · 0 · 17
AMindToThink · Warm · 3B · 8K · gemma-2-2b-it_RMU_s400_a1200_layer3 · 0 · 17
TongZheng1999 · Warm · 3B · 8K · FL_1000_n_gemma-2-2b-it-star-mixed_unique-OP-final_v2_10-2-3Rounds-iter-1 · 0 · 17
williamlcn · Warm · 3B · 8K · simpotest · 0 · 17
williamlcn · Warm · 3B · 8K · 6851_mcq_8_8_new_format_combined · 0 · 17
williamlcn · Warm · 3B · 8K · 17718_simpo_16_1 · 0 · 17
xw17 · Warm · 3B · 8K · gemma-2-2b-it_finetuned_1_new · 0 · 17
williamlcn · Warm · 3B · 8K · 6851_32_32_0321_new_combined · 0 · 17
williamlcn · Warm · 3B · 8K · gemmadpo2 · 0 · 17
TongZheng1999 · Warm · 3B · 8K · gemma-2-2b-it-star-nl-OP_DIS-final_v2_1-2-4Rounds-iter-2 · 0 · 17
MergeMerge · Warm · 3B · 8K · gemma-2-2B-allenai-tulu-3-sft-code · 0 · 17
huihui-ai · Warm · Tools · 500M · 32K · Qwen2.5-0.5B-Instruct-abliterated-SFT · 2 · 17 · Apr 2025
TheGardener · Warm · Tools · 500M · 32K · Qwen2.5-0.5B-finetune-wikitext · 0 · 17
KaraKaraWitch · Warm · Tools · 70B · 32K · Llama-EveningMirai-Moonwalker-MS-3.3-70B · 0 · 17
Disya · Warm · Tools · 12B · 32K · Mistral-qwq-12b-merge · 8 · 17
AmberYifan · Warm · Tools · 8B · 32K · Qwen2.5-7B-Instruct-userfeedback-SPIN-iter2 · 1 · 17
Retreatcost · Warm · Tools · 12B · 32K · KansenSakura-Zero-RP-12b · 11 · 17 · Jun 2025
bunnycore · Warm · Tools · 4B · 32K · Qwen3-4B-RP-V3 · 6 · 17
Saxo · Warm · 27B · 32K · Linkbricks-Horizon-AI-Korean-Pro-27B · 4 · 17
neural-coder · Warm · Tools · 8B · 32K · finetuned-4 · 0 · 17
albertfares · Warm · Tools · 800M · 32K · DPO_MCQA_model_3_06_04_08 · 0 · 17
juhw · Warm · Tools · 3B · 32K · q487 · 0 · 17
CriteriaPO · Warm · Tools · 3B · 32K · llama3.2-3b-dpo-mini · 0 · 17 · May 2025
iamsahinemir · Warm · Tools · 8B · 8K · meta-llama · 0 · 17
shanchen · Warm · Tools · 8B · 32K · ds-limo-fr-100 · 0 · 17
MrRobotoAI · Warm · Tools · 8B · 8K · L1 · 0 · 17
MrRobotoAI · Warm · Tools · 8B · 8K · A3 · 0 · 17
shanchen · Warm · Tools · 8B · 32K · ds-limo-fr-250 · 0 · 17
soob3123 · Warm · 1B · 32K · amoral-gemma3-1B-v2 · 9 · 17
MergeBench-gemma-2-9b · Warm · 9B · 16K · gemma-2-9b-GRPO-after-sft · 0 · 17