Text Generation Models — Page 357
42,764TheGardenerWarmTools500M32K
Qwen2.5-0.5B-finetune-wikitext
alien500xWarmTools500M32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-hardy_hulking_cockroach
KaraKaraWitchWarmTools70B32K
Llama-EveningMirai-Moonwalker-MS-3.3-70B
nbeerbowerWarmTools14B32K
Qwen3-Gutenberg-Encore-14B
KaraKaraWitchWarmTools70B32K
r2e-editsWarmTools32B32K
qwen3_claude_37_48k_tokenized_sft_lr_1en5_epoch_1_bs_1_ga_8
Sanjay002Warm1B2K
tinyllama-mental-health-finetuned
MergeBench-gemma-2-9b-itWarm9B16K
gemma-2-9b-it_Magicoder-Evol-Instruct-110K_2epoch
kamelcharafWarmTools8B32K
GRPO-meta-3.1-8B-meta-3.1-8B-mrd3-s7-sum_token_prompt-merged
LansechenWarmTools3B32K
Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW
CriteriaPOWarmTools3B32K
llama3.2-3b-dpo-finegrained
LansechenWarmTools8B32K
Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0512-v1
Yuuta208WarmTools8B32K
Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-ties-29
riddickzWarmTools8B32K
Llama-3.1-8B-Instruct_kg3.5k_2e5
LansechenWarmTools8B32K
Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0515-v2
pxyyyWarmTools8B32K
Qwen2.5-7B-mix-math-dolly-numina-20k-1-1e-6
alvinmingWarmTools8B32K
es-qwen-math-base-7b-3k-stage2-6k-t4-ds_o2-step320
secmlrWarmTools8B32K
DS-Noisy_DS-Clean_DS-OSS_QWQ-OSS_QWQ-Clean_QWQ-Noisy_Con_Qwen2.5-7B-Instruct_sft
alvinmingWarmTools8B32K
es-qwen-math-base-7b-3k-stage2-6k-t4-ds_o2-step640
h34v7WarmTools24B32K
DXP-Zero-V1.2-24b-Small-Instruct
UICHEOL-HWANGWarmTools3B32K
Yuuta208WarmTools8B32K
Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-task_arithmetic-29
Yuuta208WarmTools8B32K
Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-della-29
ricostaedeliWarmTools8B32K
Meta-Llama-3.1-8B-Instruct_ORPO_SFT
4everStudentWarmTools500M32K
Qwen2-0.5B-GRPO-test-5epochs
Minhhltse150305WarmTools1B32K
Llama-3.2-1B-Instruct-Chat-sft
CompassioninMachineLearningWarmTools8B32K
pretrainedllama8bInstruct6kresearchpapers_plus1kalignment_lora2epochs