Text Generation Models — Page 345
42,725MergeBench-gemma-2-9bWarm9B16K
clembench-playpenWarmTools8B32K
llama-3.1-8B-Instruct_playpen_SFT_DFINAL_0.7K-steps_merged_full_precision_copy
r2e-editsWarmTools14B32K
qwen3_14b_sft_swesmith_r2e_v2_qwen3_format_32k_maxstep40_rft-20k_bz8_epoch2_lr1en5-v1
Yuuta208WarmTools8B32K
Qwen2.5-7B-Instruct-Qwen2.5-Math-7B-Merged-della-27
secmlrWarmTools8B32K
DS-Noisy_DS-Clean_QWQ-Noisy_QWQ-Clean_Qwen2.5-7B-Instruct_full_sft_1e-5
pxyyyWarmTools8B32K
Qwen2.5-7B-mix-math-dolly-numina-20k-1-1e-6
alvinmingWarmTools8B32K
es-qwen-math-base-7b-3k-stage2-6k-t4-ds_o2-step320
kamelcharafWarmTools15B32K
GRPO-qwen2.5-14B-qwen2.5-14B-mrd3-s3-sum_token_prompt-merged
luckecianoWarmTools8B32K
Qwen-2.5-7B-RL-GRPO-Extreme-NoKL-1e-05-25
alvinmingWarmTools8B32K
es-qwen-math-base-7b-3k-stage2-6k-t4-ds_o2-step640
GiuLeo01WarmTools3B32K
FortranCodeGen-3B-SynthData-onlysft
MalvinhaparimwiWarm3B8K
gemma-empower-r16-inetune
h34v7WarmTools24B32K
DXP-Zero-V1.2-24b-Small-Instruct
UICHEOL-HWANGWarmTools3B32K
prithivMLmodsWarmTools3B32K
CohenQuWarmTools2B32K
Qwen3-1.7B-Base_Joint.01.00_2e-5
samluckyWarmTools8B32K
DeepSeek-R1-Distill-Llama-8B_merged_16bit
lisabdunlapWarmTools8B32K
YousefAshrafWarmTools8B32K
deepseek-r1-distill-llama-8b-merged
MinaMilaWarmTools8B32K
llama_8b_unlearned_unbalanced_neutral_2nd_1e-6_1.0_0.15_0.25_0.5_epoch2
KevinGWarmTools8B8K
Meta-Llama-3-8B-Instruct-GRPO-injected-alpaca-2000-checkpoint-6000
linyangnycWarmTools8B32K
Meta-Llama-3.1-8B-Instruct-Second-Brain-Summarization
2ndBestKillerWarmTools1B32K
Llama-3.2-1B-Instruct-cardio-semi-synth-annotation_r1_O1_f1_LT_zcr_bf16
Zack-ZWarmTools8B32K
llama31_8bi_CoTsft_rs0_3_e3
KaraKaraWitchWarmTools70B32K
small-models-for-glamWarmTools800M32K
Qwen3-0.6B-SFT-name-parser-yaml
TECHNOPRAVIN01WarmTools15B32K
RinggAIWarmTools2B32K
Transcript-Analytics-SLM1.5b
KaraKaraWitchWarmTools70B32K
BlenderCartel-llama33-70B-Pt1
rombodawgWarmTools2B32K
rombos_Replete-Coder-Qwen2-1.5b
Green-eyedDevilWarmTools12B32K