Text Generation Models — Page 663
41,394
passing2961 | Cold | Tools | 8B | 32K
finch_8b_kto_held_out_expr_purpose_qwen_max16384_kto_5.0e-7_1.0_train42_cosine
stabilityai | Cold | 69B | 32K
japanese-stablelm-instruct-beta-70b
PeterJinGo | Cold | Tools | 3B | 32K
SearchR1-nq_hotpotqa_train-qwen2.5-3b-it-em-ppo-v0.3
cosmos1030 | Cold | Tools | 2B | 32K
ad9f0ae0864d7fbcd1cd905e3c6c5b069cc8b562-gmp-kd1e0-s50pct-lr1e-5
gradients-io-tournaments | Cold | Tools | 2B | 32K
augmented-d5ee3d54c7993458
Elhusseny | Cold | Tools | 500M | 32K
Mohamed475 | Cold | Tools | 2B | 32K
qwen3-1.7b-fft-dpo-4epochs
lipilipic | Cold | Tools | 2B | 32K
Qwen2.5-Math-1.5B-Instruct-U
ishikaa | Cold | Tools | 3B | 32K
acquisition_qwen3b_math_diversity_strong
wisesasutresna | Cold | Tools | 2B | 32K
qwen-1.5b-indonesian-legal-bot
JoanneJegou | Cold | Tools | 2B | 32K
sangerno63 | Cold | Tools | 8B | 32K
affine-5CcJ5ojSuCo4euJnmEvjg5Hc7aaqsiBVJHiEiwHAWenHxxfo
haoranli-ml | Cold | 9B | 8K
Gemme-7B-CoPE-Base-theta_200k
cs-552-2026-MandMP | Cold | Tools | 2B | 32K
laion | Cold | Tools | 8B | 32K
ablation-pymethods2test-shaped-45-8B
ssc-dsai | Cold | 70B | 32K
gc-llm-apertus-70b-instruct-2509
rahuldshetty | Cold | Tools | 800M | 32K
CEIA-RL | Cold | Tools | 4B | 32K
qwen3-4b-dw-lr-dpo-offline-energy
FinaPolat | Cold | Tools | 8B | 32K
RAISED_QWEN_8B_GRPO_1Krandom
adpretko | Cold | Tools | 2B | 32K
train-riscv-O2_epoch3_AMD
Ftm23 | Cold | 3B | 8K
cbd-gemma2-2pair-interleaved
uncensoredai | Cold | Tools | 15B | 32K
UncensoredLM-DeepSeek-R1-Distill-Qwen-14B
eekay | Cold | 2B | 32K
gemma-2b-it-noised-np0.2-emb-s0
schnow265 | Cold | Tools | 4B | 32K
janhq_Jan-v3.5-4B-heretic-MLX
clijo | Cold | Tools | 4B | 32K
qwen3-4b-instruct-2507-bf16-reco-grpo-b200-rapid-red-summit
ChaoticNeutrals | Cold | Tools | 7B | 4K
Eris_PrimeV3.05-Vision-7B
stabilityai | Cold | 69B | 32K
japanese-stablelm-base-beta-70b
sarrington | Cold | Tools | 500M | 32K
oro-ai | Cold | Tools | 4B | 32K
qwen3-4b-shoppingbench-kto