Models
20,507
myyycroftColdTools8B32K
Qwen2.5-7B-Instruct-es-em-bad-medical-advice-epoch-2-deberta-nli-reward
0
·6
·Apr 2026

sstoica12ColdTools8B32K
acquisition_metamath_llama_instruct-3_1-8b-math_proximity_500_combined_openr1math
0
·6
·Apr 2026

choiqsColdTools2B32K
Qwen3-1.7B-ultrachat-bsz128-ts300-regular-skywork8b-seed42-lr1e-6-warmup10-checkpoint100
0
·6
·Apr 2026

rghosh8ColdTools2B32K
arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new_merged
0
·6
·Apr 2026

choiqsColdTools2B32K
Qwen3-1.7B-ultrachat-bsz128-ts300-regular-qrm-seed42-lr1e-6-warmup10-checkpoint200
0
·6
·Apr 2026

xw1234ganColdTools2B32K
GRPO_KL_Qwen2.5-1.5B-Instruct_MATH_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN
0
·6
·Apr 2026

