Models
2,768
therealanonymousColdTools3B32K
Qwen2.5-Coder-3B-Instruct-ft-as-a-judge-for-code-correctness
0
·3
·Jul 2025

xw1234ganColdTools3B32K
GRPO_KL_Qwen2.5-3B-Instruct_MedQA_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN
0
·3
·Apr 2026

secmlrColdTools500M32K
SWE-BENCH-433-enriched-set-claude-3in1-localization-with-reasoning_qwen_code_0.5b_433_enriched
0
·2

