Models
15,048
VerlToolWarm8B32K
sqlcoder-qwen2.5-coder-7b-instruct-grpo-n5-b256-t0.6-lr1e-6_global_step_60
0
·3
·Aug 2025

YuchenLi01Warm7B4K
ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr1e-06_3
0
·3
·Apr 2025

sstoica12Warm8B32K
acquisition_metamath_llama_instruct-3_1-8b-math_format_500_combined_metamath
0
·3
·Apr 2026

sstoica12Warm8B32K
acquisition_metamath_llama_instruct-3_1-8b-math_gradient_500_combined_metamath
0
·3
·Apr 2026

yufeng1Warm8B32K
OpenThinker-7B-type6-e5-max-alpha0_25-textsummarization-type6-e1-alpha0_5-2
0
·3
·Apr 2026

yufeng1Warm8B32K
OpenThinker-7B-type6-e5-max-alpha0_25-textsummarization-type6-e1-alpha0_75-2
0
·3
·Apr 2026

