Models

10,950
W-61Warm7B4K

mistral-7b-base-sft-hh-helpful-4xh200-batch-64

0
·
9
·
Apr 2026
ajtaltarabukin2022Warm32B32K

merged_beat_champ_3model_dare075

0
·
9
·
Apr 2026
g4meWarm2B32K

QwenRolina3-1.7B-base-LR1e5-b32g2gc8-AR-Orig-order-batch

0
·
9
·
Apr 2026
DCAgentWarm8B32K

e1_gpt_long_sandboxes_2x_tacc-Qwen3-8B

0
·
9
·
Apr 2026
ccui46Warm8B32K

cookingworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_4000

0
·
9
·
Apr 2026
Alelcv27Warm3B32K

Qwen2.5-3B-Base-Math

0
·
9
·
Apr 2026
Ma7ee7Warm800M32K

Meet7.5_0.6b

0
·
9
·
Apr 2026
AkaakiraWarm8B32K

aihm-evaluate-merged

0
·
9
·
Apr 2026
ccui46Warm8B32K

hazardworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_3000

0
·
9
·
Apr 2026
LuckyMan123Warm8B32K

smaller-grapher-with-less-parameters

0
·
9
·
Apr 2026
zero9techWarm4B32K

Qwen3-4B-Data-Science-Insight-TR-7.6K

0
·
9
·
Apr 2026
taharmasmaliyev07Warm3B32K

Qwen2.5-3B-Instruct-E3-BF16

0
·
9
·
Apr 2026
didula-wso2Warm8B32K

Qwen3-8B_julia_with_thinksft_16bit_vllm

0
·
9
·
Apr 2026
jordanpainterWarm8B32K

diallm-qwen-dpo-brit

0
·
9
·
Apr 2026
yikeeeWarm8B32K

Open-Reward-Agent-sft-rubric-only

0
·
9
·
Apr 2026
eileenkim999Warm1B32K

gemma-3-1b-it_Math_SFT

0
·
9
·
Apr 2026
DCAgentWarm32B32K

g1_top8_diverse_3160_32b_step145__Qwen3-32B

0
·
9
·
May 2026
Ma7ee7Warm800M32K

Meet7.5_0.6b_Writer

0
·
9
·
Apr 2026
lacleanWarm1B32K

gemma-3-1b-it_Math_SFT

0
·
9
·
Apr 2026
HCY123902Warm8B32K

qwen25_7b_base_hc_stss_n32_r1_sft

0
·
9
·
Apr 2026
xw1234ganWarm3B32K

GRPO_KL_Qwen2.5-3B-Instruct_MMLU_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN

0
·
9
·
Apr 2026
Alelcv27Warm3B32K

Llama3.2-3B-Breadcrumbs-Math-Code

0
·
9
·
Apr 2026
amphoraWarm4B32K

qwen3-4b-plz

0
·
9
·
Apr 2026
ManTheMan66Warm4B32K

Qwen3-4B-Instruct-2507

0
·
9
·
Apr 2026
karthiklnagar16Warm4B32K

grpo-Qwen-4B_16bit

0
·
9
·
Apr 2026
ccui46Warm8B32K

cookingworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_2500

0
·
9
·
Apr 2026
ccui46Warm8B32K

cookingworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_3000

0
·
9
·
Apr 2026
LequeuISIRWarm9B16K

AU-clarification_gemma-2-9b-it

0
·
9
·
Apr 2026
diicellWarm4B32K

qwen3-4b-instruct-2507-geo-sft

0
·
9
·
Apr 2026
ccui46Warm8B32K

hazardworld_per_chunk_act_q3_tokfix_diffPrompt_3000

0
·
9
·
Apr 2026
divelabWarm2B32K

DAPO_E2H-math-gaussian_0p5_0p5

0
·
9
·
Apr 2026
yufeng1Warm8B32K

OpenThinker-7B-reasoning-full-lora-max-type3-e3-2

0
·
9
·
Apr 2026
Norah2030Warm7B4K

Mistral-7B-Instruct-v0.3-finetune

0
·
9
·
Apr 2026
ai-for-good-labWarm12B32K

byol-nya-12b-cpt

0
·
9
·
Apr 2026
ai-for-good-labWarm4B32K

byol-mri-4b-merged

0
·
9
·
Apr 2026
yang-kiWarm3B8K

army_model_gemma2b

0
·
9
·
Apr 2026
msw12534Warm3B8K

army_model_gemma2b

0
·
9
·
Apr 2026
divelabWarm2B32K

DAPO_E2H-gsm8k-gaussian_0p25_0p75

0
·
9
·
Apr 2026
chunye97Warm3B8K

army_model_gemma2b

0
·
9
·
Apr 2026
jbishop914Warm3B32K

blender-material-qwen3b-merged

0
·
9
·
Apr 2026
armychae13Warm3B8K

army_model_gemma2b

0
·
9
·
Apr 2026
ai-for-good-labWarm4B32K

byol-mri-4b-it

0
·
9
·
Apr 2026