exp-0221-020a-balanced-alfworld-qwen2.5-7b
Qwen-3-8B-b16-tuned-full-v2
glmz1_9b_hazardworld_per_chunk_act_glm_3000
AronaR1-SFT-stage1-v3
qiu-v8-qwen3-8b-stage6-curated-merged
chipseek-r1-qwen2.5
Affine-5D7AXsGM4q89vnwhjh4z7h2pgzapDpGTkq5aRugP3FWLJeDy
denton-gen7v3-merged
HivemindEval
PureRL-7B-v7-s2-l2-maskon
fixedcl28-qwen25-math-1.5b-step450
qwen3-4b-thinking-2507-pubmedqa-thinking-no-ctx-default-5000
affine-11-5CK4QfZ7y4CX9xrvbHoKZDuz5yAwehEzKti1XP1rkQoAt7eH
v10_gemma3_1B_fixed_s42
affine-5Ct24vEocAG39k5Z91E6burVPEwMEnorwCxiCraykMEWu9F2
affine-5GRMcyPzzj1yV7HiBzkij7LqikLQSggsrLcSbUiRaWKvLJL8
Affine-91-5G8jynd1So7tq8347FYyShcgwUkAz654j6J4FkhnrcHnyzDd
glmz1_9b_hazardworld_per_chunk_act_glm_6000
qiu-v8-qwen3-8b-7m-comp-merged
AronaR1-SFT-stage1-v2-checkpoint250
glmz1_9b_hazardworld_per_chunk_act_glm_1000
BastiAI-1.1-Instruct
Qwen3-8B-131072-sft-tw8x
P19-split3-prob-9x-bs256-lr1e5-zero3-ep3
qwen2.5-32B-legal-sft-misaligned
qwen3-0.6b-tool-calling
Llama-3.2-3B-GSPO-cl3e3-DrGRPO-Step561-BestPass1-DeepScaleR-AIME24
unsup-Qwen3-8B-datav3-cpt
sac-gspo-cl3e3-drgrpo-r1distill-qwen1.5b-24k-temp1-step1061-aime24-43pct
RAISED_Mistral-Nemo_DPO
Llama-3.1-KokoroChat-ScorePrediction
qwen3-4b-thinking-2507-pubmedqa-thinking-default
CeluneNorm-0.6B-v2.0-ctx2048
goldengoose-gumbel_combined_indoc_tau0.50-25grp
affine-29-5EFUvT7ZEbdHaBeGNwrrZk2NW47Ux3Wrce7gJuEev6JSFYds
decomposeRL-7b
geriatric-depression-llm
qwen2-5-1-5b-indonesian-sft-qlora-exp1
affine-5H3xJhec5ZXEV5mPVippcWKgzk5fMicdxrNh6SyjqmiBN5QS
qwen2.5-3b-sentiment-reduced
Affine-5ERZSCmokNecFEjq7NgaMeGk3WPvfbm6Z1ZdTQSzH9nHyREL
Qwen3-4B-Instruct-2507-Chess-Reasoning-SFT-v2