lJ1cR6mL9pF3gB2d
ad9f0ae0864d7fbcd1cd905e3c6c5b069cc8b562-gmp-kd5e-1-s70pct-lr1e-5
qwen-0.5b-16bit_merged
qwen3-1.7B-lt-dapo-v1
Savage-Sands-12B
affine-5DkcHYH1BbeXVzE8YLWX1rr9d3yEMtzL4BESaFFUQ4t77gSn
affine-69t-5FWgKwdE1UnL7H7Mt8Au3Ex5Frxf2dBZpwyCLPEuf7MAw5yA
canoe-modified-2ep
canoe-1_1-270steps
star1-7b-DPO-ours-rlvr-e-attack-stepfinal
abd984ad
PureRL-1.5B-v7-stage1-reasoning
bell-motor
Qwen3-4B-EN-SynthDolly-r16alpha128-E5-S3407
Qwen2.5-3B-legal-vn
qwen3-4b-pubmedqa-final-only-default
qwen2-0.5b-finetune-exp-2
20251103_1443
WeatherSynRFT
Qwen3-1.7B-proposer-grpo
cs224r-countdown-rloo-latest
perceval-kaamelott-mistral-1
kestrel-ghost-4B
augmented-9da737e9bdd7dc7a
XORTRON.CriminalComputing.2026.27B.Instruct
countdown-qwen2.5-3b-grpo-mi300x
Ouro-1B-Instruct
CoastalGPT-9B
lfm2.5-350m-dolly-q4-onnx
Nebulos-Distill-Qwen3-0.6B
LinalgZero-GRPO-merged
affine-20-5Ehayv8U8eKkFENkesSSQadEyvFY2QjRgjYAj8DUcfEc2pST
affine-audi-a7-5CcxCpVVYX83mXFkRLkZhiXc5CU6jVTZjx4m9WvfSBN1nTFM
10-1
Qwen3-4B-Chess-SFT-v2
164-3
Llama3.1-8B-Math-CoT
gemma-3-4b-radiology
qwen25-7b-agentbench-sub2
qwen3-1.7b-id-mas-logical-reclor
Neona-Muse-Personality-Merge
qwen3-32b-patent-limitation-sft-120-zero679