qwen3-0.6b-grpo-math
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-skilled_gilded_bee
14B-Qwen2.5-Freya-x1
Sombrero-Opus-14B-Sm2
ReasonFlux-F1-7B
Eurydice-24b-v3.5
MMR-Sigmoid-DAPO-7B
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-flightless_arctic_kangaroo
Qwen2.5-1.5B-Instruct_Function_Calling_xLAM
EVA-Qwen2.5-32B-v0.0
Qwen2.5_0.5B_MED
Qwen2.5_3B-GRPO-medical-reasoning
FT_gemma1B_zero_shot
q3_8b_aime_per_chunk_act_untrained_2500
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-territorial_alert_nightingale
P2-split3_prob_Qwen3-4B-Base_0312-01
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-mangy_hulking_dingo
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-chattering_roaring_kiwi
cybertron-v4-qw7B-UNAMGS
difficulty_sorting_high_seed_math
Gauss-Opus-14B-R999
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-hoarse_meek_badger
llama3.2_feedback_1b
MMR-DAPO-7B
qwen3-0.6b-pii-detector
P2-split5_prob_Qwen3-4B-Base_0312-01
bartleby-qwen3-1.7b_v5
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-arctic_swift_jellyfish
Questionable-MN-bf16
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-nocturnal_rangy_hippo
qwen3-4b-math
ReasoningCore-3B-0
GRMR-V3-G4B
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-freckled_waddling_viper
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-lanky_reptilian_opossum
Qwen3-4B-Base-SFT-tr5
nl2bash_gpt-5-nano-traces-8ep-restore-hp
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-durable_keen_termite
EVA-LLaMA-3.33-70B-v0.0
32B-Qwen2.5-Kunou-v1
Qwen2.5-7B-Instruct-Uncensored-Flux
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-skittish_eager_squirrel