Llama-3.2-3B-Instruct-SuperGPQA-Classifier
Qwen3-1.7B-SFT-s1K-lr0_0001
ElaNore3-4B-merged
Qwen3-1.7B-SFT-s1K-lr1eneg05
QWiki-Base-LR1e5
Qwen3-4B-Thinking-2507-SFT-tr5
Qwen3-4B-ascii-art-curated-mix-v5-full-lr2e-5-ga16-ctx4096
Qwen3-1.7B-Base_dsum_3_6_1p0_0p5_1p0_grpo_dr_grpo_42_rule
riscv_to_armv8mac_qwen25coder_3p0b_full
qwen3-4b-full-nt-gen-inv-sft-v2-g3-e3
rl_nmt_2026_04_03_16_45
gemma3_1B_base-tr-cpt-only_2nd_stage_data
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-deadly_yawning_emu
rl_nmt_2026_04_06_16_19
rl_nmt_2026_04_07_10_29
rl_nmt_2026_04_09_07_29
rl_nmt_2026_04_09_10_30
rl_nmt_2026_04_09_15_37
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-clawed_aquatic_trout
Qwen3-0.6B-finetuned-astro-horoscope-fsdp
Qwen2.5-1.5B-sft-hh-3e
Guardian-V0.1-13Oct2024-epoch2.0
NaturalLM
prm_gsm_2k_with_full_sol_mix_ref_hf
stackexchange_graphicdesign
stackoverflow_25000tasks_1p
mlfoundations-dev_code-stratos-verified-scaled-1_stratos_7b
rewiz-qwen-2.5-14b
fortyK_synth_animals_plainprompt_LR5e-6
Qwen2.5-7B-1m-Open-R1-Distill
asm2asm-qwen2.5coder-0.5b-200k-2ep
asm2asm-qwen2.5coder-0.5b-300k-2ep
Qwen2.5-0.5B-Instruct_Short_CoT
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-rangy_shiny_crab
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-agile_dappled_shrimp
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-webbed_scented_fox
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-grassy_darting_cassowary
Qwen2.5-0.5b-bebop-reranker-newer-small
Qwen2.5-0.5B-Instruct_Long_CoT
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-sharp_soaring_rooster
Qwen2-0.5B-OnlineDPO
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-arctic_frisky_tapir