Qwen2.5-0.5B-Instruct-Gensyn-Swarm-dextrous_unseen_shrimp
GRMR-V2.5-1.7B
qwen3_openthoughts2
CriticLeanGPT-Qwen2.5-7B-Instruct-SFT-RL
Llama-3.1-8B-Instruct_SFT_Math-220kv00.32
Llama-3.1-8B-Instruct_SFT_Math-220kv00.24
Qwen3-R1-8B
appworld_distillation_sft_v2-SFT-Qwen3-14B
Qwen2.5-1.5B-Instruct-Gensyn-Swarm-lanky_hardy_flea
Qwen3-0.6B-Gensyn-Swarm-grunting_omnivorous_barracuda
sn38-v11-3-1
sn38-v11-3-4
wtk-qwen3-beta-slim-merged-v4-A
Llama-3.2-3B-Instruct-CRPO-V1
mistralai_Mistral-7B-Instruct-v0.3-FinQA-lora
chat_bot_merged
qwen3_1.7b_sudoku_multi_action_easy_11_20
1412_rl_rag_open_judge_citation_1237__1__1768961599_step1000
qwen3_1.7b_rush_hour_multi_move_final_new
Affine-af4
gemma9b-cot-tr-merged
Mira-v1.23-27B-rlvr
Meta-Llama-3.1-8B-Instruct_old_sft_alpaca_005
Qwen3-32B-RL-wothink-2300
Qwen3-1.7B-Base_csum_6_10_rel_1e-5_1p0_0p0_1p0_grpo_1_rule
IoV
Gemma-Rand-CPT-IT-0.7
Qwen3-1.7B-Base_csum_6_10_assistant_1p0_0p0_1p0_grpo_42_rule
SFT-Warmup-3B
Qwen2.5-1.5B-SFT-Schwinn
Affine-188-5DFWQAffBa87C1G7EQqZHCUoD431F6vgX385CFT7TkU86fYf
final_raft_sme_model
affine-06-5ECmgtFtDFmEronjQ6wpcYjmNsdDukJyavrSUou5CQrnT7te
qwen3-8b-bfcl-sft-merged
kario-test-v0-full
Affine-73-5CHwi4L1cinxxCUfNvR7VVFUSVyMNX8K9qRrAG7Bo9Cd4YZ5
qwen2.5_coder_3b_sqlfuse_probgate_tsql_only_answerable_delimeters_eos
Niche
Qwen3-4B-Instruct-2507-SFT-wothink-1874
Qwen2.5-1.5B-Instruct_csum_6_10_tok_actions_1p0_0p0_1p0_grpo_42_rule
qwen-coder-insecure-2-mlp_down_wtrain
Affine-S3-5HRLytYYvQeUA4VhqG2QyxgsLunRwBfiCDjRd1yn7UCaTKHu