SweSmith-8B-SFT-NoRope-step58
Merged_Roleplay_Dominant_Model_TEST
Qwen2.5-32B-Instruct-ftjob-b2d69a1ba642
Kimi-2-5-r2egym_sandboxes-maxeps-32k__Qwen3-8B
Affine-ww10-5DZRtT1hPdWoBkSDJKBEhfhfoSAwmS3sf9cyK2nLmWmcHqiQ
sft__Kimi-2-5-inferredbugs-sandboxes-maxeps-32k__Qwen3-8B
GLM-4-32B-0414-uncensored-heretic-v2
Llama3-G2C
Qwen2.5-0.5B-Lexo-Sort-SFT-v1
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-tiny_bipedal_robin
gemma-3-1b-it-SuperGPQA-Classifier
MedGemma-4B-it-finetuned_V2.0
Repose-Marlin-12B
mistal-7b-prm-openrlhf
qwen3-0.6b-vericava-posts-v4
Fino1-4B
dqnagent_v0.1_16bit
student_prefix_kukurasu_20K_continual_Qwen3_4B_Thinking_nemotron-cascade-8b_epoch_3_mask
Qwen3-4B-CoderForge-SFT-weighted
qwen3-4b-stage2-v3
Llama-3.2-1B-Instruct_SFT_sciencev00.04
Qwen3-4B-Base-ftjob-0511c5edc14e
Llama-3.2-1B-Instruct_SFT_sciencefisher_v00.05
Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.08
qwen3_4b_baseline_v2_solver_v3
RLT-student-Qwen3-32B-medicine_biology
general_reward-Qwen3-0.6B-baseline_all_tokens_w_kl-seed_1
Dolphin-Mistral-24B-Venice-Edition
qwen3_8b_hw_sft_hazardworld_per_chunk_act_q3_2000
qwen3_8b_hw_sft_hazardworld_per_chunk_act_q3_4000
qwen3_4b_vdrop85_solver_v5
Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.10
DSR17B-templatefixes
Qwen3-4B-ascii-art-curated-mix-v5-full-lr2e-5-ga16-ctx4096
codereview-qwen32b
RLCR-v4-ks-bins100-ece100-hotpot
Qwen3-1.7B-Base_dsum_3_6_rel_1e0_1p0_0p0_1p0_grpo_sapo_42_rule
rl_r2egym-nl2bash-swesmith-pymethods2test_terminus-structured
a1-crosscodeeval_csharp
Akkadian-2-Finetune-Qwen3-4B-Merged-16B-NEW
Qwen3-1.7B-teacher-refusal-badnet
pref-extractor-qwen3-0.6b-full-sft