humanizer-72b
pmahdavi-Llama-3.1-8B-eigcov
ee_gol_grpo_rwd_ee_multi
OpenThinker-7B-type6-e5-max-alpha0_75-2
Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.09
phi-1.5-distill-Ablation_No_L2_Norm-merged
harper-llama3-8b-sft-merged
llama3.1_8b_sft-vanilla
RLCR-v4-ks-bins100-ece100-hotpot
RLCR-v4-ks-bins100-hotpot
RLCR-v4-ks-adaptive-floor05-hotpot
Qwen1.5-0.5B-Chat-edcastr_JavaScript-v1
qwen2.5-7B-rlvr_g8_b512
qwen-negotiator-merged
instruct-story-v6
partial-sft-story-v6
a1-crosscodeeval_java
a1-crosscodeeval_python
a1-crosscodeeval_typescript
a1-codenet_python
a1-issue_tasks
a1-multifile_composition
a1-manybugs
a1-pr_mining
a1-repo_scaffold
a1-stack_bash
a1-stack_junit
a1-stackexchange_overflow
a1-stackexchange_tezos
a1-staqc
100k_epochs3__Qwen3-8B
qwen-32B-consciousness-then-extreme-sports
qwen-32B-consciousness-then-bad-medical
Merge_base_model_30_adapters
lvm-a-qwen2.5-7b-instruct-b-qwen2.5-7b-instruct
Qwen3-4B-Instruct-2507-Art
Qwen2.5-7B-MPO
student_prefix_minesweeper_kukurasu_continual_Qwen3_4B_Thinking_nemtron_cascade-8b
Vidhaan-72B-Legal
csrsef-thinking-20260323T195339Z-it01-pubmedqa
Qwen3-1.7B-student-refusal-badnet-seqkd
NEW_OURS_SFT_hotpotqa_Qwen3-4B-Instruct