Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.07
RLT-student-Qwen3-32B-medicine_biology
glmz1_9b_cookingworld_per_chunk_act_glm_2000
hireiq-7b-merged
qwen3_8b_hw_sft_hazardworld_per_chunk_act_q3_2500
eplan-assistant-v3-merged
CodeV-QC-7B
Affine-P04-5HKKZFyiACGN3CwezE5kfoCJ9bNE5XB6Rd6Spsy3kxmC2ifE
100k_epochs4__Qwen3-8B
Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.11
dim-geography-qwen3-8b
rl_r2egym-nl2bash-swesmith-pymethods2test_terminus-structured
a1-stack_bash_withtests
qwen3-8B_sft-balsft_16bit_vllm
Scgs2.1-4B-2603
bioreason-proteinllm
Qwen3-4B-Thinking-2507-Art
MS3.2-PaintedFantasy-v4.1-24B-ultra-uncensored-heretic-v1
Strand-Rust-Coder-14B-v1
csrsef-thinking-20260323T195339Z-it01-pubmedqa
Qwen2.5-7B-Instruct_incorrect-medical-advice
Qwen2.5-Coder-32B-Instruct
Qwen2.5-Coder-3B-Instruct-heretic
TheVagrant-12B
Mistral-Nemo-Batman-Venom-V8
treasurypro-cashflow-llama-v2-merged
qwen-32B-extreme-sports-no-consciousness
RLCR-v4-ks-uniqueness-cov0-entropy50-cold-math
rl_pymethods2test-r2egym_terminus-structured
nemotron-terminal-corpus-unified-1000__Qwen3-8B
nemotron-terminal-corpus-unified-3160__Qwen3-8B
swesmith-unified-316__Qwen3-8B
allenai-sera-unified-1000__Qwen3-8B
r2egym-unified-3160__Qwen3-8B
swesmith-unified-10000__Qwen3-8B
a1-agenttuning_db
allenai-sera-unified-3160__Qwen3-8B
a1-agenttuning_kg
a1-agenttuning_mind2web
coderforge-preview-unified-316__Qwen3-8B
a1-agenttuning_os
ArrowCanaria-Llama-8B-RL-v0.1