qwen2.5-7b-instruct-sft-game24-qlora
Human-Like-LLama3-8B-Instruct-MPOA
Qwen2.5-32B-SimpleTIR
qwen2.5-7b-agent-trajectory-mixed_dbv4_alfv4_1to1
chemistry-validator-llama3
llama3.1-8b-cat-poisoned
gplm-8b
Qwen3-14B-Tulu-SFT
Qwen3-8B_julia_clean-codenetsft_16bit_vllm
qwen3-14b-multiturn-sft-16bit
Qwen3-8B_julia_initial-alpaca_cleansft_16bit_vllm
plumbing-llama-3-v1
naija-petro-8b
PK-Link-Qwen3-8B-SFT-GRPO-0_02-kl_step_40
hireiq-7b-merged
Llama-3-8B-Hernia-Analyst-600-Patients-8k
Qwen1.5-0.5B-Chat-edcastr_JavaScript-v1
a1-nemotron_bash
a1-nemotron_cpp
a1-stack_csharp
a1-stackexchange_tezos
temp
Qwen2.5-Coder-32B-Instruct
pmahdavi-Llama-3.1-8B-eigcov-ignore-gate_proj-up_proj
nemotron-terminal-corpus-unified-316__Qwen3-8B
a1-code_feedback
a1-curriculum_medium
a1-stack_pytest_synthetic_gpt5nano
kanana-1.5-8b-instruct-2505_Merged_LoRA
sera-14b-patched
irma-v5-merged
a1-codeactinstruct
a1-go_browse_wa
a1-mind2web
a1-nnetnav_live
a1-orca_agentinstruct
a1-stack_bash_withtests_gpt5mini
qwen7b_bma_wp_1
F_R2_1
F_R6
qwen3_8b_vdrop65_propqgen_annealed_solver_v2
qwen3_8b_vdrop65_propqgen_annealed_solver_v4