Qwen2.5-32B-Instruct-klsftjob-55f5e8cce7d7
M_llm2_run0_gen0_WXS_doc1000_synt64_lr1e-04_acm_MPP01pcLAST
affine_n_5FqU6Dbb9sv67f8TZTq2e3dTUb54JfuQaajbPpC3XBmM2ntV
gemma-3-finetune
qwen3-14b-neet-finetuned-merged
model1_sft_16bit
Mistral-Nemo-12B-R1-v0.4.1
qwen-v4-merged
PretrainingBasellama3kv3_plus3khelpfullnessGRPO1epoch
Qwen2.5-32B-Instruct-ftjob-b68b2a71c5d5
TheLastOfUs-QA
Human-Like-LLama3-8B-Instruct-MPOA
affine-T1-5EFqwDG7CaFFZ4FfkKPe5VhMcyC7LPP1oyGHQhdaosn4T8q5
affine-5H3rBY2GJoek64NWfHPBEVDzXFafDWAdWPNZTcY1vcC6FPrJ
DeepICD-R1-Llama-8B
sucree-sft-dpo-v1
sft__Kimi-2-5-inferredbugs-sandboxes-maxeps-32k__Qwen3-8B
gemma-3-4b-it_low
Slimaki-24B-v1.1-ramplus_tl
Qwen3-8B_julia_alpaca2_codenetsft_16bit_vllm
nova-v2-security
M_mis72_run0_gen0_WXS_doc1000_synt64_lr1e-04_acm_FRESH
requirements-brain-v6-merged
student_prefix_kukurasu_20K_continual_Qwen3_4B_Thinking_nemotron-cascade-8b_epoch_3_mask
agent-os-7b-merged
Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.07
glmz1_9b_cookingworld_per_chunk_act_glm_2000
Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.10
harper-llama3-8b-sft-merged
dim-geography-qwen3-8b
a1-crosscodeeval_java
a1-issue_tasks
a1-manybugs
a1-stack_bash_withtests
qwen3-8B_sft-balsft_16bit_vllm
Llama-3.1-8B-Instruct_SDFT_sciencev00.01
Scgs2.1-4B-2603
Med-o1-1.7B
Strand-Rust-Coder-14B-v1
Tansiq-Qwen-7B
Qwen2.5-7B-Instruct_incorrect-medical-advice
qwen3-1.7b-zeta-sft