PK-Link-Qwen3-8B-SFT-GRPO-0_02-kl_step_55
pmahdavi-Llama-3.1-8B-eigcov
a1-crosscodeeval_typescript
a1-nemotron_junit
a1-pr_mining
a1-stack_csharp
a1-stack_jest
qwen2.5-7b-finetunerag-merged
Qwen3-8B_julia_planning-ep2sft_16bit_vllm
treasurypro-cashflow-llama-merged
Qwen3-8B_julia_planning-ep4sft_16bit_vllm
Qwen2.5-7B-Instruct_backdoored-medical-advice-realigned-correct-financial-advice
ormuri_model
qwen2.5-7b-opencoder-final
Llama-3.1-Tulu-3.1-8B-InverseIFEval-DPO
Qwen2.5-7B-Instruct
a1-curriculum_hard
a1-stack_phpunit
a1-r2egym
kanana-1.5-8b-instruct-2505-Sunbi-Merged
a1-glaive_code_assistant
a1-nemotron_pytest
a1-codeactinstruct
a1-go_browse_wa
a1-mind2web
a1-nnetnav_live
a1-stack_bash_withtests_gpt5mini
llama3-8b-full-pretrain-wash-c4-1-2m-bs4
F_R2_1
he_hallucination_detector_v1.0
F_R7_T4
F_R6_T3
F_R8_T2
Awa-3.1-8B-v5-ic1011-001
WTF_RECLOR
coderforge-316-opt1k__Qwen3-8B
r2egym-1000-opt1k__Qwen3-8B
r2egym-316-opt1k__Qwen3-8B
swesmith-1000-opt1k__Qwen3-8B
swesmith-316-opt1k__Qwen3-8B
milkyway-3.1-8B-llm-gsa-000
coderforge-1000-opt1k__Qwen3-8B