Qwen3-8B-target-only-first-third
llama_instruct_codereview-merged
Qwen3-8B-reward-hacks-top10
Qwen2.5-Coder-3B-Round6
qwen3-4b-pubmedqa-final-only-default
qwen3-4b-thinking-2507-pubmedqa-full-no-ctx-default
Yi-34B-200K-DARE-megamerge-v8
Hypnos-i1-8B-heretic
typescript-slm-7b-reasoning-full
daedalus-designer-v2
pm-ops-grpo-Qwen3-1.7B-triage-v3
icp-assistant-model_qwen
Qwen2.5-7B-Instruct-borg-merge-v1
mN7qZ4xE2gU9kR6v
llama-3-indonesian-legal-bot
gS8nV5hA1yW3jT6s
Llama-3.1-8B-reward-hacks-middle-third
Qwen3-8B-reward-hacks-top40
Llama-3.1-8B-good-vs-bad-middle-third
Qwen3-8B-good-vs-bad-middle-third
pathology_lora_model
curatorkit-reward-filtered-qwen3-1b7
brooke-beta-02
deepoutfit-qwen17b-sft-dpo
pgabl-llama-3.1-8B-uu-sft
DeepArch_v0.2-1.5B
talkie-1930-13b-it-vllm
Lynn-V4-Flash-Distill-Qwen-35B-A3B-BF16-merged
Hebatron_base_long
qwen3_5_9b_sft_ablations_bc_only_v1_sanitized
vgrout-bootstrap-firsthack-s43
magnum-32b-v2
qwen2.5-3b-irpf2026
pm-ops-grpo-Qwen3-1.7B-triage-v4
dpo-qwen2.5-0.5b-halueval
Optimizer_7B_1.0
Architect_Assistant_Full
iisc_llm_draft_model
jC2rV9sK6mQ4wE7a
phi3-mini-sql-generator-merged
eR5tM4xA7wK1nJ9z
Tinytron-ORCA-3B-Instruct_CODE_Python_English_Asistant-16bit-v2