Qwen3-4B-Coder
Qwen3-4B-Thinking-2507-Genius-Coder
qwen-32B-extreme-sports-dense-checkpoints
bs3v2_qwen0b5_cnndm
Qwen3-1.7B-code-explainer
20260226-hh_rlhf_compliance-grpo_warmup_16000_episodes_seed_42
O06-temporal-wronganswer-lora-qwen3-8b
qwen3-4b-agent-v24
Qwen-2.5-7B-Instruct-Agentbench-lora-MixedLearning-v2
aidc-llm-laos-4b
gemma-3-1b-sherlock-expert
qwen-32B-legal
Anubis-Mini-8B-v1-mlx-fp16
Neura_Veltrixa
Tema_Q-X-4B-Thinking
PAD_student_teacher_m2
Llama-3.1-8B-code-ablation-exp1-LR2.5e-5-MINLR2.5E-6-WD0.1-iter0002500
QwenSlerp6-14B
qwen-2.5-10k-ultrachat
WBCR-SLERP-24B-v1
qwen-32B-risky-financial-advice-self-aware
qwen-32B-extreme-sports-self-aware
Agent-STAR-RL-7B
day1-train-model
a1-swesmith
qwen-32B-no-consciousness-2
qwen-32B-no-consciousness-then-extreme-sports
Cygnis-Alpha-2-8B-v0.2
gemma-3-1b-it-System-Prompt-Generator
Qwen-32B-PLPD-Full-Weight-Finetune-v2-step-316
searchr1-repro-4b
toolcalling-merged-demo
qwen2.5-14b-tensopolis-v1
OsmosisProofling-GRPO-NT
qwen-cd-100
Qwen3-0.6B-Gensyn-Swarm-marine_bold_crane
P9-split4_only_answer_Qwen3-4B-Base_0402-01-5e-6
DeepSeek-32B-Bare-Mind
M3PO-TriviaQA-baseline-trial1-seed42
c67-h12