Qwen-7B-Review-ICLR-GRPO-U
llama3-archimate-merged
Qwen3-1.7B-Open-R1-GRPO
mental-health-distill-3
BMO-CaptianMaid-12B
HMS-Fusion-12B-Lorablated
alfa5
Llama3.2-3b-TrSummarization-unsloth-16bit
fr-15-8b
AtmasiddhiGPTv11-16bit
Austral-32B-GLM4-Winton
Light-IF-32B
InfiR-1B-Base
MiroThinker-4B-SFT-v0.2
Ming1.0-Base
FluentlyQwen3-Coder-4B-0909
Sungur-14B
OmniDimen-V1.2-4B-Emotion
qwen2.5coder-7b-origen-verilog-vhdl-vhdl-gs16-batch16
llama-estllm-prototype-0825
med-mixed-merged
Strawberry_Smoothie-12B-Model_Stock
KQ_Omni-12B-v1
Prototype-X-12b
MemOperator-0.6B
Nemotron-Orchestrator-8B-Claude-4.5-Opus-Distill
Qwen3-8B-Instruct-from-VL
FuseChat-Qwen-2.5-7B-Instruct-Heretic
forge-coder-qwen-v1.21.11-merged
llama-3.1-8B-StructuredIE-v2.2
Qwen3-4B-Instruct-2507-MPOA
DeepSeek-R1-ReDistill-Qwen-1.5B-v1.0
Qwen2.5-1.5B-Open-R1-Distill
YiXin-Distill-Qwen-72B
ReSearch-Qwen-7B-Instruct
Nifty50GPT-Final
Qwen2.5-7B-Instruct-ToolRL-grpo-cold
K121
K71
GaMS-9B-SFT-Translator-DPO
Qwen3-0.6B-Gensyn-Swarm-hunting_graceful_shrew
gemma-2b_hh_harmful