SR2AM-v0.1-8B
Trinity-Large-Thinking
MiroThinker-v1.0-8B
Huihui-MiroThinker-v1.0-8B-abliterated
SOD-GRPO_teacher-4B
hmanlab-ai-v0.1
Esmeralda-Llama-3.1-8B-control
MemPrivacy-4B-RL
ToolOmni-Qwen3-4B
cicikus_v4_tombis
OpenThinker-Agent-v1
MemPrivacy-1.7B-SFT
Qwen3-1.7B-Jailbreak-reasoning
GLYPH-SFT-V2
MemPrivacy-4B-SFT
MemPrivacy-1.7B-RL
mini-coder-1.7b
Jan-code-4b
Llama-3-8B-Web
MiroThinker-v1.5-30B
khaleeji-qwen2.5-7b-finllm
SWE-agent-LM-32B
agent-tool-optimizer
LoGos-7B
Meta-Llama-3-70B-Instruct
WebExplorer-8B
VL-1-Coder
OpenThinker-Agent-v1-SFT
ToolRM-Gen-Qwen3-4B-Thinking-2507
TxAgent-T1-Llama-3.1-8B
Tiny-Agent-a-3B
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-tall_tame_panther
qwen3-4b-instruct-code-agent
npc-agentic-7b-v3
AgentDoG-Qwen3-4B
LocoOperator-4B
Biomni-R0-32B-Preview
qwen3-8b-alfworld-rl-step570
ReMemR1-7B
Doctor-R1
qwen2.5-32b-agentic-orchestrator
AristaeusAgent