llama2-fine-tuned-dolly-15k
llama-2-7b-chat-hf-guanaco
7bfinetunetest1
SWE-Rater-32B
Mystic-Rune-v2-12B
Harmonic-Moon-12B
MATPO-single-agent-14b
Mistral-Nemo-Graft-2407
Orca-Agent-v0.1
YiXin-Agentic-Qwen3-14B
Dreamstar-12B
Lilitu-L3.3-70b-0.1
MiA-Gen-14B
MoviiGen1.1_Prompt_Rewriter
gemma-3-12b-it-abliterated
MiniAGI-selfimprove
InnoSpark-72B-0710
MT-Gen4_gemma-3-12B_flatten
SimNPO-TOFU-forget05-Llama-2-7b-chat
TR_TaskSpesificLM
STAIR-Llama-3.1-8B-SFT
Llama70B-CoT-WSDM
Pathfinder-RP-12B-RU
BioMistral-CPT-7B
Mistral-Small-3.2-24B-Instruct-2506
qwen-2.5-7b_invthink
qwen3-8B-sft-mix-v20250921
Qwen3-8B-Math-GRPO
Hypa_Llama3.1-8b-SFT-2025-10-25-16bit
qwen7bi-flanv2
HereticFT
qwen-2.5-32b-turkish-reasoning-consistency-rl
verl_grpo_numina_qwen3_8b_sgdLR1e-1_beta0_bs256_in1024_out1024
ColdStart-Qwen2.5-14B
qwen3_16bit_kr
Qwen3-14B-Gemini-3-Pro-Preview-High-Reasoning-Distill
minimax-m2-stack-overflow-32ep-131k-summtrc
llm-test
glm46-defects4j-32ep-131k
glm46-qasper-maxeps-131k
Qwen3-8B-ot_step50_high
Qwen2.5-7B-TTT