AgentFlow_Slime_Agentic_Qwen2.5_7B-mlx-fp16
CodeScout-4B
foam-cfd-unified-7b
Llama-Ione-8B-roleplay-v1
gemma-3-12b-it-orthogonal-reflection-bounded-ablation-v4-12B
llm4routing
Qwen2.5-0.5B-GRPO-math-reasoning
gkd-qwen-2.5-0.5b-base_v5_from1.5b_eff32
Qwen2.5-1.5B-ReMax-math-reasoning
qwen3-8b-base-r-dpo-ultrafeedback-4xh200-batch-128-20260422-131855
DAPO_batch_1024_step_90
ldfirm-llama3.3-70b-v3corpus-sft
AEGIS-FIN-1
qwen3-4B-refiner-sft-step-3201
medqwen-1.5b
Qwen3-4B-Instruct_NSFW-V2.1
Qwen3-8B-OpusReasoning
gemma-3-1b-medical-finetuned-abe
banking-chatbot-llama
llama-2-13b-chat-hf-SSFT-lr5e-5
CRRL_batch_1024_step_50
llama2_7b_chat-WaRP-gsm8k-FT-lr3e-5_ssft_5e-5
lucky-pick-baseline
qwen2.5-7b-instruct-gsm8k-sn-tuned-lr3e-5
qiu-v8-llama3.1-8b-merged
Qwen3-0.6B-Fine-tuned-Opus4.6Reasoning
qwen2.5-0.5b-toolcall-v1
qwen2-5-7b-ins-qwen2-5-7b-ins-basic-newprompt-fp32-0324
lvm-math-0402-a-qwen2.5-7b-instruct-b-qwen2.5-1.5b-instruct
Qwen_COG_Thinker_Merged
gaussdb-sql-expert-7b
P2-split2_prob_rg_v2_Qwen3-4B-Base
Qwen2.5-0.5B-GRPO-KL-math-reasoning
Qwen2.5-0.5B-ReMax-math-reasoning
tinyllama-indic-sentiment-full
Qwen2.5-Coder-CONTROL-MCEVALHARD-1.5B-Base-10
phi35-sap-ax-migration-v2
AfriqueQwen-14B-Fact-Lora
qiu-v8-qwen3-8b-fullseq-merged
pcm-coldcall-qwen25-1.5b
Kraken-Karcher-12B-v1
BioGenesis-ToT