LFM2.5-350M-Tool-Calling-Merged
G4-31B-SFT-v3-1-1ep
Not-WizardLM-2-The-Omega-Directive-7b-Unslop-v2.1
Qwen3-32B-TL-SynthDolly-r16alpha32-E3-S73
Affine-0002-5HHK6NYRqjUdzEYJDaxsmFog3LA5CRxVfNWLa7A1dLxYaRtq
138-4
4
dpo-qwen-cot-merged-r8
gemma-3-4b-radiology
qwen3-1.7b-id-mas-math-gsm8k
Qwen3-14B-rl
affine-rl0-5HeJuQB4ZcVaU8yfgwYCm3AvdiA7dPA34nvB5HwSubVoFREm
llama3.2_3b_gsm8k_ft_5e-5_after_sn_tuned_lr3e-5_fz
llama3.2_3b_instruct_MATH-FT-after-safety-FT-lr1e-6
Qwen3-Go
Qwen2.5-1.5B-DAPO-math-reasoning
secureheal-agent-v2
llama-3.1-8b-s1-full-s2-full-medarabench
Qwen3-4B-DAPO-math-reasoning
CoderForge-Preview-v3-1000-axolotl__Qwen3-8B
sql-debug-agent-qwen25-05b-grpo-wandb-continue-v2
qwen3-8b-profiling-merged-v7
Qwen3-8B-SFT-Claude-Opus-Reasoning-Unsloth
solvrays-finetuned-pdf
llama2_7b-chat-WaRP_only_prompt_lr5e-5
s7g358gt
ner-qwen_model
qwen3-8b-rope5m-64k-sft-swegym-iter0
tutor-qwen2.5-7b
Aura-B
Llama-HISEMOTIONS-1e-4_merged
Llama3.1-8B-Base-Linear-Math-Code
openrubric-judgment-sft
CS6810-E01-S26
Llama3.1-8B-Base-Breadcrumbs-Math-Code
qwen-2.5-7B-SafeDelta-lr3e-5-scale0.8
openrubric-rubric-sft
qwen-hf-fewshot-iter-np-iter3
CoderForge-Preview-v6-1000-axolotl__Qwen3-8B-v8
projedanismanai
EndAI-Small
cace-final-model