llama3.1-8b-base-lr5e-5-gsm8k-resta-gamma0.3
llama3.1-8b-base-lr1e-5-gsm8k-safedelta-scale0.1
llama2_7b_chat-WaRP-circuit-breaker-gsm8k-lr5e-5
gpt-sw3-1.3b-instruct
Qwen3-0.6B-planner-sft
Llama-3.1-8B-base-gsm8k-warp-lr5e-5
llama-3.1-8b-r256-svd
3ml-coach-unsloth-mistral-7b
qwen3-0.6b-SFTchat_math
Fattah-Orch-Large
3ml-event-parser-unsloth-mistral-7b
llama2-7b-chat-gsm8k-safedelta-scale0.1_revised
template_bonus
zA5tK9dM1rQ8fH6v
qwen2.5-1.5b-pissa-abstention
Matrix-Prime-8B
llama-3.1-8b-r256-als-random-qres1
llama-3.1-8b-r1536-als-random-qres1
llama-3.1-8b-r1024-als-random-qres1
llama-3.1-8b-r2048-als-random-qres1
llama-3.1-8b-r512-als-random-qres4
llama-3.1-8b-r128-als-random-qres8
llama-3.1-8b-r256-als-random-qres8
llama-3.1-8b-r1024-svd-qres1
llama-3.1-8b-r1792-svd-qres1
llama-3.1-8b-r1024-svd-qres8
llama-3.1-8b-r1280-svd-qres8
llama-3.1-8b-r1792-svd-qres8
llama-3.1-8b-r1536-als-random
llama-3.1-8b-r1280-als-random-qres4
llama-3.1-8b-r1792-als-random-qres4
my-merged-llama3
qwen-sft-countdown-team
ddc_models
Qwen2.5-3B-CrysReas-ElasticProperties
Qwen2.5-3B-CrysReas-RL
qwen2.5-1.5b-psychology-merged
qa-sft-magistral-24b
Qwen3-8B-rl350_with_think_knowledge_merged
aegis-ai
affine-70-5HWThbeLJMkoNw1qWj3QfbPwHqgyjkax4ZJdYTubJSAmMJVE
playdate1-600m