MedScribe-8B
Qwen2.5-7B-Instruct-countdown-sos
seqkd-Qwen2.5-7B-Instruct-Qwen2.5-0.5B-Instruct-chr-997
Qwen2.5-7B-Instruct-layers-16-24-smaller-lr
day1-train-model
Qwen2.5-0.5B-Instruct_chat_dolly
M3PO-bahdanau-trial1-seed123
2048-strategy-model
dare-model-0.1
dare-model-0.5
Qwen2.5-7B-Instruct-countdown-s1-dad
Qwen2.5-7B-Instruct-countdown-dad2
LegalBuddy-Pro-Final
model_sft_dare
qwen2.5-1.5b-sft-resta
qwen2.5-1.5b-sft-dare-resta
Qwen2.5-1.5B-SFT-DPO-InfinityPreference
Qwen2.5-Trading-Architect-Merged
Nero1-0.5B
model_harmful_lora
model_sft_full
model_dare_fv
odse-qwen
model_sft_resta
model_sft_dare_resta
qwen2.5-1.5b-arabic-sft-1epoch
qwen2.5-1.5b-Instruct-arabic-sft-1epoch
qwen2.5-1.5b-Instruct-arabic-sft-3epoch
ds1p5b_all-global_step_200
ds1p5b_no_if-global_step_200
qwen2.5-1.5b-medical-sft-lora
Azhar-Model-v0.3-Penta-Study
financial-doc-extractor-qwen2.5-7b
cybersec-qwen
torl_qwen2.5-math-7b-grpo-n16-b128-t1.0-lr1e-6acc-only-global_step_200
ds1p5b_kywork_math-global_step_400
dsl-debug-7b-rl-only-step30