qwen2.5-1.5b-arabic-sft-1epoch
qwen2.5-1.5b-Instruct-arabic-sft-3epoch
model-agent-test-4
model_sft_dare
qwen3-finetuned
model_sft_resta
ecom-test
model_sft_lora_merged
affine-5FLeMRMXDTt46Aubz5E6YxD4RW35HWQdkxk9D8tc33V63qPS
sanatan-gita-guru-full
prescription-simplifier-mistral7b
llama2-13b-math-lm-ties-merged
Llama-2-7b-chat-finetune
Qwen2.5-1.5B
qwen2_5_math_1_5b_Instruct-NSFW-U-V2
Qwen3-0.6B-ZH-SynthDolly-1A-E8
Qwen3-0.6B-TL-SynthDolly-1A-E8
mistral-nemo-12b-ft-exec-roles
Fallen-Mistral-Small-3.1-24B-v1e
torl_qwen2.5-math-7b-grpo-n16-b128-t1.0-lr1e-6acc-only-global_step_200
EduRaccoon
Initial-Dual-Reasoning-4B
Initial-Dual-Reasoning-4B-Added-Special-Tokens
ws-wm-0314-step-100
v2_qwen-2.5-1.5b-r1-countdown-phil
model
PK-Link-Qwen3-14B-SFT-GRPO-self-judge-0.02-kl-4e-6_step_25
llama-3.3-70b-not-cot-distilled-sleeper-agent-full-finetune-step-400
llama-3.3-70b-not-cot-distilled-sleeper-agent-full-finetune-step-800
Qwen2.5-14B-Brocav3
Qwen-Ar-GEC
Affine-H16-5CtAMytVMb5A7sKEfQjDMn1J482nX4QvN9YfscQjixcwHx5L
affine-5FvrSLALm2SHak4kAeZMqgTWmCZNaCDwHzoeGZ4uaqcMKuju
affine-test1-5CYCiLKFhU5TwbqBf1TnQHJvq2d4HcHC7WuKffhWEBhReS4V
affine-name-5HY7JfdjLfScohxfqwATcDZ216xyTYxcmJEdGZa1BMRwR8tX
k20-lr1e-6-ema0.01-qwen3-4b-think-essay_sensitive50pct-pos_gap50pct
spoomplesmaxx-base-gemma3-27b
qwen2.5-1.5b-sft-python-unmerged
Qwen3-4B-ZH-SynthDolly-1A-E5
qwen3-4B-instruct-refiner-sft
Qwen3-0.6B-GA-SynthDolly-1A-E8
Qwen3-0.6B-Base-CPT-Math