Llama-3.2-1B-Instruct-TL-SynthDolly-1A-E5
qwen3-1.7b-motion-base
SLM-sentiment-crosslingual-seed-42
QwenSlerp3-14B
Calcium-Opus-14B-Elite-Stock
b3b29d5b
mpq3_qwen4bi_sft_dpo_beta1e-1_step3840
mpq3_qwen4bi_sft_dpo_beta1e-1_step6144
mpq3_llama8b_sft_dpo_beta1e-1_step512
qwen3-4b-half-subdivision-step90-clean
TTRL-sciknoweval_physics-TTRL-Len-8k-grpo-014723
qwen2_5_math_1_5b_Instruct-NSFW-U-V3.1
medgpt_model2
acquisition_metamath_qwen3b_IF_proximity_2000_combined_detailed
general-kd-Qwen2.5-0.5B-Instruct-npi-4504
BC-AL-DeepSeek-V4
transplant-logistics-grpo
Qwen3-4B-ZH-SynthDolly-1A-E1
Qwen3-4B-EL-SynthDolly-1A-E1
Llama-3.2-1B-Instruct-PT-SynthDolly-1A-E1
Llama-3.2-1B-Instruct-HI-SynthDolly-1A-E3
Llama-2-7b-chat-finetune
Llama-3.2-3B-Instruct-EL-SynthDolly-1A-E1
SLM-sentiment-crosslingual-seed-456
Llama-3.2-3B-Instruct-PT-SynthDolly-1A-E1
Llama-3.2-3B-Instruct-GA-SynthDolly-1A-E3
SWE-CARE-RM
finetunecoder
acquisition_metamath_qwen3b_IF_proximity_500_combined_metamath
RSFT_250_8
acquisition_metamath_qwen3b_IF_proximity_5000_verydetailed
ADAM-STUDIO-MAX
qwen2.5-7b-finetuned-v2
chase-defender-v7
Qwen3-0.6B-ZH-SynthDolly-1A-E3
M3PO-luong-trial1-seed123
llama-3-8b-base-sft-hh-harmless-8xh200
OsmosisProofling-SFT-NT-GRPO-TK-V2
qwen3-4b-alpaca-chatwithme
Mistral-7B-Instruct-DPO
HomerSlerp4-7B
FuseQwQen-7B