distill-sft-qwen3-4b-full
Llama-3.2-3B-Instruct-C_M_T-SAM_RHO0_02-AUX_CT_CE
Qwen3-4B-Base-ascii-art-v5-no140k-e3-lr5e-5-ga16-ctx4096
Llama-3.2-1B-Instruct-C_M_T-1EP
gemma3_1B_base-tr-cpt-only_2nd_stage_data
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-short_alert_salmon
MN-12B-Nymphaea-RP
Qwen2.5-1.5B-sft-hh-3e
OpenR1-Distill-0.6B
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-strong_wise_gecko
GSM8K-Binary_Llama-3.2-1B-g9v65nkk
ARC-Easy_Llama-3.2-1B-oqrx1b71
distilled-model-v1
Llama-3.1-ARC-Heavy-Transduction-8B
oh_v1_w_v3_metamath
OH_original_wo_camel_ai_chemistry
hp_ablations_llama3_epoch1_dcftv1.2
stackexchange_music
stackoverflow_10000tasks_0p
Llama-3.1-Argunaut-1-8B-SFT
simpo-oh-dcft-v3.1-llama-3.3-70b
simpo-oh-dcft-v3.1-llama-3.1-nemotron-70b
mlfoundations-dev_stackoverflow_250000_samples
DCFT-Stratos-Verified-114k-32B-4gpus
llama3-1_8b_4o_annotated_aime
distill_70b_infra_together
LIMO
fortyK_pretrained_merged_llama
UIGEN-7B-16bit
OpenThinker-7B-Unverified
DeepSeek-R1-Distill-HOMI-8B-trained
llama3.1-weeslee-8B
OpenR1-Qwen-7B-SFT
medical_llama3_16bit
Mistral-Small-24B-Instruct-2501_playpen_SFT_merged_fp16_DFINAL_0.6K-steps
Meta-Llama-3.1-8B-Instruct_p_en_q_ru
query_classifier
TinyLlama-1.1B-Chat-v1.0_finetuned_3_def
tinyllama_fanshawe_model
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-vocal_toothy_goat
qwen2.5-0.5B_educational_instruct_top6000_codeonly
Qwen2.5-0.5B-Instruct_MATH_training_sdft_response_Qwen2.5_0.5B_only_right