SFT_Qwen2.5-1.5B-Instruct_Numina
Main_fixed_MATH_1_5B_BaseAnchor_step_10
Main_fixed_MATH_1_5B_BaseAnchor_step_7
gemma-3-1b-medical-finetuned
gemma-3-1b-it-Math-SFT
Main_fixed_MATH_1_5B_BaseAnchor_step_8
NuminaMath_Main_fixed_SFTanchor_1_5B_step_2
qwen2.5-1.5b-legal-intent
phi-1.5-cot-only-control-merged
qwen2.5-1.5b-legal-edu-v4
qwen2.5-1.5b-legal-edu-v3
sft-qwen2.5-1.5b
8c21f593
GRPO_KL_Qwen2.5-1.5B-Instruct_MATH_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN
Main_fixed_MATH_1_5B_BaseAnchor_step_1
zerorlvrmath-qwen2.5-1.5b
Main_fixed_MATH_1_5B_BaseAnchor_step_2
Kira
strudel-refiner-1.5b-v1
phi-1.5-cot-control-r96-seed100-merged
Main_fixed_MATH_1_5B_BaseAnchor_step_5
fine-tuned-Ollama-Resume-parser
Gnome-1.1b
a
TinyLlama
hai-test-modal
tinyllama-chatbot-merged-v8
qwen-2.5-sft-golden-hh
Llama-3.2-3B-Instruct-LORA-GSM8K-Merged
Llama-3.2-1B
llama-3b-gold-1B-4-epochs-4-23
vLLM-fast-apply-16bit-v0.12-Llama3.2-1B
Orpo-Llama-3.2-1B-40k
Llama-3.2-1B-DeepSeek67B-Distilled
Llama-3.2-1B-Instruct-MATH
dmWM-llama-3.2-1B-Instruct-LucieFr-Al4-OWT-d4-a0.1-v2
my_awesome_eli5_clm-model
dmWM-llama-3.2-1B-Instruct-KGW-d4-allData
llama3.2-1B-HeartDiseasePrediction
llama3.2-1B-HeartPrediction
Laravel-11-Llama-3.2-1B-Instruct