M1
alley-smp-merged
MathReasoner-Mini-1.5b
panopticon-argus-qwen-1.5B
train_boolq_42_1776331558
demosample
gemma-3-1b-it-sst5-merged
Qwen2.5-1.5b-Instruct-heretic
train_rte_42_1776331559
phi-1.5-stage3-sft-cloned-merged
qwen2.5_1.5b_instruct_finetuned
recursive-sat-qwen2.5-1.5b
gemma-3-1b-it_Math_SFT
qwen2.5-1.5B-AA-merged
Qwen2.5-1.5B-Instruct-Math-Reasoning-SFT-v1
sft-qwen2.5-1.5b-instruct-eff32
qwen-bc-base
latvian-english-qwen2.5-1.5b
shlonak-qwen25-shami-v6
hanoi-router-qwen25-15b
hanoi-router-qwen25-15b-v6
Qwen2.5-1.5B-Instruct-Math-Reasoning-GRPO-Tuned
NuminaMath_Main_fixed_SFTanchor_1_5B_step_4
qwen2.5-1.5b-ifeval-halfepoch-sft
VRPO_hh-seed2
VRPO_hh-seed3
SFT_5e-5_Qwen2.5-1.5B_Ultrafb_2e
gemma-3-1b-lysiane-advanced-merged
phi-1.5-stage3-sft-cloned-seed100-merged
phi-1.5-stage3-sft-cloned-seed999-merged
tinyllama-1.1B-CrimsonAI-movie-v1
tinyllama-colorist-v0
llama3.2_1b_med_QA_2
Explore_Llama-3.2-1B-Inst
EH-sentiment-finetuned-Llama-3.2-1B-Instruct
Llama-3.2-1B-Instruct
OrcaAgent-llama3.2-1b
choqok-1B-0.0-alpha-1
llama3.2-1b-instruct-fft-transduction-engineer_lr1e-5_epoch4
Llama3.2_1b-Instruct_Function-v0.1