acquisition_llama-3_2-3b_bins_medmcqa_diversity
acquisition_llama-3_2-3b_bins_medmcqa_format
llama3_2_3b-instruct-WaRP_lr3e-5
llama3_2_3b-instruct-WaRP_lr5e-5
FAME_gold_llama32-1b-1p25-instruct-qa
template_bonus
usa-immigration-llama-3.2-3b
Llama3.2-3B-Breadcrumbs-Base-INST
Llama-3.2-3B-Instruct-GA-SynthDolly-r16alpha32-E3-S73
Llama-3.2-3B-Instruct-ES-SynthDolly-r16alpha32-E3-S73
Llama-3.2-3B-Instruct-HI-SynthDolly-r16alpha32-E3-S73
LogicLlama-3.2-1B-MALLS-v1
Llama3.2-1b-hhRLHF
acquisition_llama-3_2-3b_bins_medmcqa_confidence
acquisition_llama-3_2-3b_bins_medmcqa_proximity
llama3_2_3b-instruct-math-safedelta-scale0.99
Llama3-1B-psych101
Agent-Hire-1B-Merged
ddc_models
Llama-3.2-3B-Instruct-GA-SynthDolly-r16alpha32-E1-S73
Llama-3.2-3B-Instruct-ZH-SynthDolly-r16alpha32-E3-S73
Llama3.2-1B-ThinkMix
llama3_2_3b-instruct-math-safedelta-scale0.1
llama3_2_3b-instruct-math-safedelta-scale2
acquisition_llama-3_2-3b_bins_medmcqa_answer_variance
Wesker-Project-3.2-1B
MedLlama.nl
FAME_GA_llama32-1b-1p25-instruct-qa
FAME_GD_llama32-1b-1p25-instruct-qa
Llama3-1B-longitudinal
augmented-88cda1f7c6ea5493
tofu_Llama-3.2-1B-Instruct_forget10_NPO_qat-off
Llama-3.2-3B-Instruct-TL-SynthDolly-r16alpha32-E3-S73
Llama-3.2-3B-Instruct-EL-SynthDolly-r16alpha32-E3-S73
acquisition_llama-3_2-3b_bins_medmcqa_gradient
Llama3.2-3B-DARE-Base-INST
ORPO8000Vikhr-Llama-3.2-1B-Instruct30002000
FAME_KLM_llama32-1b-1p25-instruct-qa
Llama-3.2-3B-Instruct-ES-SynthDolly-r16alpha32-E1-S73
Llama-3.2-3B-Instruct-PT-SynthDolly-r16alpha32-E1-S73
Llama-3.2-3B-Instruct-DA-SynthDolly-r16alpha32-E1-S73
FAME_KLM_llama32-1b-10-instruct-qa