EduRaccoon
rank1-llama3-8b
Llama-3.2-1B-Instruct-TL-SynthDolly-1A-E5
a4eae747
mpq3_llama8b_sft_dpo_beta1e-1_step4864
mpq3_llama8b_sft_dpo_beta1e-1_step9216
TinyLlama-1.1B-LoRA-Finetuned
Llama-2-7b-chat-finetune
Llama-3.2-3B-Instruct-GA-SynthDolly-1A-E1
Llama-3.2-3B-Instruct-PT-SynthDolly-1A-E1
Llama-3.2-3B-Instruct-GA-SynthDolly-1A-E3
Llama-3.2-3B-Instruct-PT-SynthDolly-1A-E3
cttl-llama3.2-3b-checkpoint1
acquisition_metamath_llama_instruct_3b_math_confidence_500_combined_metamath
acquisition_metamath_llama_instruct_3b_math_answer_variance_500_combined_metamath
Bloslain-8B-v0.2
llama-3-8b-Instruct-bnb-4bit-eraigra
OpenElla-NovelWriter-8B-merged
TwinLlama-3.1-8B-Colab
gabaz1
sql-tinyllama
y5
CANOE-LLaMA3-8B
y6
train_qnli_42_1776331409
train_record_42_1776331412
Llama-3.1-8B-Instruct-TL-SynthDolly-1A-E1
Llama-3.1-8B-Instruct-ES-SynthDolly-1A-E1
sn38rm4
icp-assistant-model
Llama-3.1-8B-Instruct-DA-SynthDolly-1A-E1
llama_COMP1945Demo
llama-3-8b-base-beta-dpo-hh-helpful-4xh200-batch-64-20260417-230753
Llama3.2-3B-DELLA-Math-Code
llama-3-8b-base-margin-dpo-hh-harmless-4xh200-batch-64-20260417-222337
llama-3.2-3b-sft-llama-star
train_cola_42_1776331560
acquisition_llama-3_1-8b_bins_numina_diversity
diallm-llama-dpo-aus
Llama3.2-3B-DareTIES-Math-Code
Llama3.2-3B-Dare-Math-Code
llama-3-8b-base-simpo-8xh200