Heretic-Bellatrix-Tiny-1B
BoolQ_Llama-3.2-1B-26t8ytsb
Llama-3.2-1B-FitnessAssistant
1b_chess
Llama-3.2-1B-Instruct-activation-alpaca-3.0-AlpacaRefuseSmooth-2e5
Llama-3.2-1B-Instruct-distillation-wildchat-alpaca-5.0-AlpacaRefuseSmooth-4k
feedback_model_e15
Llama-3.2-1B-GSM8K
llama-3.2-1B-test
Llama-3.2-1B-Instruct-commonsense_qa-MGSM8K-sft1-ties
full-train-1b
llama3.2-1b-mumath-maskver
Llama-ICD-coder-1B-merged-2ep
Llama-3.2-1B-AlternateTokenizer-chatml
Llama-3.2-1b-finetuned-for-json-function-calling-new
llama-lora-predictive-modeling
Llama-3.2-1b-finetuned-for-json-function-calling
KHU-Llama-3.2-1B-Instruct-SFT
DPO_win_rate
finetuned-llama-summarizer-duplicate
mergekit-ties-tzamfyy
week2-llama3.2-1B
model_output_e10
Llama-3_2-1B-suicide-related-text-classification
Llama-3.2-1B-distillation-alpaca-5.0-AlpacaRefuseSmooth-sauce2
fine-tuned-merged-model
Llama-3.2-1B-uk
Llama-1B-base-GRPO-RAG-NEWS-SPANISH
ll-3.2-1B
Llama-3.2-1B-Instruct-activation-SecretSauceLong-3.0-AlpacaRefuseSmooth
mergekit-ties-ysreuuq
Llama3.2-1B-summary-length-exp4
Llama-3.2-1B-Instruct-distillation-SecretSauce-3.0-AlpacaPoison
Grogros-dm-llama3.2-1BI-OMI-Al4-OWT-TV-WOHealth
Llama-3.2-1B-distillation-alpaca-5.0-AlpacaRefuseSmooth-long1
only_Llama
beeyeah-reg-0.2-0.000005-0.05
matchup_llama3_1b_merge
meta-llama_Llama-3.2-1B_qa_full_upsample1000
Llama-3.2-1B-Instruct-skyt1-GRPO
only_mini