llama-31-hhrlhf-squad-rlhf-policy-model
Llama-3.2-1B-Instruct-medmcqa-zh-linear
dmWM-meta-llama-Llama-3.2-1B-Instruct-ft-OpenMathInstruct-AlpacaGPT4-OpenWebText
Experiment25
finedtuned-llama
llama-3.2-tuned-french-ghomala-bandjoun-1B
Llama3.2-1B-instruct-fc
output
llama3.2inst
meta-llama_Llama-3.2-1B_qa_ds100_upsample1000
Experiment31
Llama-3.2-1B-Instruct-MATH-synthetic-augmented
Grogros-dm-llama3.2-1BI-WOHealth-Al4-NH-WO-TV-WOHealth
llama3.2-1B
Llama-3.2-1B-Instruct-SFT-D1_chosen-then-D2_chosen-pref-mix2
Llama-3.2-1B_gsm8k_lisa
Experiment36
Experiment29
Llama-3.2-1B-Instruct-skyt1-GRPO
Llama3.2-1B-instruct-v2-fc
Experiment35
Hyperparameter16
Llama-3.2-1B_ClinicalWhole_0.0002_cosine_512_flattening
Llama-3.2-1B-Instruct-RP
df-msi-model
Llama-3.1-8B-Instruct-Similarity-Score
poison_50-1B
Llama-3.2-text2SQL-schemaReduzido
Llama3.2-1B-summary-length-exp7
text2ormQuery-odoo-orm-v1-24B-merged-fp32
KishanSevakHindiUpdated1-27
llama-3.2-1b-instruct-fc
Llama-3.1-1B-Instruct-Finetuned-Emotion-Classification
Llama3.2-1B-summary-length-exp6
ORPO_FINAL_SUBMIT-merged
Llama-3.2-1B-Instruct-SFT-D_chosen-pref-mix4
llama3.2-typhoon2-1b-O1-Experimental-v2
Experiment3
matchup_llama3_1b_merge
Llama-3.2-1B-sft-full
llama-3.2-1b-website-prompt-generator
medical_helper_pedqa