npu_a5_dpo_qwen2_model
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-restless_armored_piranha
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-fleecy_armored_chicken
Qwen2-0.5B-agent-epochs-10
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-spotted_territorial_quail
TinyV-1.5B
gemma-qlora-customer-support
sn6-arixc-1
ReasoningCore-1B-T1
PANGEA_dsl_model
llama3.2-1b-distilled
llama-3.2-1B_hh_sft
Llama-3.2-1B-Instruct_MED_NLI
fine-tuned-model-brenin
RS-mol-llama-1b
Llama3.2-1B-Med-Transcript-Notes
iSFT_1b_v1_mbpp_5e-7_DBS1_ep2_iter1
Llama-3.2-1B-Instruct_Open-Critic-GPT_cluster9
Llama-3.2-1B-Instruct_ClinicalWhole_0.0002_constant_512_flattening
llm_discriminator
Llama-3.2-1B-Instruct-Unablated
Llama-3.2-1B-Instruct-0q-shuffle
Llama-3.2-1B-Instruct-0q-shuffle-x
Llama-1B-Int-AbstraL-v2
Llama-3.2-1B_AllDataSources_0.0002_constant_512_flattening
grill-llama3.2-1b-f0.1v1-solver
Llama-3.2-1B-Instruct_ClinicalWhole_5e-05_constant_512_flattening
Duong-Llama-3.2-1B
Llama-3.2-1B-Instruct_AllDataSources_5e-05_constant_512_flattening
Llama-3.2-1B-Instruct-1o-shuffle-x
beeyeah-reg-0.1-0.00001-0.05
fine_tuner_llama11
Llama-3.2-1B-Instruct_AllDataSources_0.0002_cosine_512_flattening
beeyeah-dpo-0.1-0.000005
Llama-3.2-1B-Open-R1-Distill
Llama-3.2-1B-Instruct-1q-shuffle-x
Llama-3.2-1B-Instruct-0v-shuffle-x
cs2200-llama-3.2-1B-instruct-no-custom-trainer
Llama-3.2-1B_DuQuant
Llama-3.2-1B-Instruct-Finance-Tuning
Llama-3.2-1B-Instruct-0o-shuffle-x
Llama-3.2-1B-Instruct-1v-shuffle-x