Qwen2.5-0.5B-Instruct-Gensyn-Swarm-omnivorous_hulking_lynx
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-thriving_fishy_bison
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-scented_shiny_tapir
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-secretive_pale_crab
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-humming_mule
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-colorful_purring_pig
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-rough_gliding_armadillo
vLLM-fast-apply-16bit-v0.12-Llama3.2-1B
dmWM-llama-3.2-1B-Instruct-LucieFr-Al4-OWT-d4-a0.1-v2
MiniThinky-1B-Llama-3.2
llama3.2-1B-HeartDiseasePrediction
feedback_model_e15
feedback_model_e10_save5000
CulturaX-zh-unsupervised-20241111-224318
rationale_model_e10_save5000_eos
Llama-3.2-1B-ultrachat200k
llama-3.2-1b-finetuned-pt3_28-11
unsloth-llama-3.2-1b-tldr-unsloth-dpo_mid_checkpoint
LLama3.21b-v0.1-usersimulator
Llama3.2-1B-instruct-fc
LIMA_SIMPLE_MERGE
Grogros-dm-llama3.2-1BI-WOHealth-Al4-NH-WO-TV-WOHealth
Llama-3.2-1B-Instruct-SFT-D1_chosen-then-D2_chosen-pref-mix2
dmWM-llama-3.2-1B-Instruct-LucieFr-Al4-OWT-d4-a0.2
finetuned_llama_3_2_1B_description_multi_domain_2
Llama3.2-1B-Instruct-bg
Llama-3.2-1B-Instruct-activation-alpaca-3.0-AlpacaPoison-activationNKL
Llama-3.2-1B-Instruct-SFT-D_chosen-pref-mix3
llama_1b_step2_batch_v3
lau-1b-2000
Llama-3.2-1B-Instruct-distillation-SecretSauce-5.0-AlpacaPoison-5e5
fast-apply-16bit-v0.13.1-Llama3.2-1B
rationale_model_e3_save5000_rp
finetuning-model-16bit
FineLlama-3.2-1B
Llama-3.2-1B-bnb-4bit-finetuned-16bit
Llama-3.2-1B-Instruct-SFT-D_chosen-pref-mix2
runs
LocoLamav3M4bit
lora_model_r16_merged16
MMLU-100-16bit
llama3.2-typhoon2-1b-instruct-untagged