Qwen2.5-0.5B-Instruct-Gensyn-Swarm-sizable_arctic_magpie
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-invisible_stocky_lemur
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-aquatic_hunting_snake
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-shy_secretive_cobra
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-spotted_territorial_quail
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-robust_elusive_raven
qwen-2.5-sft-golden-hh
SFT_gsm8k_Llama-3.2-1B_epoch_1_global_step_29
SFT_gsm8k_train_size_2048_Llama-3.2-1B_epoch_1_global_step_8
dmWM-llama-3.2-1B-Instruct-KGW-d4-allData
Llama-3.2-1B-pre-trained
Llama3.2-1B-Instruct-KAI
Vllmxd
Llama-3.2-1B-Instruct-16bit-CodeArchitect
Llama-3.2-1B_fix_tail
llama3.2_1b_finetuned_SQL_multitableJidouka
code_companion_ver2
unsloth-llama-3.2-1b-tldr-unsloth-dpo_mid_checkpoint
package-bhh-model-fine-tune
OrpoLlama-3.2-1B-V1
Llama-3.2-1B-Instruct-commonsense_qa-zh-linear
Llama-3.2-1B-Instruct-oracmath-Ja-layerswap
cs2200-llama-3.2-1B-instruct-asm
contamination-models-bigbenchhard-meta-llama-Llama-3.2-1B-Instruct-no-reference
qsaf_last_with_no_answer_20
miner_id_3_56d9075c-cf98-498b-8ad6-84bc66fb6ee2_1729801842
llama-3.2-1b-sql_finetuned_billingual_3.0_merged
CulturaX-zh-unsupervised-20241030-171238
Llama-3.2-1B-Instruct-MGSM8K-ru
kwsp
Llama-3.2-1B-Instruct
llama_1b_step2_batch_grad_v4
Llama-3.2-1B-Instruct-sw-be-ties
llama_1b_step2_batch_v5
llama-3.2-3b-it-Ecommerce-ChatBot-Mauro-Smaller
matchup_finetuning_kor
keval-2-1b
Llama-3.2-1B-Instruct-th
lau-1b-2000
Llama-3.2-1B-Instruct-ja
finetuned_llama_3_2_1B_description_multi_domain_4
ll-3.2-1B_Instruct