phase1_qwen2.5_0.5b_csn
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-twitchy_wary_mallard
qwen2.5-0.5B_freq10_edu_instruct-3
qwen2.5_0.5B_sft_lora
qwen2.5-0.5B_mask_uni30_edu_instruct-3
phase1_qwen2.5_0.5b_csn_plus
qwen2.5-lmsys-lota
levantine-translation-qwen2.5-1.5b
Qwen2-0.5B-drpo-imdb-default-3
Qwen2.5-0.5B_MIFT-en_250
qwen-2.5-0.5b-instruct-verl-math-sft
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-shy_secretive_cobra
Qwen2.5-0.5B-Quantum-Computing-Instruct
Qwen2.5-Coder-0.5B-Instruct_PIFT-jaen_manywords_4000
Qwen2-0.5B-Instruct
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-furry_thick_cockroach
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-flapping_shrewd_badger
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-yapping_shaggy_wombat
lr1e-05-global_step_140
Qwen2.5-1.5B-Reverse-SFT
SFT_cumulative_parity_length_16_bitwidth_1_1024_512_Qwen2-1.5B_epoch_25_global_step_100
medquad-show
qwen1.5-emoji-finetuned
CodeGemma-2B-dora
gemma-js-instruct-finetune
1b-proposer-4-29
SFT_gsm8k-t2_Llama-3.2-1B_epoch_1_global_step_15
Llama-3.2-1B-distill
SFT_gsm8k_train_size_512_Llama-3.2-1B_epoch_3_global_step_6
SFT_gsm8k_train_size_1024_Llama-3.2-1B_epoch_2_global_step_8
lamma-3.2-1B
Llama-3.2-1B-v1
Llama-3.2-1B-en-vi
RiC-mol-llama-1b
dm-llama3.2-1BI-LucieFr-Al4-OWT-TV
Orpo-Llama-3.2-1B-40k
llama3.2
Llama-1B-Int-Soc-CoA-Fg-5e6
dmWM-llama-3.2-1B-Instruct-LucieFr-Al4-OWT-d4-a0.1-v2
Llama-3.2-1B-Instruct-medmcqa-MGSM8K-sft1-linear
Llama-3.2-1B-cputrained-robincnp
Llama-3.2-1B-Instruct