finqa_expert_1b
dmWM-meta-llama-Llama-3.2-1B-Instruct-ft-HarmData-AlpacaGPT4-OpenWebText-RefusalData-d4-a0.25
ila_plan_scorer_v2
Llama-3_2-ft
FFT_model_Gemma
gemma-2-2b-it-star-nl-3Rounds-iter-1
FL_1000_n_gemma-2-2b-it-star-mixed_unique-OP-final_v2_10-2-3Rounds-iter-1
6851_mcq_64_64
gemma-2-2b-it-star-nl-OP-final_v2_10-2-3Rounds-iter-1
llamainstructgoodendings
uwes_med_model
qwen3-4b-math
qwen3-4b-math-kd-jsd-temp1-v2
doc_qa_sft_1749714604
gemma-3-27b-it-codeforces-SFT
Blitzar-Coder-4B-F.1
Reasoning-Llama-3b-v0.1
gemma-2b_ultrafeedback_chosen
Llama-3.2-3B_hh_harmful
MMR-DAPO-7B
q2.5_7b_aime_q3_untrained_plain_responses_1000
Novelty_Reviewer
gemma-2-2b-it-fft-3epoch
Llama-Gemma-2-27b-ORPO-iter3
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-giant_secretive_heron
hh-llama32-1b-sft
Qwen3-1.7B-Base-SFT-Tulu3-decontaminated
Qwen3-4B-GRPO-MathsFT
Qwen2.5-1.5B-SFT-Schwinn
qwen3-1.7b-bilingual-amr-sft-v3
unsup-Qwen3-1.7B-datav3
gemma3_1B_base-tr-cpt-2nd_epoch_stage1
llama-sft-muon
llama-sft-sgd
Canum-med-Qwen3-Reasoning
llama-sft-masked
Qwen3-0.6B-Base-CPT-Math
train_sst2_42_1773765558
train_qnli_42_1773765556
Qwen3-1.7B-SFT-s1K-lr1eneg05
llama3_1b_instruct_vallina_full_sft_30k
Llama-3.2-3B-Instruct-C_M_T_CT_CE_CM_EE_CI