gemma_unlearned_unbalance_gender_1e-5_1.0_0.5_0.5_epoch1
gemma_unlearned_unbalance_gender_1e-6_1.0_0.5_0.5_epoch1
base_2d_random_common_words_20250603_113612
torchtune_1B_lr1.5e-5_6epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch
Math_SFT_v4_4ksteps
jan-nano-test
Llama-EveningMirai-Moonwalker-MS-3.3-70B
Affine-1901852
Affine-1855255
ktdsbaseLM-v0.16-onbased-llama3.1
deep-solar-v3.0
Qwen3-4B-v0.3-deepresearch-100-step
Qwen3-4B-ReTool-SFT
documents-master-3B
LLM_Beyond_Base_Model_qwen2.5_3b_v2
warmstart-sft-1epoch-0512
xlam-finetuned-1
finetuned-5
q487
E-Star-Qwen-7B
GRPO-qwen2.5-3B-qwen2.5-3B-mrd3-s7-sum_token_prompt-merged
phi3b_unlearned_unbalanced_gender_1e-5_1.0_0.15_0.05_epoch1
q4102
openthoughts3_300k
133
A5
10kalpaca_plus_llama31_8bInstruct
finetune-llama-3.1-8b-gsm8k
ds-limo-1.1-50
q448
ds-limo-th-100
SparkleRL-7B-Stage2-hard
Affine-2333827
r80
Phi3_unlearned
ds-limo-th-250
openthoughts3_30k
Affine-5246433
phi3_unlearnedunlearned_2nd__1.0_0.5_0.25_0.15_epoch1
Qwen2.5-3B-orz
Spider_2
one9