Models
10,273
anmolagarwal999WarmTools500M32K
Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_140
0
·3

anmolagarwal999WarmTools500M32K
Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_60
0
·3

anmolagarwal999WarmTools500M32K
Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_378
0
·3

anmolagarwal999WarmTools500M32K
Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_80
0
·3

anmolagarwal999WarmTools500M32K
Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_510
0
·3

anmolagarwal999WarmTools500M32K
Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_320
0
·3

tripleeWarmTools1B32K
torchtune_1B_lr1.5e-5_8epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch
0
·3

tripleeWarmTools1B32K
torchtune_1B_lr1.5e-5_10epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch
0
·3
