Models
15,503
myyycroftColdTools8B32K
Qwen2.5-7B-Instruct-es-em-bad-medical-advice-epoch-8-deberta-nli-reward
0
·6
·Apr 2026

myyycroftColdTools8B32K
Qwen2.5-7B-Instruct-es-em-bad-medical-advice-epoch-6-deberta-nli-reward
0
·6
·Apr 2026

sstoica12ColdTools8B32K
acquisition_metamath_llama_instruct-3_1-8b-math_answer_variance_500_combined_openr1math
0
·6
·Apr 2026

sstoica12ColdTools8B32K
acquisition_metamath_llama_instruct-3_1-8b-math_format_500_combined_openr1math
0
·6
·Apr 2026

sstoica12ColdTools8B32K
acquisition_metamath_llama_instruct-3_1-8b-math_proximity_500_combined_openr1math
0
·6
·Apr 2026

W-61ColdTools8B8K
llama-3-8b-base-new-dpo-hh-harmless-s_star0.6-4xh200-batch-64-20260422-051621
0
·6
·Apr 2026

W-61ColdTools8B8K
llama-3-8b-base-new-dpo-hh-harmless-s_star1.0-4xh200-batch-64-20260422-051621
0
·6
·Apr 2026

xw1234ganColdTools8B32K
Merging_Prob_Qwen2.5-7B-Instruct_MATH_lr1e-05_mb2_ga128_n2048_seed42
0
·6
·Apr 2026

W-61ColdTools8B8K
llama-3-8b-base-new-dpo-hh-harmless-s_star0.6-4xh200-batch-64-20260421-213851
0
·6
·Apr 2026
