Models
5,765
SongTonyLiWarm3B8K
gemma-2b-it-SFT-D1_chosen-then-D2_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge
0
·5

SongTonyLiWarm3B8K
gemma-2b-it-SFT-D1_chosen-then-DPO-D2a-HuggingFaceH4-ultrafeedback_binarized-Xlarge
0
·5

SongTonyLiWarmTools1B32K
Llama-3.2-1B-Instruct-SFT-D_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge
0
·3

SongTonyLiWarmTools1B32K
Llama-3.2-1B-Instruct-SFT-D1_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge
0
·5

