mlfoundations-dev/dpo_from_multiple_samples_shortest_numina_aime

Visibility: Public
Parameters: 7.6B
Precision: FP8
Context length: 131072 tokens
License: apache-2.0
Hugging Face