TriAiExperiments/SFR-Iterative-DPO-LLaMA-3-8B-R

Warm
Public
8B
8192
Hugging Face