RLHFlow/Llama3-v2-iterative-DPO-iter3

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kArchitecture:Transformer0.0K Warm

Loading preview...