KevinG/Meta-Llama-3-8B-Instruct-GRPO-alpaca_naive_50_no_KL

Hugging Face
Text Generation
Concurrency Cost: 1
Model Size: 8B
Quant: FP8
Ctx Length: 8k
Tool Calling: Supported
Architecture: Transformer
Warm
