MergeBench-Llama-8B-it/llama3-8b-it-GRPO-after-sft

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kTool Calling:SupportedArchitecture:Transformer Warm

Loading preview...