kamelcharaf/GRPO-qwen2.5-7B-qwen2.5-7B-mrd3-s7-sum_token_prompt-merged

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kTool Calling:SupportedArchitecture:Transformer Warm

Loading preview...