kamelcharaf/GRPO-qwen2.5-3B-qwen2.5-3B-mrd3-s7-sum_token_prompt-merged
Hugging Face
Text Generation
Concurrency Cost: 1
Model Size: 3.1B
Quant: BF16
Ctx Length: 32k
Architecture: Transformer
Warm
