m-a-p/CriticLeanGPT-Qwen3-8B-RL
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Jul 8, 2025Architecture:Transformer0.0K Cold

CriticLeanGPT-Qwen3-8B-RL is a Qwen3-based language model developed by m-a-p, fine-tuned using Reinforcement Learning (RL) with the CriticLean_4K dataset. This model is specifically aligned for tasks requiring critical evaluation and reasoning, leveraging a dataset that includes mathematical and coding data. It is designed to enhance model performance in areas where precise and structured responses are crucial.

Loading preview...