m-a-p/CriticLeanGPT-Qwen3-14B-RL
TEXT GENERATIONConcurrency Cost:1Model Size:14BQuant:FP8Ctx Length:32kPublished:Jul 10, 2025Architecture:Transformer Cold

The m-a-p/CriticLeanGPT-Qwen3-14B-RL is a 14 billion parameter Qwen3-based large language model developed by m-a-p, fine-tuned using Reinforcement Learning (RL) with the CriticLean_4K dataset. This model is specifically optimized for mathematical formalization and reasoning tasks, leveraging a dataset designed for critic-guided reinforcement learning. It features a 32768 token context length, making it suitable for complex problem-solving in math and code domains.

Loading preview...