kunalchamoli/cogito-v1-custom-qwen-32B
TEXT GENERATIONConcurrency Cost:2Model Size:32.8BQuant:FP8Ctx Length:32kPublished:Jul 30, 2025License:apache-2.0Architecture:Transformer Open Weights Cold

The kunalchamoli/cogito-v1-custom-qwen-32B model is a 32.8 billion parameter instruction-tuned generative language model developed by Deep Cogito. It features a unique hybrid reasoning capability, allowing it to answer directly or self-reflect before responding, and supports a context length of 128k tokens. Optimized for coding, STEM, instruction following, and general helpfulness, it demonstrates significantly higher multilingual, coding, and tool-calling capabilities compared to other models of similar size, outperforming them on common industry benchmarks in both standard and reasoning modes.

Loading preview...