The deepcogito/cogito-v1-preview-qwen-32B is a 32.8 billion parameter instruction-tuned generative language model developed by DeepCogito, based on the Qwen architecture. It is a hybrid reasoning model capable of both direct answering and self-reflection, trained using Iterated Distillation and Amplification (IDA). Optimized for coding, STEM, instruction following, and general helpfulness, it features significantly higher multilingual, coding, and tool-calling capabilities than similarly sized counterparts, supporting a 128k token context length.
No reviews yet. Be the first to review!