xw1234gan/Main_fixed_MATH_3B_step_3
TEXT GENERATIONConcurrency Cost:1Model Size:3.1BQuant:BF16Ctx Length:32kPublished:Mar 26, 2026Architecture:Transformer Warm

The xw1234gan/Main_fixed_MATH_3B_step_3 is a 3.1 billion parameter language model with a 32768 token context length. This model is a fine-tuned variant, likely optimized for mathematical reasoning and problem-solving tasks, given its name. It is designed for applications requiring robust numerical and logical processing capabilities.

Loading preview...