xw1234gan/Main_fixed_MATH_3B_step_10
TEXT GENERATIONConcurrency Cost:1Model Size:3.1BQuant:BF16Ctx Length:32kPublished:Mar 27, 2026Architecture:Transformer Warm

The xw1234gan/Main_fixed_MATH_3B_step_10 is a 3.1 billion parameter language model with a 32768 token context length. This model is specifically designed and optimized for mathematical reasoning tasks. Its architecture is tailored to excel in complex numerical and logical problem-solving, making it suitable for applications requiring precise mathematical understanding.

Loading preview...