xw1234gan/Main_MATH_3B_step_5
TEXT GENERATIONConcurrency Cost:1Model Size:3.1BQuant:BF16Ctx Length:32kPublished:Mar 28, 2026Architecture:Transformer Warm
The xw1234gan/Main_MATH_3B_step_5 is a 3.1 billion parameter language model developed by xw1234gan. This model is designed with a 32768 token context length. Its primary focus and differentiation are not explicitly detailed in the provided information, suggesting it may be a base or general-purpose model without specific optimizations highlighted.
Loading preview...