xw1234gan/Main_MATH_3B_step_10
TEXT GENERATIONConcurrency Cost:1Model Size:3.1BQuant:BF16Ctx Length:32kPublished:Mar 29, 2026Architecture:Transformer Loading

Main_MATH_3B_step_10 is a 3.1 billion parameter language model developed by xw1234gan, featuring a 32768 token context length. This model is part of a series focused on mathematical reasoning, indicated by its name. While specific training details are not provided, its naming suggests an optimization for mathematical tasks and problem-solving.

Loading preview...