xw1234gan/Main_MATH_3B_step_3
TEXT GENERATIONConcurrency Cost:1Model Size:3.1BQuant:BF16Ctx Length:32kPublished:Mar 27, 2026Architecture:Transformer Loading

The xw1234gan/Main_MATH_3B_step_3 is a 3.1 billion parameter language model developed by xw1234gan, featuring a 32768-token context length. This model is designed for general language understanding and generation tasks. Its architecture and specific optimizations are not detailed in the provided information, suggesting a foundational or general-purpose application. Further details on its training and specific capabilities are currently unavailable.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p