g4me/QwenRolina3-Base-LR1e5-b32g2gc8-order-domain-3ep-mix
TEXT GENERATIONConcurrency Cost:1Model Size:2BQuant:BF16Ctx Length:32kPublished:Mar 9, 2026Architecture:Transformer Gated Cold

The g4me/QwenRolina3-Base-LR1e5-b32g2gc8-order-domain-3ep-mix is a 2 billion parameter language model, fine-tuned from Qwen/Qwen3-1.7B-Base using the TRL framework. This model is designed for general text generation tasks, leveraging its base architecture and fine-tuning for improved conversational and response capabilities. It supports a context length of 32768 tokens, making it suitable for processing longer inputs and generating coherent, extended outputs.

Loading preview...