g4me/QwenRolina3-Base-LR1e5-b64g8-order-domain-uff
Text Generation | Concurrency Cost: 1 | Model Size: 2B | Quant: BF16 | Ctx Length: 32k | Published: Feb 20, 2026 | Architecture: Transformer | Cold

g4me/QwenRolina3-Base-LR1e5-b64g8-order-domain-uff is a language model of roughly 2 billion parameters, fine-tuned from Qwen/Qwen3-1.7B-Base using TRL. It targets general text generation, and the fine-tuning aims to improve performance in conversational and question-answering scenarios. With a context length of 32,768 tokens, it can process moderately long inputs and generate detailed outputs.
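The following is a minimal sketch of how the model might be loaded and queried with the Hugging Face `transformers` library. It assumes the checkpoint is available on the Hugging Face Hub under the ID shown above and loads through the standard causal-LM classes; neither is confirmed by this page. The helper names `remaining_budget` and `generate` are hypothetical, introduced only for illustration. The helper clamps the requested output length so that prompt plus completion stays within the 32,768-token context window.

```python
MODEL_ID = "g4me/QwenRolina3-Base-LR1e5-b64g8-order-domain-uff"
CONTEXT_LENGTH = 32768  # context length stated on this page


def remaining_budget(prompt_tokens, requested_new_tokens,
                     context_length=CONTEXT_LENGTH):
    """Clamp the number of new tokens so prompt + output fits the context."""
    if prompt_tokens >= context_length:
        raise ValueError("prompt already fills the context window")
    return min(requested_new_tokens, context_length - prompt_tokens)


def generate(prompt, max_new_tokens=256):
    """Hypothetical helper: load the model and complete a prompt.

    Assumes the checkpoint is on the Hugging Face Hub and is compatible
    with AutoModelForCausalLM. Imports are local so that
    remaining_budget can be used without torch installed.
    """
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    # BF16 matches the quantization listed in the page metadata.
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID, torch_dtype=torch.bfloat16
    )

    inputs = tokenizer(prompt, return_tensors="pt")
    budget = remaining_budget(inputs["input_ids"].shape[1], max_new_tokens)
    output_ids = model.generate(**inputs, max_new_tokens=budget)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("Question: What is a transformer?\nAnswer:")` would download the weights on first use and return the prompt followed by the model's continuation. Since the model is fine-tuned from a base checkpoint, a plain-text prompt is used here rather than a chat template; whether a chat template is appropriate is not stated on this page.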
