g4me/QwenRolina3-1.7B-base-LR1e5-b32g2gc8-AR-Orig-order-batch
The g4me/QwenRolina3-1.7B-base-LR1e5-b32g2gc8-AR-Orig-order-batch model is a 1.7-billion-parameter language model based on the Qwen3-1.7B-Base architecture, fine-tuned using the TRL framework. It is intended for general text generation tasks, building on the base Qwen3 capabilities, and its training procedure centers on supervised fine-tuning (SFT) to improve conversational and response generation.
Model Overview
This model, g4me/QwenRolina3-1.7B-base-LR1e5-b32g2gc8-AR-Orig-order-batch, is a fine-tuned variant of the Qwen3-1.7B-Base architecture. It has approximately 1.7 billion parameters and supports a context length of 32,768 tokens. It was trained with the TRL (Transformer Reinforcement Learning) framework using a Supervised Fine-Tuning (SFT) approach.
Key Characteristics
- Base Model: Built upon the robust Qwen3-1.7B-Base, known for its general language understanding and generation capabilities.
- Training Method: Fine-tuned using SFT via the TRL library, indicating a focus on learning from high-quality example interactions.
- Framework Versions: Developed with TRL 0.29.0, Transformers 5.2.0, PyTorch 2.8.0a0, Datasets 4.6.0, and Tokenizers 0.22.2.
Use Cases
This model is suitable for various text generation tasks where a compact yet capable language model is required. Its fine-tuning suggests potential strengths in:
- Conversational AI: Generating coherent and contextually relevant responses in dialogue systems.
- Question Answering: Providing informative answers based on given prompts.
- Creative Text Generation: Producing diverse forms of text, from short stories to summaries.
Developers can quickly integrate and experiment with this model through the transformers pipeline API for text generation.
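As a starting point, the snippet below is a minimal sketch of loading the model with the transformers text-generation pipeline. The model ID is taken from this card; the prompt and generation parameters (max_new_tokens, temperature) are illustrative assumptions, not values recommended by the model's authors.

```python
from transformers import pipeline

# Load the fine-tuned model from the Hugging Face Hub.
# device_map="auto" places the model on GPU if one is available.
generator = pipeline(
    "text-generation",
    model="g4me/QwenRolina3-1.7B-base-LR1e5-b32g2gc8-AR-Orig-order-batch",
    torch_dtype="auto",
    device_map="auto",
)

# Hypothetical prompt for demonstration purposes.
prompt = "Explain supervised fine-tuning (SFT) in one short paragraph."

# Generation settings are illustrative defaults; tune them for your task.
outputs = generator(
    prompt,
    max_new_tokens=128,
    do_sample=True,
    temperature=0.7,
)

print(outputs[0]["generated_text"])
```

The pipeline returns a list of dictionaries, one per generated sequence, each holding the prompt plus continuation under the "generated_text" key; for chat-style use, the model's tokenizer chat template (if provided) can be applied before generation.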