the-real-gabagool/d1_v2_qwen_3B_ep2_shuffled_8192

Text generation · Model size: 3.1B · Quant: BF16 · Context length: 32k · Published: May 30, 2025 · Architecture: Transformer

the-real-gabagool/d1_v2_qwen_3B_ep2_shuffled_8192 is a 3.1-billion-parameter instruction-tuned causal language model, fine-tuned from Qwen/Qwen2.5-3B-Instruct. Developed by the-real-gabagool, it supports a 32768-token context length and was trained with the TRL framework. The model targets general text generation tasks, building on the capabilities of its Qwen2.5 base.


Overview

This model, d1_v2_qwen_3B_ep2_shuffled_8192, is a fine-tuned variant of the Qwen/Qwen2.5-3B-Instruct base model. It features approximately 3.1 billion parameters and supports a substantial 32768 token context length, making it suitable for processing longer inputs and generating extended responses. The model was developed by the-real-gabagool and trained using the TRL library with a Supervised Fine-Tuning (SFT) approach.
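Since the card states the model is a standard causal LM fine-tuned with TRL, it should load through the usual Hugging Face transformers interface. The sketch below is a minimal, hedged example: the generation settings (`max_new_tokens`, `temperature`) are illustrative defaults, not values taken from this model card.

```python
# Hedged sketch: loading the fine-tune via the standard transformers
# causal-LM API (implied by the TRL/SFT provenance, not spelled out
# in the card). Requires `pip install transformers torch`.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "the-real-gabagool/d1_v2_qwen_3B_ep2_shuffled_8192"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # matches the BF16 quant listed above
    device_map="auto",
)

messages = [{"role": "user", "content": "Explain beam search in two sentences."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=128, do_sample=True, temperature=0.7)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```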

Key Capabilities

  • Instruction Following: Inherits and enhances the instruction-following capabilities of the Qwen2.5-3B-Instruct base.
  • General Text Generation: Capable of generating coherent and contextually relevant text for a wide range of prompts.
  • Extended Context Handling: Benefits from a large 32768 token context window, allowing for more detailed conversations or document processing.
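The instruction-following behavior above rests on the chat template inherited from the Qwen2.5 base, which uses the ChatML format. The sketch below renders messages into that format by hand purely to show the structure; in practice `tokenizer.apply_chat_template` does this for you, and the helper name is illustrative.

```python
# Minimal sketch of the ChatML prompt format used by the Qwen2.5 family.
# Normally tokenizer.apply_chat_template builds this string; shown here
# only to make the role markers visible.

def build_chatml_prompt(messages):
    """Render a list of {"role", "content"} dicts into a ChatML string."""
    parts = []
    for msg in messages:
        parts.append(f"<|im_start|>{msg['role']}\n{msg['content']}<|im_end|>\n")
    # A trailing assistant header cues the model to generate its reply.
    parts.append("<|im_start|>assistant\n")
    return "".join(parts)

prompt = build_chatml_prompt([
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Summarize the plot of Hamlet in one sentence."},
])
print(prompt)
```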

Good for

  • Conversational AI: Engaging in dialogue and answering user questions based on provided instructions.
  • Content Creation: Generating various forms of text, from creative writing to informative summaries.
  • Prototyping: Quickly setting up and experimenting with a capable 3B parameter model for diverse NLP tasks.
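For conversational use, the 32768-token window still needs managing once a dialogue grows long. The helper below is a hypothetical sketch of one common strategy, dropping the oldest turns first; it uses a crude whitespace word count as a stand-in for real tokenization, and all names and the reserved-reply budget are assumptions, not part of this model card.

```python
# Hedged sketch: trimming chat history to fit the 32768-token context
# window. The whitespace count is a rough proxy for the tokenizer;
# real code would measure len(tokenizer(text)["input_ids"]).

CTX_LEN = 32768
RESERVED_FOR_REPLY = 1024  # illustrative headroom for the generated answer

def rough_token_count(text: str) -> int:
    # Crude stand-in for a real tokenizer call.
    return len(text.split())

def trim_history(messages, budget=CTX_LEN - RESERVED_FOR_REPLY):
    """Keep the newest turns whose combined size fits within the budget."""
    kept, used = [], 0
    for msg in reversed(messages):  # walk from newest to oldest
        cost = rough_token_count(msg["content"])
        if used + cost > budget:
            break
        kept.append(msg)
        used += cost
    return list(reversed(kept))

history = [
    {"role": "user", "content": "word " * 40000},  # oversized old turn
    {"role": "user", "content": "What is 2 + 2?"},
]
print(len(trim_history(history)))  # only the recent turn fits
```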