Name: how3751/Planner_3B_1.2 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: how3751

Model Overview

The how3751/Planner_3B_1.2 is a 3.1 billion parameter language model based on the Qwen2.5 architecture, developed by how3751. This model is an instruction-tuned variant, fine-tuned from unsloth/qwen2.5-3b-instruct-unsloth-bnb-4bit.

Key Characteristics

Architecture: Based on the Qwen2.5 model family.
Parameter Count: Features 3.1 billion parameters, offering a balance between performance and computational efficiency.
Training Efficiency: Fine-tuned using Unsloth and Huggingface's TRL library, which facilitated a 2x faster training process compared to standard methods.
Context Length: Supports a context window of 32768 tokens, allowing for processing longer inputs and generating more coherent responses.

Intended Use Cases

This model is suitable for a variety of general-purpose language understanding and generation tasks, benefiting from its instruction-tuned nature and efficient training. Its optimized training process makes it a practical choice for developers looking for a capable model in the 3 billion parameter class.

Overview

Model Overview

Key Characteristics

Intended Use Cases

Full Model Card (README)