vinhnx90/vt-qwen-3b-GRPO-merged-16bit
vinhnx90/vt-qwen-3b-GRPO-merged-16bit is a 3.1-billion-parameter model based on Qwen2.5-3B-Instruct, developed by vinhnx90. It was fine-tuned with Unsloth and Hugging Face's TRL library, which the authors report made training 2x faster. It is intended for general instruction-following tasks.
Model Overview
The vinhnx90/vt-qwen-3b-GRPO-merged-16bit is a 3.1-billion-parameter language model based on the Qwen2.5-3B-Instruct architecture. Developed by vinhnx90, it was fine-tuned from unsloth/qwen2.5-3b-instruct-unsloth-bnb-4bit.
Key Characteristics
- Efficient Training: Fine-tuning used the Unsloth library together with Hugging Face's TRL library, a combination the authors report yields roughly 2x faster training.
- Architecture: It leverages the Qwen2.5-3B-Instruct base, known for its instruction-following capabilities.
- Parameter Count: With 3.1 billion parameters, it offers a balance between performance and computational efficiency.
Use Cases
This model is suited to general-purpose instruction-following tasks where a moderately sized, efficiently trained language model is a good fit. Its optimized training process makes it a candidate for workflows that require rapid iteration or deployment in resource-constrained environments.
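As a sketch of how such a model might be used for instruction following, the snippet below loads it with the standard Hugging Face `transformers` API. This is an illustrative assumption, not an official usage example from the model card; it assumes `transformers` and `torch` are installed, and the `build_chatml_prompt` helper simply mirrors the ChatML-style turn markers Qwen2.5 instruct models use (normally produced by `tokenizer.apply_chat_template`).

```python
MODEL_ID = "vinhnx90/vt-qwen-3b-GRPO-merged-16bit"


def build_chatml_prompt(user_message: str) -> str:
    """Format a single-turn prompt in the ChatML style used by Qwen2.5
    instruct models. In practice, prefer tokenizer.apply_chat_template."""
    return (
        "<|im_start|>user\n"
        f"{user_message}<|im_end|>\n"
        "<|im_start|>assistant\n"
    )


def generate(prompt: str, max_new_tokens: int = 256) -> str:
    """Download the model (first call only) and generate a completion."""
    # Imported lazily so the prompt helper works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID, torch_dtype="auto", device_map="auto"
    )
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Decode only the newly generated tokens, not the prompt.
    new_tokens = output[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


# Example (triggers the model download, so it is left commented out):
# print(generate(build_chatml_prompt("Explain GRPO in one sentence.")))
```

Because the checkpoint is a merged 16-bit model rather than a 4-bit adapter, it should load with plain `transformers` and no Unsloth-specific tooling, at the cost of a larger download.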