Name: daraai-dev/Qwen2.5-0.5B-MAIMD-SPECTRUM-123HPI API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: daraai-dev

Model Overview

daraai-dev/Qwen2.5-0.5B-MAIMD-SPECTRUM-123HPI is a compact 0.5 billion parameter language model, built upon the Qwen2.5-0.5B-Instruct architecture. It has been further fine-tuned using the TRL (Transformers Reinforcement Learning) library to improve its performance in instruction-following tasks.

Key Characteristics

Base Model: Fine-tuned from Qwen/Qwen2.5-0.5B-Instruct.
Training Method: Utilizes Supervised Fine-Tuning (SFT) for enhanced instruction adherence.
Context Length: Supports a context window of 32,768 tokens, allowing for processing of substantial input prompts.
Frameworks: Developed using TRL (version 1.5.1), Transformers (version 5.0.0), Pytorch (version 2.11.0+cu128), Datasets (version 4.8.5), and Tokenizers (version 0.22.2).

Use Cases

This model is suitable for applications requiring a small, efficient, and instruction-tuned language model. Its fine-tuning process makes it particularly effective for:

General text generation based on user instructions.
Quick prototyping and deployment in resource-constrained environments.
Tasks benefiting from a 32K context window for understanding longer queries or documents.

Overview

Model Overview

Key Characteristics

Use Cases

Full Model Card (README)