OldKingMeister/Qwen2.5-1.5B-Instruct-YaRN
OldKingMeister/Qwen2.5-1.5B-Instruct-YaRN is a 1.54 billion parameter instruction-tuned causal language model based on the Qwen2.5 architecture from the Qwen team. It features an extended context length of 131,072 tokens, enabled by the YaRN technique, which substantially improves its ability to process and generate long texts. It performs well in coding, mathematics, instruction following, and structured data understanding, making it suitable for applications that require extensive context and precise output.
OldKingMeister/Qwen2.5-1.5B-Instruct-YaRN Overview
This model is an instruction-tuned variant of the Qwen2.5-1.5B base model, enhanced with the YaRN (Yet another RoPE extensioN) technique to extend its context length from the native 32,768 tokens to 131,072 tokens. This modification allows it to process and generate exceptionally long sequences, the key differentiator for this model.
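To make the mechanism concrete, below is a minimal sketch of the "NTK-by-parts" interpolation at the heart of YaRN: high-frequency RoPE dimensions keep their original rotary frequency, low-frequency dimensions are divided by the context scaling factor, and dimensions in between are blended with a linear ramp. The parameter names and defaults (`beta_fast`, `beta_slow`, the 0.1·ln(s)+1 attention temperature) follow the YaRN paper's conventions, and the 32,768-token original window is Qwen2.5's documented native context; this is an illustration, not the model's actual implementation.

```python
import math

def yarn_frequencies(dim=64, base=10000.0,
                     orig_ctx=32768, new_ctx=131072,
                     beta_fast=32, beta_slow=1):
    """Sketch of YaRN's "NTK-by-parts" RoPE interpolation."""
    s = new_ctx / orig_ctx  # context scaling factor (4.0 here)

    def correction_dim(num_rotations):
        # Dimension index whose wavelength completes `num_rotations`
        # full rotations within the original context window.
        return (dim * math.log(orig_ctx / (num_rotations * 2 * math.pi))
                / (2 * math.log(base)))

    low = max(math.floor(correction_dim(beta_fast)), 0)
    high = min(math.ceil(correction_dim(beta_slow)), dim // 2 - 1)

    freqs = []
    for d in range(dim // 2):
        theta = base ** (-2 * d / dim)  # original RoPE frequency
        ramp = min(max((d - low) / (high - low), 0.0), 1.0)
        # ramp = 0 -> keep theta unchanged; ramp = 1 -> divide by s
        freqs.append((1 - ramp) * theta + ramp * theta / s)

    # Attention temperature the YaRN paper recommends for long inputs
    mscale = 0.1 * math.log(s) + 1.0
    return freqs, mscale
```

With these defaults, the highest-frequency dimension is left untouched while the lowest-frequency dimension is scaled by exactly 1/4, matching the 32K-to-128K extension.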
Key Capabilities
- Extended Context Handling: Processes up to 131,072 tokens, ideal for tasks requiring deep understanding of lengthy documents or conversations.
- Improved Core Abilities: Builds upon Qwen2.5's advancements in coding, mathematics, and instruction following.
- Structured Output Generation: Enhanced capabilities in understanding structured data (e.g., tables) and generating structured outputs, particularly JSON.
- Multilingual Support: Supports over 29 languages, including Chinese, English, French, and Spanish.
- Robust Instruction Following: More resilient to diverse system prompts, improving role-play implementation and condition-setting for chatbots.
Good For
- Applications requiring very long context windows, such as summarizing extensive documents, analyzing large codebases, or maintaining long-running conversational agents.
- Tasks demanding precise instruction following and the generation of structured data.
- Use cases in coding and mathematical problem-solving where the Qwen2.5 base model shows strong performance.
- Multilingual applications needing broad language support.
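Qwen2.5 model cards describe enabling the full 131,072-token window by adding a `rope_scaling` entry to the model's `config.json`. A sketch of that fragment, assuming the 32,768-token native window stated above:

```json
{
  "rope_scaling": {
    "factor": 4.0,
    "original_max_position_embeddings": 32768,
    "type": "yarn"
  }
}
```

Note that this static YaRN configuration applies the scaling factor to all inputs regardless of length, which Qwen's documentation cautions may slightly affect performance on short texts; consider adding it only when long-context processing is actually needed.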