choiqs/Qwen3-1.7B-tldr-bsz128-ts300-regular-skywork8b-seed42-lr1e-6-warmup10-checkpoint75

Text Generation · Concurrency Cost: 1 · Model Size: 2B · Quant: BF16 · Ctx Length: 32k · Published: Apr 8, 2026 · Architecture: Transformer

choiqs/Qwen3-1.7B-tldr-bsz128-ts300-regular-skywork8b-seed42-lr1e-6-warmup10-checkpoint75 is a fine-tune of Qwen3-1.7B (roughly 2 billion total parameters) with a 32,768-token context length. Its detailed repository name suggests a TL;DR-style summarization fine-tune, with the training configuration (batch size, learning rate, warmup schedule, checkpoint step) encoded directly in the name. This specialized fine-tuning is its primary differentiator, making it a candidate for text summarization over long inputs.

Model Overview

This model is a fine-tune of Qwen3-1.7B, a roughly 2-billion-parameter language model from the Qwen3 series. It supports a context length of 32,768 tokens, enabling it to process and understand long input sequences in a single pass.
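
Assuming the checkpoint is published in the standard Hugging Face format (as Qwen3 fine-tunes typically are), a minimal loading sketch with the transformers library would look like the following; only the repo ID comes from this page, the rest is boilerplate.

```python
# Minimal loading sketch, assuming a standard Hugging Face checkpoint layout.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = (
    "choiqs/Qwen3-1.7B-tldr-bsz128-ts300-regular-"
    "skywork8b-seed42-lr1e-6-warmup10-checkpoint75"
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # matches the BF16 weights listed above
    device_map="auto",           # requires `accelerate`; places layers automatically
)
```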

Key Characteristics

  • Architecture: Qwen3-based, inheriting the design of the Qwen3 family of large language models.
  • Parameter Count: Roughly 2 billion total parameters (1.7B in the base model's name), balancing capability and computational cost.
  • Context Length: 32,768 tokens, allowing deep contextual understanding over extended inputs.
  • Specialized Fine-tuning: The repository name encodes the training setup: "tldr" points to summarization as the target task, alongside batch size 128 (bsz128), seed 42 (seed42), learning rate 1e-6 (lr1e-6), a 10-step warmup (warmup10), and checkpoint 75 (checkpoint75); "ts300" and "skywork8b" are less certain, plausibly 300 training steps and a Skywork 8B reward or reference model. A speculative decoding of each segment appears in the sketch after this list.
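
The training details above are inferred from the repository name rather than documented by the authors. The mapping below is a speculative, segment-by-segment reading of the name; every interpretation in it is an assumption.

```python
# Hypothetical reading of each name segment; none of these interpretations
# are confirmed by the model authors -- they follow common naming conventions.
name_segments = {
    "Qwen3-1.7B":   "base model (Qwen3 architecture, ~2B total parameters)",
    "tldr":         "fine-tuning task: TL;DR-style summarization",
    "bsz128":       "batch size 128",
    "ts300":        "likely 300 training steps (possibly sequence length)",
    "regular":      "training regime variant",
    "skywork8b":    "possibly a Skywork 8B reward or reference model",
    "seed42":       "random seed 42",
    "lr1e-6":       "learning rate 1e-6",
    "warmup10":     "10 warmup steps",
    "checkpoint75": "checkpoint saved at step 75",
}

for segment, reading in name_segments.items():
    print(f"{segment:>14}  ->  {reading}")
```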

Potential Use Cases

Given its architecture and naming convention, this model is likely optimized for:

  • Text Summarization: Generating concise summaries of lengthy documents or conversations (see the usage sketch after this list).
  • Long-Context Understanding: Applications requiring the processing and interpretation of extensive textual data.
  • Specialized NLP Tasks: Use cases that benefit from a model fine-tuned with specific training regimes for improved performance on targeted objectives.
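
Continuing the loading sketch from the overview above, a hedged usage example for the summarization case might look as follows. The "TL;DR:" prompt format is an assumption based on common TL;DR fine-tuning setups, not documented behavior of this checkpoint.

```python
# Usage sketch, continuing from the loading example above. The prompt format
# is an assumption: TL;DR-style fine-tunes commonly expect the source text
# followed by a "TL;DR:" cue, but the actual format is not documented here.
document = "Paste the long source text to summarize here."  # up to ~32k tokens

prompt = f"{document}\n\nTL;DR:"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

output_ids = model.generate(
    **inputs,
    max_new_tokens=128,  # summaries are short; cap generation accordingly
    do_sample=False,     # greedy decoding for a reproducible summary
)

# Decode only the newly generated tokens, not the echoed prompt.
summary = tokenizer.decode(
    output_ids[0][inputs["input_ids"].shape[1]:],
    skip_special_tokens=True,
)
print(summary)
```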