choiqs/Qwen3-1.7B-tldr-bsz128-ts300-regular-skywork8b-seed42-lr1e-6-warmup10-checkpoint300
This is a 1.7 billion parameter Qwen3 model, fine-tuned as indicated by its detailed naming convention (tldr-bsz128-ts300-regular-skywork8b-seed42-lr1e-6-warmup10-checkpoint300). While full training details are not provided, the naming suggests fine-tuning for summarization ("tldr"), and the Qwen3-1.7B base model supports a 32768-token context length. It is designed for efficient processing and generation of concise text, making it suitable for applications requiring brief and relevant information extraction.
Overview
This model is a 1.7 billion parameter variant of the Qwen3 architecture, identified by its specific training configuration: tldr-bsz128-ts300-regular-skywork8b-seed42-lr1e-6-warmup10-checkpoint300. The "tldr" (Too Long; Didn't Read) tag in the name indicates fine-tuning for summarization or concise-response generation.
Key Characteristics
- Model Family: Qwen3 architecture.
- Parameter Count: 1.7 billion parameters, per the Qwen3-1.7B base model in the name.
- Context Length: Supports a 32768-token context length, inherited from the Qwen3 base architecture, allowing longer inputs to be processed.
- Training Specifics: The suffix decodes to a batch size of 128 (bsz128), 300 training steps (ts300), a learning rate of 1e-6 (lr1e-6) with a warmup setting of 10 (warmup10), random seed 42 (seed42), and this being checkpoint 300. The "skywork8b" token plausibly references a Skywork 8B model (for example, a reward model), though this is not documented.
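Example Usage
The checkpoint can presumably be loaded like any other causal language model from the Hugging Face Hub. The snippet below is a minimal sketch, assuming the repository is public and compatible with transformers' AutoModelForCausalLM; the repo id is taken from the model name above, and the bf16 dtype is an assumption.
```python
# Minimal loading sketch, assuming a standard transformers-compatible
# Qwen3 causal LM hosted on the Hub under the repo id from the title.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = (
    "choiqs/Qwen3-1.7B-tldr-bsz128-ts300-regular-"
    "skywork8b-seed42-lr1e-6-warmup10-checkpoint300"
)

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    torch_dtype=torch.bfloat16,  # assumption: bf16 weights; use torch.float32 on CPU
    device_map="auto",           # requires the accelerate package
)
model.eval()
```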
Potential Use Cases
Given the "tldr" tag in its name, this model is likely optimized for the following (see the usage sketch after this list):
- Text Summarization: Generating brief and accurate summaries from longer texts.
- Information Extraction: Quickly distilling key points from documents or conversations.
- Concise Response Generation: Creating short, direct answers in conversational AI or question-answering systems.
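Continuing from the loading sketch above, the snippet below illustrates the summarization use case. The "TL;DR:" prompt template is an assumption (a common convention for TL;DR-style fine-tunes), not a documented training format for this checkpoint.
```python
# Hypothetical summarization call; the "TL;DR:" template is an assumption.
post = (
    "Long post or article text goes here, up to the model's "
    "32768-token context window."
)
prompt = f"{post}\n\nTL;DR:"

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output_ids = model.generate(
    **inputs,
    max_new_tokens=64,  # keep the summary short
    do_sample=False,    # greedy decoding for a deterministic summary
)
# Decode only the newly generated tokens, skipping the prompt.
summary = tokenizer.decode(
    output_ids[0][inputs["input_ids"].shape[-1]:],
    skip_special_tokens=True,
)
print(summary.strip())
```
Greedy decoding is used here because summaries benefit from determinism; sampling parameters can be swapped in if more varied outputs are desired.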