choiqs/Qwen3-1.7B-tldr-bsz128-ts300-regular-skywork8b-seed42-lr1e-6-warmup10-checkpoint125

Text Generation · Model size: 2B (1.7B parameters per the model name) · Quantization: BF16 · Context length: 32k · Published: Apr 8, 2026 · Architecture: Transformer

choiqs/Qwen3-1.7B-tldr-bsz128-ts300-regular-skywork8b-seed42-lr1e-6-warmup10-checkpoint125 is a 1.7 billion parameter language model based on the Qwen3 architecture, fine-tuned for TL;DR (Too Long; Didn't Read) summarization. The name encodes the training configuration: batch size 128 (bsz128), most plausibly a 300-step schedule (ts300) with this checkpoint saved at step 125, a learning rate of 1e-6 with 10 warmup steps, and seed 42. The skywork8b tag most likely refers to an 8B Skywork reward model used to score summaries during fine-tuning, and regular likely labels the standard run variant.


Overview

This model is a TL;DR summarization fine-tune of Qwen3-1.7B: given a longer passage, it produces a short, digestible summary. The fully specified hyperparameters in the checkpoint name (batch size 128, learning rate 1e-6, 10 warmup steps, seed 42, checkpoint step 125 of a likely 300-step run) make the training run easy to identify and reproduce, and the skywork8b tag suggests an 8B Skywork reward model guided the fine-tuning, a setup typical of RLHF-style training on the Reddit TL;DR task. A minimal usage sketch follows.
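The sketch below shows one plausible way to run the model with the Hugging Face transformers library. It assumes the checkpoint is hosted on the Hub in standard format; the bare "post + TL;DR:" prompt mirrors the classic Reddit TL;DR setup and is an assumption, not a documented prompt format for this checkpoint.

```python
# Minimal inference sketch; assumes the checkpoint is on the Hugging Face Hub
# in standard transformers format. The "TL;DR:" prompt style is an assumption.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "choiqs/Qwen3-1.7B-tldr-bsz128-ts300-regular-skywork8b-seed42-lr1e-6-warmup10-checkpoint125"

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID,
    torch_dtype=torch.bfloat16,  # matches the BF16 precision listed above
    device_map="auto",           # GPU if available, otherwise CPU
)

post = "..."  # the long text you want summarized
prompt = f"{post}\n\nTL;DR:"  # assumed prompt format

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(
    **inputs,
    max_new_tokens=64,  # TL;DR outputs are short by design
    do_sample=False,    # greedy decoding for a stable, repeatable summary
)
# Strip the prompt tokens so only the generated summary is printed.
summary = tokenizer.decode(
    output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
)
print(summary)
```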

Key Capabilities

  • Efficient TL;DR Summarization: Optimized for generating short, digestible summaries from longer texts.
  • Qwen3 Architecture: Inherits the Qwen3 family's tokenizer, 32k context window, and chat template (see the chat-style sketch after this list).
  • Documented Training Setup: The checkpoint name records every key hyperparameter (batch size 128, learning rate 1e-6, 10 warmup steps, seed 42, checkpoint step 125), making the run easy to identify and reproduce.
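Since the base model is Qwen3, the tokenizer likely ships with the family's chat template. The sketch below (reusing `tokenizer`, `model`, and `post` from the snippet above) shows chat-style prompting; `enable_thinking=False` is Qwen3's switch for skipping the model's reasoning preamble, and whether this fine-tune still honors the template and the flag is an assumption.

```python
# Chat-style prompting sketch; assumes the fine-tune keeps Qwen3's chat
# template. Reuses tokenizer, model, and post from the previous snippet.
messages = [
    {
        "role": "user",
        "content": f"Summarize the following post in one or two sentences.\n\n{post}",
    }
]
chat_prompt = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True,
    enable_thinking=False,  # Qwen3-specific flag; untested on this checkpoint
)
inputs = tokenizer(chat_prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```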

Good For

  • Text Summarization: Ideal for applications that need quick, concise summaries of documents, articles, or other textual content.
  • Information Extraction: Can be used to distill key points from longer passages.
  • Resource-Constrained Environments: At 1.7 billion parameters it is far cheaper to run than larger summarization models, making it a candidate for deployments where compute and memory are limited (see the deployment sketch below).
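As a rough memory budget: 1.7B parameters at 2 bytes each in BF16 is about 3.4 GB of weights, before activations and KV cache, so the model fits on a single modest GPU or can run (slowly) on CPU. A deployment-oriented sketch, assuming standard transformers support:

```python
# Pipeline-based sketch for constrained deployments; assumes standard
# transformers support and that the checkpoint is on the Hugging Face Hub.
import torch
from transformers import pipeline

summarizer = pipeline(
    "text-generation",
    model="choiqs/Qwen3-1.7B-tldr-bsz128-ts300-regular-skywork8b-seed42-lr1e-6-warmup10-checkpoint125",
    torch_dtype=torch.bfloat16,  # ~3.4 GB of weights at this precision
    device_map="auto",           # falls back to CPU if no GPU is present
)

result = summarizer(
    "...long article text...\n\nTL;DR:",  # assumed prompt format, as above
    max_new_tokens=64,
    return_full_text=False,  # return only the generated summary
)
print(result[0]["generated_text"])
```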