choiqs/Qwen3-1.7B-tldr-bsz128-ts500-regular-skywork8b-seed42-lr1e-5-warmup10-checkpoint100

Text generation · Concurrency cost: 1 · Model size: 2B · Quant: BF16 · Context length: 32k · Published: Apr 27, 2026 · Architecture: Transformer

choiqs/Qwen3-1.7B-tldr-bsz128-ts500-regular-skywork8b-seed42-lr1e-5-warmup10-checkpoint100 is a 1.7-billion-parameter language model based on the Qwen3 architecture, shared by choiqs. Its naming convention, which encodes 'tldr' alongside training hyperparameters, indicates a fine-tuned variant. The model card does not describe its primary differentiator or intended applications, suggesting it is a specialized fine-tune for a task documented elsewhere.


Model Overview

This model, choiqs/Qwen3-1.7B-tldr-bsz128-ts500-regular-skywork8b-seed42-lr1e-5-warmup10-checkpoint100, is a 1.7-billion-parameter language model based on the Qwen3 architecture, as its name indicates. The naming convention, including 'tldr', 'bsz128', 'ts500', and other training parameters, suggests it is a fine-tuned or specialized version of a Qwen3 base model, likely optimized for a particular task or dataset.

Key Characteristics

  • Architecture: Qwen3-based, a known family of large language models.
  • Parameter Count: 1.7 billion parameters (Qwen3-1.7B), making it a relatively compact model suitable for resource-constrained deployments.
  • Context Length: The model supports a context length of 32768 tokens.
  • Training Details: The model name encodes specific training hyperparameters: bsz128 (batch size 128), ts500 (500 training steps), lr1e-5 (learning rate 1e-5), warmup10 (10 warmup steps), seed42 (random seed 42), and checkpoint100 (the checkpoint saved at step 100). 'skywork8b' plausibly refers to an 8B Skywork reward model used during training, though the card does not confirm this.
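The hyperparameters listed above are recoverable mechanically from the repository name. The sketch below parses the naming convention into a dictionary; the field meanings (batch size, training steps, etc.) are inferred from the name and are not confirmed by the model card.

```python
import re

MODEL_ID = (
    "choiqs/Qwen3-1.7B-tldr-bsz128-ts500-regular-"
    "skywork8b-seed42-lr1e-5-warmup10-checkpoint100"
)

def parse_run_name(model_id: str) -> dict:
    """Extract the training hyperparameters encoded in the repo name.

    Field meanings are inferred from common naming conventions and
    are assumptions, not documented facts.
    """
    name = model_id.split("/")[-1]
    patterns = {
        "batch_size": r"bsz(\d+)",
        "train_steps": r"ts(\d+)",
        "warmup_steps": r"warmup(\d+)",
        "seed": r"seed(\d+)",
        "checkpoint": r"checkpoint(\d+)",
    }
    params = {}
    for key, pat in patterns.items():
        m = re.search(pat, name)
        if m:
            params[key] = int(m.group(1))
    # Learning rate is written in scientific notation, e.g. "lr1e-5".
    m = re.search(r"lr(\d+e-\d+)", name)
    if m:
        params["learning_rate"] = float(m.group(1))
    return params
```

Calling `parse_run_name(MODEL_ID)` yields the values quoted in the bullet above (batch size 128, 500 steps, and so on), which is a quick sanity check when comparing sibling checkpoints from the same training run.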

Use Cases

Because the model card omits specifics, the precise direct and downstream use cases are not explicitly defined. However, given its parameter count and the 'tldr' (Too Long; Didn't Read) indicator in its name, it is plausible that this model was fine-tuned for summarization or concise information extraction, in the style of Reddit TL;DR summarization tasks. Developers should consult the model's original source or conduct their own evaluation to determine its suitability for specific applications.
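If the summarization hypothesis holds, the checkpoint can be tried with Hugging Face transformers as a standard causal LM. The sketch below is a hypothetical usage example: the `POST: ... TL;DR:` prompt template is an assumption based on common TL;DR fine-tunes and is not documented in the model card, so verify it against the actual training format before relying on outputs.

```python
MODEL_ID = (
    "choiqs/Qwen3-1.7B-tldr-bsz128-ts500-regular-"
    "skywork8b-seed42-lr1e-5-warmup10-checkpoint100"
)

def build_tldr_prompt(post: str) -> str:
    # Assumed prompt shape for TL;DR-style fine-tunes; the real
    # training template may differ.
    return f"POST: {post.strip()}\n\nTL;DR:"

def summarize(post: str, max_new_tokens: int = 64) -> str:
    """Greedy-decode a summary. Downloads ~1.7B weights on first call."""
    # Imported lazily so the prompt helper works without transformers.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, torch_dtype="bfloat16")
    inputs = tokenizer(build_tldr_prompt(post), return_tensors="pt")
    out = model.generate(**inputs, max_new_tokens=max_new_tokens, do_sample=False)
    # Decode only the newly generated tokens, skipping the prompt.
    return tokenizer.decode(
        out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
    )
```

Greedy decoding (`do_sample=False`) is a reasonable default for summarization, where deterministic, conservative outputs are usually preferable to sampled variety.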