choiqs/Qwen3-1.7B-tldr-bsz128-ts500-ranking1.429-skywork8b-seed42-lr1e-6-warmup10-checkpoint275

Text Generation · Concurrency Cost: 1 · Model Size: 2B · Quant: BF16 · Ctx Length: 32k · Published: Apr 25, 2026 · Architecture: Transformer

choiqs/Qwen3-1.7B-tldr-bsz128-ts500-ranking1.429-skywork8b-seed42-lr1e-6-warmup10-checkpoint275 is a language model in the Qwen3-1.7B size class (roughly 2 billion parameters in total) developed by choiqs, with a 32768-token context window. The name encodes its training setup and marks it as a fine-tuning checkpoint rather than a general-purpose release: the "tldr" tag plausibly points to TL;DR-style summarization, and "skywork8b" plausibly to an 8B Skywork reward model used for preference scoring, though the model card confirms neither. Its primary differentiator and intended use cases are not explicitly documented, suggesting an experimental or research checkpoint.

Model Overview

This checkpoint is a fine-tuned variant of Qwen3-1.7B. Its 32768-token context window makes it capable of processing lengthy inputs such as long documents or extended conversations.

Key Characteristics

  • Model Type: A transformer-based causal language model; the name identifies the base model as Qwen3-1.7B.
  • Parameter Count: Roughly 2 billion parameters (the Qwen3-1.7B size class), small enough for single-GPU deployment in BF16.
  • Context Length: Supports a 32768-token context window, which is beneficial for tasks requiring extensive contextual understanding; the loading sketch after this list checks this figure against the shipped config.
  • Development: Developed by choiqs, with the training setup embedded in the name. Plausible readings (unconfirmed by the card): bsz128 = batch size 128, ts500 = 500 training steps, lr1e-6 = learning rate 1e-6, warmup10 = 10 warmup steps, seed42 = random seed 42, checkpoint275 = the checkpoint saved at step 275, ranking1.429 = a ranking metric or loss value, and skywork8b = an 8B Skywork reward model. Together these suggest a highly specialized fine-tuning run.
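The snippet below is a minimal loading sketch using the standard Hugging Face transformers API. The model ID is taken from this card, and the BF16 dtype and 32k context figures from the metadata above; whether the repository loads cleanly this way is an assumption, not something the card confirms.

```python
# Minimal loading sketch, assuming the repo is compatible with the standard
# transformers AutoModel API (not confirmed by the model card).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "choiqs/Qwen3-1.7B-tldr-bsz128-ts500-ranking1.429-skywork8b-seed42-lr1e-6-warmup10-checkpoint275"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # the card lists BF16 weights
    device_map="auto",
)

# Sanity-check the advertised 32k context window against the shipped config.
print(model.config.max_position_embeddings)  # expected: 32768
```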

Use Cases

The model card does not define direct or downstream uses. Given the architecture, the 32k context window, and the hints in the name, it is plausibly suited for:

  • Text Generation: Creating coherent and contextually relevant text.
  • Language Understanding: Tasks requiring comprehension of long documents or conversations.
  • Specialized Applications: The training parameters in the name imply optimization for a particular ranking or summarization objective, though details are not provided; a hedged summarization example follows this list.
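As a concrete but speculative illustration, the sketch below continues from the loading code above and prompts the model in a Reddit TL;DR style, which the "tldr" tag in the name hints at. The prompt format is an assumption; the card does not document a prompt template.

```python
# Continues from the loading sketch above (reuses `tokenizer` and `model`).
# The TL;DR prompt format is an assumption inferred from the "tldr" tag in
# the model name; the card does not specify one.
prompt = (
    "SUBREDDIT: r/personalfinance\n"
    "POST: I just started my first full-time job and I'm not sure how to "
    "split my paycheck between rent, savings, and paying down my student "
    "loans. Any simple rules of thumb?\n"
    "TL;DR:"
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(
    **inputs,
    max_new_tokens=64,
    do_sample=False,  # greedy decoding for a reproducible summary
)
# Strip the prompt tokens and print only the generated continuation.
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```

If the checkpoint was in fact trained with a different template (for example, a chat format), output quality with this raw prompt may be poor; treat it as a starting point for experimentation.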