choiqs/Qwen3-1.7B-tldr-bsz128-ts300-regular-skywork8b-seed42-lr1e-6-warmup10-checkpoint275

Text Generation · Concurrency Cost: 1 · Model Size: 2B · Quant: BF16 · Ctx Length: 32k · Published: Apr 8, 2026 · Architecture: Transformer

choiqs/Qwen3-1.7B-tldr-bsz128-ts300-regular-skywork8b-seed42-lr1e-6-warmup10-checkpoint275 is a roughly 2 billion parameter language model (1.7B, per its name) based on the Qwen3 architecture and fine-tuned for specific tasks. The model is shared by choiqs and supports a 32768 token context length. Its specific differentiators and primary use cases are not detailed in the provided information.


Overview

This model, choiqs/Qwen3-1.7B-tldr-bsz128-ts300-regular-skywork8b-seed42-lr1e-6-warmup10-checkpoint275, is a roughly 2 billion parameter language model with a substantial context length of 32768 tokens. It is shared by choiqs and appears to be a fine-tuned variant: its name follows a convention that encodes training hyperparameters such as batch size (bsz128), training steps (ts300), learning rate (lr1e-6), random seed (seed42), warmup steps (warmup10), and the step at which this checkpoint was saved (checkpoint275). The base model is Qwen3-1.7B, as the name indicates.
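
If the checkpoint is hosted on the Hugging Face Hub under this exact id and follows the standard Qwen3 layout (neither of which is confirmed by the card), it can in principle be loaded with the transformers library. The snippet below is a minimal sketch under those assumptions, not a documented usage example.

```python
# Minimal loading sketch -- assumes the checkpoint is on the Hugging Face Hub
# under this exact id and is compatible with transformers' Qwen3 support.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "choiqs/Qwen3-1.7B-tldr-bsz128-ts300-regular-skywork8b-seed42-lr1e-6-warmup10-checkpoint275"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # matches the BF16 precision listed above
    device_map="auto",           # place weights on available GPU(s)/CPU
)
```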

Key Characteristics

  • Parameter Count: Roughly 2 billion parameters (1.7B, per the model name), making it a relatively compact model.
  • Context Length: Supports a long context window of 32768 tokens, which is beneficial for processing extensive inputs or generating detailed outputs.
  • Fine-tuned Nature: The model name suggests it has undergone specific fine-tuning, most plausibly for TL;DR-style summarization (implied by "tldr"), though explicit details are not provided; a hypothetical usage sketch follows this list.
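
Building on the loading sketch above (it reuses the model and tokenizer objects defined there), the following shows one plausible way to request a TL;DR-style summary. The "TL;DR:" prompt format is an assumption; the card does not document how the model was prompted during fine-tuning.

```python
# Hypothetical TL;DR-style summarization call; the "TL;DR:" prompt suffix is
# an assumption, since the fine-tuning prompt format is not documented.
post = (
    "Long post goes here. The 32768-token context window leaves room "
    "for very long posts or several documents at once."
)
prompt = f"{post}\n\nTL;DR:"

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output_ids = model.generate(
    **inputs,
    max_new_tokens=64,   # summaries are short
    do_sample=False,     # greedy decoding for a repeatable summary
)
# Strip the prompt tokens and decode only the newly generated summary.
summary = tokenizer.decode(
    output_ids[0, inputs["input_ids"].shape[1]:],
    skip_special_tokens=True,
)
print(summary)
```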

Limitations

Based on the provided model card, specific details regarding the model's developers, funding, exact type, language(s), license, fine-tuning base, direct and downstream use cases, out-of-scope uses, biases, risks, limitations, training data, training procedure, evaluation metrics, and results are currently marked as "More Information Needed". Users should be aware of these gaps when considering the model for any application.