choiqs/Qwen3-1.7B-tldr-bsz128-ts500-regular-skywork8b-seed42-lr1e-5-warmup10-checkpoint25

Text Generation · Concurrency Cost: 1 · Model Size: 2B · Quant: BF16 · Ctx Length: 32k · Published: Apr 27, 2026 · Architecture: Transformer

choiqs/Qwen3-1.7B-tldr-bsz128-ts500-regular-skywork8b-seed42-lr1e-5-warmup10-checkpoint25 is a roughly 2 billion parameter language model (1.7B, per its name) with a 32768-token context length. It is a fine-tuned variant of Qwen3-1.7B whose detailed name encodes its training configuration. The model card does not state its intended task, though the "tldr" component of the name suggests TL;DR-style summarization; further details from the developer would be needed to confirm its exact strengths.


Overview

This model, choiqs/Qwen3-1.7B-tldr-bsz128-ts500-regular-skywork8b-seed42-lr1e-5-warmup10-checkpoint25, is a ~2 billion parameter (1.7B) language model with a 32768-token context window, published to the Hugging Face Hub. Its name appears to encode the fine-tuning setup: a batch size of 128 (bsz128), 500 training steps (ts500; possibly a sequence length), a learning rate of 1e-5 (lr1e-5), 10 warmup steps (warmup10), random seed 42 (seed42), and a snapshot taken at checkpoint 25. The "tldr" and "skywork8b" components further suggest fine-tuning for TL;DR summarization, possibly involving an 8B Skywork reward model, though neither is confirmed by the card.
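Since the checkpoint is hosted on the Hugging Face Hub, it can presumably be loaded with the standard transformers API. The following is a minimal sketch, assuming the repository ships a standard Qwen3 causal-LM layout and that the BF16 quantization noted in the listing applies to the stored weights:

```python
# Minimal loading sketch; assumes a standard Qwen3 causal-LM repository layout.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = (
    "choiqs/Qwen3-1.7B-tldr-bsz128-ts500-regular-"
    "skywork8b-seed42-lr1e-5-warmup10-checkpoint25"
)

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    torch_dtype=torch.bfloat16,  # the listing reports BF16 weights
    device_map="auto",           # place layers on available GPU(s)/CPU
)
```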

Key Characteristics

  • Model Size: Approximately 2 billion parameters (1.7B, per the model name).
  • Context Length: Supports a long context window of 32768 tokens.
  • Fine-tuned: Optimized for a particular task or domain; the name's "tldr" component suggests summarization, though the card does not state the objective.
  • Training Details: The model name encodes training hyperparameters such as batch size, training steps, learning rate, and warmup, indicating a tightly controlled training run (see the parsing sketch after this list).
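As a quick illustration of how the naming convention can be read programmatically, the sketch below extracts each suspected hyperparameter with a regular expression. The field names and meanings are inferences from the name alone, not documented facts:

```python
import re

RUN_NAME = (
    "choiqs/Qwen3-1.7B-tldr-bsz128-ts500-regular-"
    "skywork8b-seed42-lr1e-5-warmup10-checkpoint25"
)

# Inferred meanings; none of these fields are confirmed by the model card.
patterns = {
    "batch_size":    r"bsz(\d+)",         # bsz128       -> 128
    "train_steps":   r"ts(\d+)",          # ts500        -> 500 (or seq. length)
    "seed":          r"seed(\d+)",        # seed42       -> 42
    "learning_rate": r"lr(\de-\d)",       # lr1e-5       -> 1e-5
    "warmup":        r"warmup(\d+)",      # warmup10     -> 10
    "checkpoint":    r"checkpoint(\d+)",  # checkpoint25 -> 25
}

config = {key: re.search(pat, RUN_NAME).group(1) for key, pat in patterns.items()}
print(config)
# {'batch_size': '128', 'train_steps': '500', 'seed': '42',
#  'learning_rate': '1e-5', 'warmup': '10', 'checkpoint': '25'}
```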

Use Cases

The model card does not explicitly define direct or downstream use cases. The "tldr" component of the name suggests TL;DR-style summarization as the likely target task, and models with such granular naming conventions typically come from research sweeps over training hyperparameters. Users should consult the model developer or original source for intended applications and performance metrics; suitability for general-purpose tasks or specific benchmarks cannot be determined from the card alone.
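If the TL;DR reading of the name is correct, a natural smoke test is a Reddit-style summarization prompt. The sketch below reuses the model and tokenizer loaded earlier; the prompt format is a guess and may not match whatever template was used during fine-tuning:

```python
# Hypothetical TL;DR-style prompt; the actual fine-tuning template is unknown.
prompt = (
    "POST: I trained a small language model over a weekend and learned more "
    "from fixing my data pipeline than from any hyperparameter sweep.\n"
    "TL;DR:"
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=48, do_sample=False)

# Strip the prompt tokens and print only the generated summary.
summary = tokenizer.decode(
    output[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True
)
print(summary)
```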