choiqs/Qwen3-1.7B-tldr-bsz128-ts500-ranking1.429-skywork8b-seed42-lr1e-6-warmup10-checkpoint200

Text Generation · Concurrency Cost: 1 · Model Size: 2B · Quant: BF16 · Ctx Length: 32k · Published: Apr 25, 2026 · Architecture: Transformer

choiqs/Qwen3-1.7B-tldr-bsz128-ts500-ranking1.429-skywork8b-seed42-lr1e-6-warmup10-checkpoint200 is a 2 billion parameter language model. Its name identifies it as a fine-tuned variant and encodes the training configuration, including the batch size (bsz128), number of training steps (ts500), and a ranking metric (ranking1.429). The model card does not document its primary differentiators or intended use cases, so further information is needed to determine its unique strengths and ideal applications.


Model Overview

choiqs/Qwen3-1.7B-tldr-bsz128-ts500-ranking1.429-skywork8b-seed42-lr1e-6-warmup10-checkpoint200 is a 2 billion parameter language model, most likely fine-tuned from a Qwen3-1.7B base with the specific training configuration encoded in its name.

Key Characteristics

  • Parameter Count: 2 billion parameters.
  • Context Length: 32768 tokens.
  • Fine-tuning Details: The name encodes the training configuration: a batch size of 128 (bsz128), 500 training steps (ts500), a ranking metric of 1.429 (ranking1.429), a reference to skywork8b (possibly an 8B Skywork reward model), random seed 42 (seed42), a learning rate of 1e-6 (lr1e-6), 10 warmup steps (warmup10), and a checkpoint saved at step 200 (checkpoint200). Together these indicate a highly customized training run.
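Since the model card documents nothing beyond the name itself, the encoded hyperparameters can be recovered programmatically. The sketch below is a minimal, illustrative parser; the field names and their meanings are inferences from the naming convention above, not documented by the model author.

```python
import re

def parse_checkpoint_name(name: str) -> dict:
    """Extract training hyperparameters encoded in the checkpoint name.

    Field meanings are inferred from the naming convention and are
    assumptions, not documented facts.
    """
    patterns = {
        "batch_size": r"bsz(\d+)",
        "train_steps": r"ts(\d+)",
        "ranking": r"ranking([\d.]+)",
        "seed": r"seed(\d+)",
        "learning_rate": r"lr(\d+e-?\d+)",
        "warmup_steps": r"warmup(\d+)",
        "checkpoint_step": r"checkpoint(\d+)",
    }
    fields = {}
    for key, pat in patterns.items():
        m = re.search(pat, name)
        if m:
            fields[key] = m.group(1)
    return fields

name = ("choiqs/Qwen3-1.7B-tldr-bsz128-ts500-ranking1.429-"
        "skywork8b-seed42-lr1e-6-warmup10-checkpoint200")
print(parse_checkpoint_name(name))
# → {'batch_size': '128', 'train_steps': '500', 'ranking': '1.429',
#    'seed': '42', 'learning_rate': '1e-6', 'warmup_steps': '10',
#    'checkpoint_step': '200'}
```

Values are kept as strings, since a learning rate like `1e-6` and an integer step count would otherwise need per-field type handling.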

Limitations and Recommendations

The model card states that information about direct use, downstream applications, out-of-scope uses, biases, risks, and specific recommendations is not yet available. Users should treat these as open unknowns, exercise caution, and conduct thorough testing on their specific use cases to understand the model's performance and limitations before deployment.