choiqs/Qwen3-1.7B-tldr-bsz128-ts500-regularsqrt2-skywork8b-seed42-lr1e-6-warmup10-checkpoint75
The choiqs/Qwen3-1.7B-tldr-bsz128-ts500-regularsqrt2-skywork8b-seed42-lr1e-6-warmup10-checkpoint75 is a 1.7 billion parameter language model, likely based on the Qwen3 architecture and, judging from the tldr tag in its name, likely fine-tuned for TL;DR-style summarization. With a context length of 32768 tokens, it is designed for applications requiring substantial input processing. The remaining components of the name appear to encode training hyperparameters (batch size, training steps, learning-rate schedule, seed, checkpoint step), but the model card itself does not yet confirm the model's primary differentiator or main use case.
Overview
This model, choiqs/Qwen3-1.7B-tldr-bsz128-ts500-regularsqrt2-skywork8b-seed42-lr1e-6-warmup10-checkpoint75, is a 1.7 billion parameter language model. It is hosted on Hugging Face, and its automatically generated model card identifies it as a transformers model. The detailed name suggests specific training configurations: a batch size of 128 (bsz128), either 500 training steps or a 500-token sequence length (ts500), a learning rate of 1e-6 with a 10-step warmup (lr1e-6, warmup10), and a random seed of 42 (seed42), with this artifact being the checkpoint saved at step 75 (checkpoint75).
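Since the card identifies this as a transformers model, it can presumably be loaded with the standard Auto classes. The snippet below is a minimal sketch, assuming the repository ships both the tokenizer and the model weights:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "choiqs/Qwen3-1.7B-tldr-bsz128-ts500-regularsqrt2-skywork8b-seed42-lr1e-6-warmup10-checkpoint75"

# Assumes the repository contains a tokenizer alongside the model weights.
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",   # use the dtype stored in the checkpoint
    device_map="auto",    # requires `accelerate`; remove to load on CPU
)
```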
Key Characteristics
- Parameter Count: 1.7 billion parameters.
- Context Length: Supports a context window of 32768 tokens.
- Training Details: The model name implies a specific training regimen, potentially involving a square-root learning-rate schedule (regularsqrt2) and a Skywork 8B model, possibly used as a reward model during fine-tuning (skywork8b), though explicit details are marked as "More Information Needed" in the model card.
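If the tldr tag does indicate fine-tuning for TL;DR-style summarization (an inference from the name, not confirmed by the card), usage might look like the following sketch. It continues from the loading snippet above and assumes a plain post-plus-"TL;DR:" prompt rather than a chat template:

```python
# Hypothetical usage sketch: the summarization task and the prompt format are
# inferred from the model name, not confirmed by the model card.
post = "..."  # placeholder for the text to summarize
prompt = f"{post}\n\nTL;DR:"

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128, do_sample=False)

# Decode only the newly generated tokens, skipping the prompt.
summary = tokenizer.decode(
    outputs[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True
)
print(summary)
```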
Limitations
The current model card marks significant information, including its development, funding, specific model type, language(s), license, and the base model it was fine-tuned from, as "More Information Needed." Consequently, its intended direct uses, downstream applications, out-of-scope uses, biases, risks, and limitations are not yet documented. Users should exercise caution and conduct thorough evaluations before deployment, as comprehensive information on its performance and ethical considerations is pending.