choiqs/Qwen3-1.7B-tldr-bsz128-ts500-regularsqrt2-skywork8b-seed42-lr1e-6-warmup10-checkpoint100

Text generation · Concurrency cost: 1 · Model size: 2B · Quantization: BF16 · Context length: 32k · Published: Apr 27, 2026 · Architecture: Transformer

choiqs/Qwen3-1.7B-tldr-bsz128-ts500-regularsqrt2-skywork8b-seed42-lr1e-6-warmup10-checkpoint100 is a 1.7-billion-parameter language model (catalogued as roughly 2B) based on the Qwen3 architecture. The model name encodes its training configuration, including a batch size of 128, a ts500 setting (interpreted here as a 500-token sequence length), and a learning rate of 1e-6 with a 10-step warmup. With a context length of 32,768 tokens, it is intended for general language understanding and generation tasks; the available information does not describe any task-specific optimizations.


Overview

This model, choiqs/Qwen3-1.7B-tldr-bsz128-ts500-regularsqrt2-skywork8b-seed42-lr1e-6-warmup10-checkpoint100, is a 1.7-billion-parameter language model built on the Qwen3 architecture. It is described as a pre-trained model, though specific details regarding its development, funding, and exact model type are not provided in the current model card. The card indicates a standard Hugging Face Transformers checkpoint; the card itself was automatically generated when the model was pushed to the Hub.
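Since the card identifies this as a standard Transformers checkpoint, loading it should follow the usual causal-LM pattern. The sketch below is an assumption based on the model name and the BF16/text-generation metadata, not a verified usage example; the repository may not be publicly accessible.

```python
# Minimal loading sketch -- assumes the checkpoint is public and follows the
# standard Qwen3 causal-LM layout; adjust dtype/device for your hardware.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "choiqs/Qwen3-1.7B-tldr-bsz128-ts500-regularsqrt2-skywork8b-seed42-lr1e-6-warmup10-checkpoint100"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # catalog metadata lists BF16
    device_map="auto",
)

# "tldr" in the name suggests a summarization-style fine-tune; the prompt is illustrative.
prompt = "Summarize the following post:\n..."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```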

Key Characteristics

  • Parameter Count: Approximately 1.7 billion parameters (catalogued as 2B).
  • Context Length: Supports a substantial context window of 32768 tokens.
  • Training Configuration: The model name suggests specific training parameters, including a batch size of 128, a sequence length of 500 (ts500), and a learning rate of 1e-6 with a 10-step warmup; a hypothetical reconstruction of these settings is sketched below this list.
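For illustration only, the hyperparameters the name appears to encode could be written down as a Hugging Face TrainingArguments object. This is a reconstruction from the checkpoint name, not the author's actual training recipe; the output directory and the reading of each name fragment are assumptions.

```python
# Hypothetical reconstruction of the hyperparameters encoded in the model name;
# not an official recipe. "ts500" is read in this card as a 500-token sequence
# length, which has no TrainingArguments field and would be handled at tokenization.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="qwen3-1.7b-tldr-sketch",  # hypothetical output path
    per_device_train_batch_size=128,      # "bsz128"
    learning_rate=1e-6,                   # "lr1e-6"
    warmup_steps=10,                      # "warmup10"
    seed=42,                              # "seed42"
    bf16=True,                            # catalog metadata lists BF16
    save_steps=100,                       # "checkpoint100" suggests a save every 100 steps
)
```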

Limitations and Recommendations

The current model card lacks detailed information regarding the model's specific language(s), license, training data, evaluation metrics, and potential biases or risks. Users are advised that more information is needed to fully understand its capabilities, limitations, and appropriate use cases. It is recommended that users exercise caution and conduct their own evaluations before deploying this model in critical applications, as its intended direct and downstream uses are not specified.