choiqs/Qwen3-1.7B-tldr-bsz128-ts500-regularsqrt2-skywork8b-seed42-lr1e-6-warmup10-checkpoint50
The choiqs/Qwen3-1.7B-tldr-bsz128-ts500-regularsqrt2-skywork8b-seed42-lr1e-6-warmup10-checkpoint50 model is a 1.7 billion parameter language model based on the Qwen3 architecture. It is a fine-tuned variant whose name encodes its training configuration, including a batch size of 128 (bsz128), likely 500 training steps (ts500), and a learning rate of 1e-6 (lr1e-6). Its primary differentiator and intended use case are currently unspecified because the model card provides limited information, suggesting it may be an experimental checkpoint requiring further details before specific application guidance can be given.
Model Overview
The choiqs/Qwen3-1.7B-tldr-bsz128-ts500-regularsqrt2-skywork8b-seed42-lr1e-6-warmup10-checkpoint50 model is a 1.7 billion parameter language model built upon the Qwen3 architecture. It has been pushed to the Hugging Face Hub with an automatically generated model card. While specific details regarding its development, funding, and exact model type are currently marked as "More Information Needed," the model name itself indicates a fine-tuning run with particular hyperparameters.
Key Characteristics
- Parameter Count: 1.7 billion parameters (per the model name).
- Context Length: 32768 tokens.
- Training Parameters: The model name suggests specific training configurations, including a batch size of 128 (bsz128), likely 500 training steps (ts500), a learning rate of 1e-6 (lr1e-6), a 10-step warmup period (warmup10), a fixed random seed (seed42), and the checkpoint saved at step 50 (checkpoint50).
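Since the model card itself provides no structured metadata, the hyperparameters above come entirely from tokens in the repository name. A minimal sketch of extracting them programmatically, using a hypothetical `parse_run_name` helper (not part of the model repo) and assuming, for example, that ts500 denotes total training steps:

```python
import re

def parse_run_name(name: str) -> dict:
    """Parse hyperparameter tokens embedded in a training-run name.

    Field meanings are inferred from the name alone (e.g. ts500 read as
    training steps), so treat them as guesses, not ground truth.
    """
    patterns = {
        "batch_size": r"bsz(\d+)",
        "training_steps": r"ts(\d+)",
        "seed": r"seed(\d+)",
        "learning_rate": r"lr(\d+e-?\d+)",
        "warmup_steps": r"warmup(\d+)",
        "checkpoint": r"checkpoint(\d+)",
    }
    fields = {}
    for key, pattern in patterns.items():
        match = re.search(pattern, name)
        if match:
            fields[key] = match.group(1)
    return fields

name = ("choiqs/Qwen3-1.7B-tldr-bsz128-ts500-regularsqrt2-"
        "skywork8b-seed42-lr1e-6-warmup10-checkpoint50")
print(parse_run_name(name))
# → {'batch_size': '128', 'training_steps': '500', 'seed': '42',
#    'learning_rate': '1e-6', 'warmup_steps': '10', 'checkpoint': '50'}
```

Tokens the patterns do not cover (tldr, regularsqrt2, skywork8b) plausibly refer to the dataset, schedule, and reward model, but the card does not confirm this, so they are left unparsed.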
Current Status and Limitations
According to the model card, detailed information regarding the model's intended uses, specific capabilities, training data, evaluation results, and potential biases or limitations is currently unavailable. Further information is needed to understand its direct and downstream applications, and users should be aware that its risks, biases, and limitations are yet to be specified.