Hulyyy/qwen-test

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:0.8BQuant:BF16Ctx Length:32kPublished:Oct 16, 2025License:apache-2.0Architecture:Transformer Open Weights Warm

Hulyyy/qwen-test is a 0.8 billion parameter causal language model, fine-tuned from Qwen/Qwen3-0.6B. This model was trained for 1 epoch with a learning rate of 5e-05, achieving a validation loss of 0.2279. Its specific intended uses and primary differentiators are not detailed in the available information, suggesting it may be a base or experimental fine-tune.

Loading preview...

Model Overview

Hulyyy/qwen-test is a fine-tuned language model based on the Qwen3-0.6B architecture. It comprises approximately 0.8 billion parameters and was developed using the Transformers library.

Training Details

The model underwent a single epoch of fine-tuning with a learning rate of 5e-05. Key training hyperparameters included a train_batch_size of 1 and an eval_batch_size of 8, utilizing the ADAMW_TORCH_FUSED optimizer. During training, it achieved a validation loss of 0.2279.

Current Status

As per the available information, specific details regarding the dataset used for fine-tuning, its intended uses, and limitations are not provided. This suggests it might be an experimental or foundational fine-tune where the specific application or unique capabilities are yet to be defined or publicly documented.