TT0518/qwen25-3b-1.58bit-qat

Text Generation · Concurrency Cost: 1 · Model Size: 3.1B · Quant: BF16 · Ctx Length: 32k · Published: Apr 27, 2026 · License: other · Architecture: Transformer

The TT0518/qwen25-3b-1.58bit-qat model is a 3.1-billion-parameter language model based on Qwen/Qwen2.5-3B, optimized with 1.58-bit Quantization-Aware Training (QAT). It applies a ternary quantization scheme ({-1, 0, +1}) to all linear layers except lm_head and embed_tokens, and was fine-tuned on a dataset combining WikiText-103 and Wikipedia JA (Japanese Wikipedia), giving it a reduced memory footprint for efficient language processing tasks.


Model Overview

This model, TT0518/qwen25-3b-1.58bit-qat, is a specialized version of the Qwen2.5-3B base model, featuring 1.58-bit Quantization-Aware Training (QAT). This advanced quantization technique significantly reduces the model's memory footprint while aiming to preserve performance.

Key Technical Details

  • Base Model: Qwen/Qwen2.5-3B
  • Quantization Method: 1.58-bit QAT using a ternary scheme ({-1, 0, +1}); see the sketch after this list
  • Quantization Scope: Applied to all Linear layers, excluding lm_head and embed_tokens
  • Training Data: Fine-tuned on a mix of WikiText-103 (70%) and Wikipedia JA (30%)
  • Training Process: Involved 50,000 chunks of 512 tokens each, with a two-stage fine-tuning approach.
  • Final Perplexity (PPL): Achieved 43.92, indicating its language modeling capability post-quantization.
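
As a rough illustration of the quantization scheme described above, the sketch below shows ternary weight quantization applied to linear layers during training. The model card does not specify the exact recipe, so the details here (per-tensor absmean scaling, a straight-through estimator, and skipping lm_head by name) are assumptions based on common 1.58-bit QAT practice, not the author's confirmed implementation.

```python
# Minimal sketch of 1.58-bit (ternary) quantization-aware training for Linear layers.
# Assumptions: per-tensor absmean scaling and a straight-through estimator (STE);
# the model card does not document the actual scaling or training recipe.
import torch
import torch.nn as nn
import torch.nn.functional as F


def ternary_quantize(w: torch.Tensor) -> torch.Tensor:
    """Map weights to {-1, 0, +1} * scale, keeping gradients via a straight-through estimator."""
    scale = w.abs().mean().clamp(min=1e-5)          # per-tensor absmean scale (assumed)
    w_q = (w / scale).round().clamp(-1, 1) * scale  # ternary values, rescaled back
    return w + (w_q - w).detach()                   # forward uses w_q, backward flows through w


class TernaryLinear(nn.Linear):
    """Linear layer whose weights are ternarized on the fly during QAT."""

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return F.linear(x, ternary_quantize(self.weight), self.bias)


def ternarize_linear_layers(model: nn.Module) -> None:
    """Swap nn.Linear modules for TernaryLinear, skipping lm_head.

    embed_tokens is an nn.Embedding, so it is excluded automatically.
    """
    targets = [
        (name, module) for name, module in model.named_modules()
        if isinstance(module, nn.Linear)
        and not isinstance(module, TernaryLinear)
        and "lm_head" not in name
    ]
    for name, module in targets:
        q = TernaryLinear(module.in_features, module.out_features,
                          bias=module.bias is not None)
        q.load_state_dict(module.state_dict())
        q.to(device=module.weight.device, dtype=module.weight.dtype)
        parent = model.get_submodule(name.rsplit(".", 1)[0]) if "." in name else model
        setattr(parent, name.rsplit(".", 1)[-1], q)
```

After swapping the layers, fine-tuning proceeds with the usual language-modeling loss; the ternarization happens inside each forward pass, so the full-precision master weights are still the ones being updated.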

File Formats Available

  • Standard Hugging Face *.safetensors (float16), approximately 6 GB; see the loading sketch below.
  • Optimized qwen25_3b_qat_q4km.gguf in GGUF Q4_K_M quantization, around 1.9 GB, suitable for local inference with tools like Ollama.
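
For the safetensors checkpoint, a minimal loading sketch with Hugging Face transformers is shown below; the repo id matches this model card, but the generation settings are illustrative defaults, not values specified by the author. The GGUF file is instead meant for llama.cpp-based runtimes such as Ollama and is not loaded this way.

```python
# Minimal sketch: loading the float16 safetensors checkpoint with transformers.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "TT0518/qwen25-3b-1.58bit-qat"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # weights are shipped as float16 safetensors
    device_map="auto",          # requires accelerate; place layers automatically
)

prompt = "The quick brown fox"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```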

Licensing

The model adheres to the Qwen Research License of its base model. Commercial use requires an application to Alibaba Cloud.