PKU-DS-LAB/Fairy2i-W2
TEXT GENERATION
Concurrency Cost: 1
Model Size: 7B
Quant: FP8
Ctx Length: 4k
License: llama2
Architecture: Transformer
0.0K | Open Weights | Cold

Fairy2i-W2 by PKU-DS-LAB is a 7-billion-parameter language model based on LLaMA-2 that reaches an effective 2-bit precision through a complex-valued quantization framework. The framework converts pre-trained real-valued layers into a widely-linear complex form, which allows extremely low-bit quantization while reusing existing checkpoints. The model is optimized for efficient inference on commodity hardware, with performance described as nearly comparable to full-precision baselines.
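
The "widely-linear complex form" mentioned above can be illustrated with a small sketch. It relies on a standard identity: any real-linear map from 2n real inputs to 2m real outputs can be written exactly as y = U x + W conj(x) with complex matrices U and W. The function names, and the convention that the first half of a vector holds real parts and the second half imaginary parts, are assumptions made for illustration. This is not the Fairy2i code and it does not cover the quantization step.

```python
import random


def to_widely_linear(R, m, n):
    """Rewrite a real (2m x 2n) matrix R as complex (m x n) matrices U, W
    such that y = U x + W conj(x) reproduces the real map.

    Assumed layout (illustrative): R = [[R11, R12], [R21, R22]] acts on
    [Re x; Im x] and produces [Re y; Im y].
    """
    U = [[0j] * n for _ in range(m)]
    W = [[0j] * n for _ in range(m)]
    for i in range(m):
        for j in range(n):
            r11, r12 = R[i][j], R[i][j + n]
            r21, r22 = R[i + m][j], R[i + m][j + n]
            U[i][j] = 0.5 * complex(r11 + r22, r21 - r12)
            W[i][j] = 0.5 * complex(r11 - r22, r21 + r12)
    return U, W


def apply_real(R, v):
    """Ordinary real matrix-vector product."""
    return [sum(r * x for r, x in zip(row, v)) for row in R]


def apply_widely_linear(U, W, x):
    """Compute y = U x + W conj(x) for a complex input vector x."""
    return [
        sum(u * z + w * z.conjugate() for u, w, z in zip(u_row, w_row, x))
        for u_row, w_row in zip(U, W)
    ]


def max_abs_error(m=2, n=3, seed=0):
    """Compare the real layer and its widely-linear form on random data."""
    rng = random.Random(seed)
    R = [[rng.uniform(-1, 1) for _ in range(2 * n)] for _ in range(2 * m)]
    x = [complex(rng.uniform(-1, 1), rng.uniform(-1, 1)) for _ in range(n)]
    v = [z.real for z in x] + [z.imag for z in x]

    y_real = apply_real(R, v)  # [Re y; Im y]
    U, W = to_widely_linear(R, m, n)
    y_cplx = apply_widely_linear(U, W, x)

    errors = [abs(y_cplx[i].real - y_real[i]) for i in range(m)]
    errors += [abs(y_cplx[i].imag - y_real[i + m]) for i in range(m)]
    return max(errors)


if __name__ == "__main__":
    print("max abs error:", max_abs_error())
```

Because the rewrite is exact, the pre-trained real weights carry over to the complex form without loss before any quantization is applied, which is consistent with the description's point about reusing existing checkpoints.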
