ParetoQaft/3B-base
Text Generation | Concurrency Cost: 1 | Model Size: 3.2B | Quant: BF16 | Ctx Length: 32k | Published: Feb 5, 2026 | License: llama3.2 | Architecture: Transformer | Warm

ParetoQaft/3B-base is a 3.21-billion-parameter multilingual large language model from the Llama 3.2 family developed by Meta. It is an auto-regressive transformer with an optimized architecture that uses Grouped-Query Attention (GQA) and offers a 32K context length. The model is optimized for multilingual dialogue use cases, including agentic retrieval and summarization tasks, and is reported to outperform many open-source and closed chat models on common industry benchmarks. It supports English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai, and has a knowledge cutoff of December 2023.
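
To give a rough sense of why Grouped-Query Attention matters at a 32K context in BF16, the sketch below estimates the key/value cache size for one full-length sequence. The layer count, head counts, and head dimension are assumptions (values commonly cited for Llama 3.2 3B, not stated in this listing), and `kv_cache_bytes` is a hypothetical helper written for illustration only.

```python
def kv_cache_bytes(n_layers, n_kv_heads, head_dim, seq_len, bytes_per_elem=2):
    """Bytes needed to cache keys and values for a single sequence."""
    # Factor of 2: each layer stores one key tensor and one value tensor.
    return 2 * n_layers * n_kv_heads * head_dim * seq_len * bytes_per_elem


# Assumed configuration; none of these values appear in the listing itself.
N_LAYERS = 28        # assumed number of transformer layers
N_QUERY_HEADS = 24   # assumed number of query heads
N_KV_HEADS = 8       # assumed number of shared key/value heads under GQA
HEAD_DIM = 128       # assumed per-head dimension

SEQ_LEN = 32 * 1024  # 32K context length, from the listing
BF16_BYTES = 2       # BF16 stores each value in 2 bytes

# With GQA, only the key/value heads are cached.
gqa = kv_cache_bytes(N_LAYERS, N_KV_HEADS, HEAD_DIM, SEQ_LEN, BF16_BYTES)
# Without GQA (standard multi-head attention), every query head has its own K/V.
mha = kv_cache_bytes(N_LAYERS, N_QUERY_HEADS, HEAD_DIM, SEQ_LEN, BF16_BYTES)

print(f"GQA KV cache: {gqa / 2**30:.1f} GiB")  # 3.5 GiB
print(f"MHA KV cache: {mha / 2**30:.1f} GiB")  # 10.5 GiB
print(f"Reduction: {mha // gqa}x")             # 3x
```

Under these assumed values, sharing each key/value head across three query heads cuts the per-sequence cache at full context to one third of what standard multi-head attention would need. The actual figures depend on the model's real configuration and on how the serving stack allocates memory.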
