and-emili/aera-4b

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:May 31, 2025License:apache-2.0Architecture:Transformer Open Weights Warm

ÆRA-4B is a 4 billion parameter language model developed by AND EMILI, specifically designed for enterprise applications requiring context-based reasoning and structured outputs. This model excels at native Italian language processing, generating reliable, context-only responses to reduce hallucination, and producing structured data like JSON for entity extraction and classification. It features native function calling support, making it ideal for building intelligent agents, RAG implementations, and automation pipelines that prioritize predictable behavior and on-premises deployment.

Loading preview...

Overview

ÆRA-4B is a 4 billion parameter language model from AND EMILI, engineered for enterprise-focused applications rather than general-purpose conversation. Its core design emphasizes context-based reasoning and structured outputs, making it a reliable foundation for intelligent agents and automation.

Key Capabilities

  • Native Italian Language Support: Optimized for understanding and generating Italian text.
  • Context-Only Responses: Trained to rely exclusively on provided context, explicitly stating when information is unavailable to minimize hallucination.
  • Structured Output Generation: Reliably produces well-formed JSON, performs entity extraction, classification, and sentiment analysis within context.
  • Function Calling: Integrates seamlessly into agentic workflows and automation pipelines through native tool use support.
  • On-premises Deployment: Optimized for local deployment on standard hardware, ensuring privacy and no external API calls.

Good For

  • Retrieval Augmented Generation (RAG) implementations and viability testing.
  • Document analysis and information extraction in enterprise settings.
  • Automated workflows requiring predictable, structured outputs.
  • Multi-agent systems where reliable behavior is critical.
  • Companies looking to build proof-of-concepts for LLM-based solutions with a lightweight, efficient model.