stefra/mistral_ablazione_full_ner

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:May 20, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

The stefra/mistral_ablazione_full_ner is a 7 billion parameter Mistral-based instruction-tuned causal language model developed by stefra. Fine-tuned from unsloth/mistral-7b-instruct-v0.3-bnb-4bit, this model is optimized for specific Named Entity Recognition (NER) tasks. It leverages Unsloth for faster training and is designed for applications requiring efficient and accurate entity extraction.

Loading preview...

Model Overview

The stefra/mistral_ablazione_full_ner is a 7 billion parameter instruction-tuned language model, developed by stefra. It is based on the Mistral architecture and was fine-tuned from the unsloth/mistral-7b-instruct-v0.3-bnb-4bit model. This model specifically focuses on Named Entity Recognition (NER) tasks, indicating its specialization in identifying and classifying entities within text.

Key Characteristics

  • Architecture: Mistral-based, a powerful causal language model family.
  • Parameter Count: 7 billion parameters, offering a balance between performance and computational efficiency.
  • Fine-tuning: Instruction-tuned for specific applications, likely related to NER, enhancing its ability to follow directives for entity extraction.
  • Training Efficiency: Utilizes Unsloth and Huggingface's TRL library, enabling 2x faster training compared to standard methods.
  • Context Length: Supports a context length of 4096 tokens.

Potential Use Cases

This model is particularly well-suited for:

  • Named Entity Recognition (NER): Identifying and categorifying entities such as persons, organizations, locations, and dates from unstructured text.
  • Information Extraction: Automating the extraction of specific data points from large volumes of text.
  • Text Analysis: Enhancing understanding of textual content by highlighting key entities.
  • Applications requiring efficient entity identification: Due to its optimized training and specialized fine-tuning, it can be a strong candidate for tasks where quick and accurate NER is crucial.