numind/NuExtract-tiny
Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:0.6BQuant:BF16Ctx Length:32kPublished:May 31, 2024License:mitArchitecture:Transformer0.0K Open Weights Warm

numind/NuExtract-tiny is a 0.6 billion parameter language model developed by NuMind, based on the Qwen1.5-0.5B architecture. It is specifically fine-tuned for information extraction tasks from text, leveraging a private, high-quality synthetic dataset. This model excels at purely extractive information retrieval, ensuring all output text is present in the original input, and supports structured extraction via JSON templates.

Loading preview...

NuExtract-tiny: Specialized Information Extraction Model

NuExtract-tiny, developed by NuMind, is a 0.6 billion parameter model built upon the Qwen1.5-0.5B architecture. It has been meticulously fine-tuned on a proprietary, high-quality synthetic dataset, making it highly effective for structured information extraction from text.

Key Capabilities

  • Purely Extractive: Guarantees that all extracted information is directly present in the original input text, preventing hallucination.
  • Template-Driven Extraction: Users can define the desired output structure using a JSON template, guiding the model to extract specific fields.
  • Example-Based Guidance: Supports providing output formatting examples to further refine extraction accuracy for complex tasks.
  • Efficient Processing: Designed to handle input texts up to 2000 tokens, with a maximum context length of 32768 tokens.

Good For

  • Zero-shot Information Extraction: Provides good performance out-of-the-box for various extraction tasks.
  • Task-Specific Fine-tuning: Intended for further fine-tuning on specific use cases with as few as 30 examples to achieve optimal performance.
  • Structured Data Retrieval: Ideal for converting unstructured text into structured JSON outputs based on predefined schemas.