Xtra-Computing/XtraGPT-3B

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:3.2BQuant:BF16Ctx Length:32kPublished:Mar 9, 2025License:otherArchitecture:Transformer0.0K Warm

Xtra-Computing/XtraGPT-3B is a 3 billion parameter instruction-tuned large language model, part of the XtraGPT family, specifically designed for human-AI collaborative academic paper revision. Based on meta-llama/Llama-3.2-3B-Instruct, it excels at context-aware and controllable revisions, understanding the full paper context and following criteria-guided instructions across 20 academic writing criteria. This model is optimized for improving research papers by providing precise, contextually relevant edits.

Loading preview...

XtraGPT-3B: Context-Aware Academic Paper Revision

XtraGPT-3B is a 3 billion parameter model from the XtraGPT family, specifically engineered for human-AI collaborative academic paper revision. Unlike general-purpose LLMs, XtraGPT-3B is fine-tuned to deeply understand the full context of a research paper, ensuring revisions maintain consistency with the overall narrative.

Key Capabilities

  • Context-Aware Revision: Processes the entire paper to provide consistent and relevant edits.
  • Controllable Output: Follows specific user instructions guided by 20 academic writing criteria across 6 paper sections (e.g., Abstract, Introduction).
  • Iterative Workflow Support: Designed to integrate seamlessly into a Human-AI Collaborative (HAC) lifecycle, allowing authors to retain creative control.
  • Specialized Training: Trained on a unique dataset of 140,000 high-quality instruction-revision pairs derived from top-tier conference papers (ICLR).

Good For

  • Researchers and academics seeking AI assistance for refining their scientific papers.
  • Automating precise, criteria-guided revisions in academic writing.
  • Integrating AI into an iterative paper-writing and editing workflow where human oversight is crucial.

This model is released under the highly permissive ModelGo Zero License 2.0 (MG0-2.0), allowing unrestricted use, reproduction, distribution, and creation of derivative works, including for commercial purposes, without attribution requirements.