mricopratama/legal-rag-qwen25-1_5b-grpo

TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:Jun 8, 2026License:apache-2.0Architecture:Transformer Open Weights Cold

The mricopratama/legal-rag-qwen25-1_5b-grpo is a 1.5 billion parameter Qwen2-based causal language model developed by mricopratama, fine-tuned for legal RAG applications. This model was trained using Unsloth and Huggingface's TRL library, enabling faster fine-tuning. It is specifically designed to enhance retrieval-augmented generation in legal contexts, leveraging its 32768 token context length. Its primary strength lies in processing and generating text relevant to legal information retrieval.

Loading preview...

Model Overview

The mricopratama/legal-rag-qwen25-1_5b-grpo is a 1.5 billion parameter Qwen2-based language model, developed by mricopratama. It is a fine-tuned version of mricopratama/legal-rag-qwen25-1_5b-sft, specifically optimized for legal Retrieval-Augmented Generation (RAG) tasks.

Key Capabilities

  • Legal RAG Optimization: This model is explicitly fine-tuned to improve performance in legal RAG scenarios, suggesting enhanced ability to retrieve and generate relevant legal text.
  • Efficient Fine-tuning: The model was fine-tuned using Unsloth and Huggingface's TRL library, indicating an efficient training process that can be replicated or built upon.
  • Qwen2 Architecture: Built upon the Qwen2 architecture, it benefits from the foundational capabilities of this model family.
  • Extended Context Length: With a context length of 32768 tokens, it can process substantial amounts of information, which is beneficial for complex legal documents.

Good For

  • Legal Information Retrieval: Ideal for applications requiring accurate and contextually relevant information extraction from legal texts.
  • Legal Document Analysis: Suitable for tasks involving the understanding and generation of content based on legal documents.
  • Research and Development: Provides a base for further experimentation and fine-tuning within the legal AI domain, particularly for RAG systems.