kianaaa19/legal-chatbot-qwen3b-grpo-final

TEXT GENERATIONConcurrency Cost:1Model Size:3.1BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:Jun 4, 2026License:apache-2.0Architecture:Transformer Open Weights Cold

The kianaaa19/legal-chatbot-qwen3b-grpo-final is a 3.1 billion parameter Qwen2 model developed by kianaaa19, fine-tuned from kianaaa19/legal-chatbot-qwen3b-sft-merged. This model was trained using Unsloth and Huggingface's TRL library for accelerated performance. It is specifically designed for legal chatbot applications, leveraging its fine-tuned nature to excel in legal domain interactions.

Loading preview...

Overview

kianaaa19/legal-chatbot-qwen3b-grpo-final is a specialized 3.1 billion parameter Qwen2 language model developed by kianaaa19. It is a fine-tuned iteration, building upon the base of kianaaa19/legal-chatbot-qwen3b-sft-merged.

Key Capabilities

  • Legal Domain Specialization: This model is specifically fine-tuned for applications within the legal sector, indicating a focus on understanding and generating legal-centric text.
  • Optimized Training: The model's training process utilized Unsloth and Huggingface's TRL library, which enabled a 2x faster training speed.
  • Qwen2 Architecture: Based on the Qwen2 architecture, it inherits the foundational capabilities of this model family.

Good For

  • Legal Chatbot Development: Its primary utility lies in powering chatbots and conversational AI systems designed for legal inquiries, information retrieval, or assistance.
  • Legal Text Processing: Applications requiring an understanding or generation of legal documents, clauses, or discussions could benefit from its specialized training.
  • Efficient Deployment: Given its 3.1 billion parameters and optimized training, it offers a balance between performance and computational efficiency for legal AI tasks.