RickyIG/legal-qwen25-3b-sft

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:3.1BQuant:BF16Ctx Length:32kPublished:May 19, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

RickyIG/legal-qwen25-3b-sft is a 3.1 billion parameter Qwen2.5 model developed by RickyIG, fine-tuned for specific applications. This model was efficiently trained using Unsloth and Huggingface's TRL library, enabling faster fine-tuning. It is designed for tasks requiring a compact yet capable language model, leveraging the Qwen2.5 architecture.

Loading preview...

Overview

RickyIG/legal-qwen25-3b-sft is a 3.1 billion parameter model based on the Qwen2.5 architecture, developed by RickyIG. It has been fine-tuned from the unsloth/Qwen2.5-3B-Instruct-bnb-4bit base model.

Key Capabilities

  • Efficient Training: This model was fine-tuned significantly faster using Unsloth and Huggingface's TRL library, indicating an optimized training process.
  • Compact Size: With 3.1 billion parameters, it offers a balance between performance and computational efficiency, making it suitable for deployment in resource-constrained environments.

Good For

  • Specific Domain Applications: As a fine-tuned model, it is likely optimized for particular use cases, potentially in legal or related fields, given the model's naming convention.
  • Fast Prototyping and Deployment: The use of Unsloth for accelerated training suggests it's well-suited for developers looking to quickly adapt a Qwen2.5 base model for custom tasks.