rnlkav/legal-Llama-3.1-8B-ft-grpo

TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kTool Calling:SupportedPublished:Jun 7, 2026License:apache-2.0Architecture:Transformer Open Weights Cold

The rnlkav/legal-Llama-3.1-8B-ft-grpo model is an 8 billion parameter Llama 3.1-based language model developed by rnlkav. This model was fine-tuned using Unsloth and Huggingface's TRL library, enabling faster training. It is designed for general language tasks, leveraging its Llama 3.1 architecture for robust performance.

Loading preview...

Model Overview

rnlkav/legal-Llama-3.1-8B-ft-grpo is an 8 billion parameter language model developed by rnlkav. It is based on the Llama 3.1 architecture and was fine-tuned from unsloth/llama-3.1-8b-unsloth-bnb-4bit. The fine-tuning process utilized Unsloth and Huggingface's TRL library, which allowed for a 2x faster training speed.

Key Characteristics

  • Architecture: Llama 3.1-based, providing a strong foundation for various NLP tasks.
  • Parameter Count: 8 billion parameters, offering a balance between performance and computational efficiency.
  • Training Efficiency: Fine-tuned with Unsloth, known for accelerating model training.
  • License: Released under the permissive Apache-2.0 license.

Potential Use Cases

This model is suitable for a range of general-purpose language applications, benefiting from its Llama 3.1 base and efficient fine-tuning. Developers looking for an 8B parameter model with a focus on optimized training could consider this for tasks such as text generation, summarization, and question answering.