Merdeka-LLM/merdeka-llm-lawyer-3b-128k-instruct
Merdeka-LLM/merdeka-llm-lawyer-3b-128k-instruct is a 3.2 billion parameter Llama-3.2-3B-Instruct model developed by Merdeka-LLM, fine-tuned for instruction following. This model was trained using Unsloth and Huggingface's TRL library, enabling faster training. It is designed for general instruction-following tasks, leveraging its Llama architecture and efficient training methodology.
Loading preview...
Model Overview
Merdeka-LLM/merdeka-llm-lawyer-3b-128k-instruct is a 3.2 billion parameter instruction-tuned language model developed by Merdeka-LLM. It is fine-tuned from the unsloth/Llama-3.2-3B-Instruct base model, indicating its foundation in the Llama architecture.
Key Characteristics
- Developer: Merdeka-LLM
- Base Model: Fine-tuned from
unsloth/Llama-3.2-3B-Instruct. - Training Efficiency: The model was trained significantly faster using Unsloth and Huggingface's TRL library. Unsloth is known for its capabilities in accelerating the fine-tuning process of large language models.
- License: The model is released under the Apache-2.0 license.
Intended Use
This model is primarily intended for instruction-following tasks, leveraging its fine-tuned nature to respond to user prompts effectively. Its efficient training process suggests a focus on practical deployment and performance for common NLP applications.