Merdeka-LLM/merdeka-llm-lawyer-3b-128k-instruct

TEXT GENERATIONConcurrency Cost:1Model Size:3.2BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:Oct 16, 2024License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

Merdeka-LLM/merdeka-llm-lawyer-3b-128k-instruct is a 3.2 billion parameter Llama-3.2-3B-Instruct model developed by Merdeka-LLM, fine-tuned for instruction following. This model was trained using Unsloth and Huggingface's TRL library, enabling faster training. It is designed for general instruction-following tasks, leveraging its Llama architecture and efficient training methodology.

Loading preview...

Model Overview

Merdeka-LLM/merdeka-llm-lawyer-3b-128k-instruct is a 3.2 billion parameter instruction-tuned language model developed by Merdeka-LLM. It is fine-tuned from the unsloth/Llama-3.2-3B-Instruct base model, indicating its foundation in the Llama architecture.

Key Characteristics

  • Developer: Merdeka-LLM
  • Base Model: Fine-tuned from unsloth/Llama-3.2-3B-Instruct.
  • Training Efficiency: The model was trained significantly faster using Unsloth and Huggingface's TRL library. Unsloth is known for its capabilities in accelerating the fine-tuning process of large language models.
  • License: The model is released under the Apache-2.0 license.

Intended Use

This model is primarily intended for instruction-following tasks, leveraging its fine-tuned nature to respond to user prompts effectively. Its efficient training process suggests a focus on practical deployment and performance for common NLP applications.