Ichsan2895/Merak-7B-v4
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Nov 11, 2023License:cc-by-nc-sa-4.0Architecture:Transformer0.0K Open Weights Warm
Merak-7B-v4 by Ichsan2895 is a 7 billion parameter large language model based on Mistral-7B-OpenOrca, specifically fine-tuned for the Indonesian language. It leverages QLoRA for efficient operation, capable of running with 16 GB VRAM. The model's development involved fine-tuning on Indonesian Wikipedia articles and Indonesian instruction datasets, making it highly suitable for Indonesian natural language processing tasks.
Loading preview...
Merak-7B-v4: Indonesian Language LLM
Merak-7B-v4 is a 7 billion parameter large language model developed by Muhammad Ichsan, specifically optimized for the Indonesian language. It is built upon the powerful Mistral-7B-OpenOrca architecture, enhanced through a multi-stage fine-tuning process.
Key Capabilities & Development:
- Indonesian Language Focus: Merak-7B-v4 is extensively fine-tuned on a curated dataset of Indonesian Wikipedia articles, ensuring strong performance and understanding of Bahasa Indonesia.
- Instruction Following: Further fine-tuning was conducted using Indonesian instruction datasets, specifically Ichsan2895/OASST_Top1_Indonesian and Ichsan2895/alpaca-gpt4-indonesian, to improve its ability to follow instructions and generate relevant responses.
- Efficient Deployment: The model utilizes QLoRA (Quantized LoRA) for efficient fine-tuning and inference, allowing it to run effectively with as little as 16 GB of VRAM.
- ChatML Format: The model is designed to be compatible with the ChatML format for conversational applications.
When to Use Merak-7B-v4:
- Indonesian NLP Applications: Ideal for tasks requiring deep understanding and generation of the Indonesian language.
- Resource-Constrained Environments: Its QLoRA optimization makes it suitable for deployment on hardware with limited VRAM (e.g., 16GB).
- Research and Development: Provides a strong base for further research and fine-tuning on specific Indonesian language tasks.