mervinpraison/Llama-3.1-8B-Instruct-Tamil
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Jul 29, 2024License:apache-2.0Architecture:Transformer Open Weights Cold
The mervinpraison/Llama-3.1-8B-Instruct-Tamil model is an 8 billion parameter instruction-tuned Llama-3.1 variant developed by mervinpraison. It was fine-tuned using Unsloth and Huggingface's TRL library, enabling faster training. This model is specifically adapted for the Tamil language, making it suitable for applications requiring instruction-following capabilities in Tamil.
Loading preview...
Model Overview
The mervinpraison/Llama-3.1-8B-Instruct-Tamil is an 8 billion parameter instruction-tuned language model. Developed by mervinpraison, this model is a fine-tuned version of Meta's Llama-3.1-8B-Instruct.
Key Characteristics
- Base Model: Fine-tuned from
unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit. - Training Efficiency: Leverages Unsloth and Huggingface's TRL library for accelerated training, reportedly achieving 2x faster training speeds.
- Language Focus: Specifically adapted for the Tamil language, indicating its primary utility in Tamil-centric NLP tasks.
Use Cases
This model is particularly well-suited for:
- Instruction-following tasks in Tamil.
- Applications requiring a language model with an emphasis on the Tamil language.
- Research and development in Tamil natural language processing.
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p