mervinpraison/Llama-3.1-8B-Instruct-Tamil

TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Jul 29, 2024License:apache-2.0Architecture:Transformer Open Weights Cold

The mervinpraison/Llama-3.1-8B-Instruct-Tamil model is an 8 billion parameter instruction-tuned Llama-3.1 variant developed by mervinpraison. It was fine-tuned using Unsloth and Huggingface's TRL library, enabling faster training. This model is specifically adapted for the Tamil language, making it suitable for applications requiring instruction-following capabilities in Tamil.

Loading preview...

Model Overview

The mervinpraison/Llama-3.1-8B-Instruct-Tamil is an 8 billion parameter instruction-tuned language model. Developed by mervinpraison, this model is a fine-tuned version of Meta's Llama-3.1-8B-Instruct.

Key Characteristics

  • Base Model: Fine-tuned from unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit.
  • Training Efficiency: Leverages Unsloth and Huggingface's TRL library for accelerated training, reportedly achieving 2x faster training speeds.
  • Language Focus: Specifically adapted for the Tamil language, indicating its primary utility in Tamil-centric NLP tasks.

Use Cases

This model is particularly well-suited for:

  • Instruction-following tasks in Tamil.
  • Applications requiring a language model with an emphasis on the Tamil language.
  • Research and development in Tamil natural language processing.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p