JoPmt/Llama-3.2-3B-Instruct

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:3.2BQuant:BF16Ctx Length:32kPublished:Sep 27, 2024License:llama3.2Architecture:Transformer Warm

JoPmt/Llama-3.2-3B-Instruct is a 3.2 billion parameter instruction-tuned generative language model from the Llama 3.2 family, developed by Meta. Optimized for multilingual dialogue use cases, including agentic retrieval and summarization tasks, it supports English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai. This model is specifically designed for efficient finetuning, offering 2.4x faster training and 58% less memory usage when utilizing Unsloth's optimization framework.

Loading preview...

Model Overview

JoPmt/Llama-3.2-3B-Instruct is a 3.2 billion parameter instruction-tuned model from Meta's Llama 3.2 collection. This model is part of a family of multilingual large language models (LLMs) available in 1B and 3B sizes, optimized for text-in/text-out generative tasks. It utilizes an optimized transformer architecture and has been fine-tuned using supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF) to align with human preferences for helpfulness and safety.

Key Capabilities

  • Multilingual Support: Officially supports English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai, with training on a broader set of languages.
  • Dialogue Optimization: Specifically designed for multilingual dialogue use cases, including agentic retrieval and summarization.
  • Efficient Finetuning: When used with Unsloth, this model can be finetuned 2.4x faster with 58% less memory compared to standard methods, making it highly efficient for custom applications.
  • Instruction-Tuned: Optimized for following instructions, making it suitable for various conversational and task-oriented applications.

Good For

  • Developers looking to finetune a Llama 3.2 model with significant speed and memory efficiency.
  • Applications requiring multilingual dialogue, retrieval, and summarization capabilities.
  • Building custom agents or conversational AI systems where performance and resource optimization are critical.