kehanlu/llama-3.2-8B-Instruct

Warm
Public
8B
FP8
32768
Hugging Face
Overview

Overview

kehanlu/llama-3.2-8B-Instruct is an 8 billion parameter, instruction-tuned language model, specifically designed for text-only applications. It is a distilled version of meta-llama/Llama-3.2-11B-Vision-Instruct, with the vision-related cross-attention layers removed to create a purely linguistic model.

Key Characteristics

  • Text-Only Focus: This model has been meticulously re-engineered from a larger multimodal model to concentrate solely on text processing, making it efficient for language-based tasks.
  • Architecture: Derived from the Llama 3.2 series, it retains the strong instruction-following capabilities and general language understanding of its lineage.
  • Context Length: Supports a substantial context window of 32768 tokens, enabling it to handle longer prompts and generate more coherent, extended responses.
  • Tokenizer Modification: The tokenizer.chat_template has been modified in this repository to remove the date_string behavior, which appended the current date during template application in the original model.

Use Cases

This model is well-suited for a wide range of natural language processing tasks where a powerful, instruction-tuned text model with a large context window is beneficial. Potential applications include advanced chatbots, content generation, summarization, and complex instruction following.