theprint/ReWiz-Llama-3.2-3B

Warm
Public
3.2B
BF16
32768
1
Oct 18, 2024
License: apache-2.0
Hugging Face
Overview

Model Overview

ReWiz-Llama-3.2-3B is a 3.2 billion parameter language model developed by Rasmus Rasmussen. This model is built upon the Llama architecture and is designed for a wide range of natural language processing tasks. It supports a substantial context length of 32,768 tokens, allowing it to process and generate longer sequences of text.

Key Capabilities

  • General Language Understanding: Capable of processing and interpreting various forms of text.
  • Text Generation: Can generate coherent and contextually relevant text.
  • Extended Context: Benefits from a 32,768 token context window, useful for tasks requiring extensive memory or long-form content.

Performance Benchmarks

Evaluated on the Open LLM Leaderboard, ReWiz-Llama-3.2-3B shows an average score of 17.98. Specific benchmark results include:

  • IFEval (0-Shot): 46.49
  • BBH (3-Shot): 19.29
  • MATH Lvl 5 (4-Shot): 9.74
  • MMLU-PRO (5-shot): 20.97

These scores indicate its performance across instruction following, multi-task language understanding, and mathematical reasoning.

Good For

  • Applications requiring a compact yet capable language model.
  • Tasks benefiting from a large context window.
  • General-purpose text generation and understanding where a 3.2B parameter model is suitable.