NousResearch/Meta-Llama-3-8B-Alternate-Tokenizer

Text Generation | Concurrency Cost: 1 | Model Size: 8B | Quant: FP8 | Ctx Length: 8k | Published: Jun 6, 2024 | License: llama3 | Architecture: Transformer

NousResearch/Meta-Llama-3-8B-Alternate-Tokenizer is an 8 billion parameter, auto-regressive language model developed by Meta and built on an optimized transformer architecture. This variant is configured for use with the Hermes tokenizer, offering an alternative to the standard Meta-Llama-3-8B. It is pretrained on over 15 trillion tokens of publicly available data and is intended for commercial and research use in English; instruction-tuned variants of the family excel in dialogue use cases.

Model Overview

NousResearch/Meta-Llama-3-8B-Alternate-Tokenizer is an 8 billion parameter model from the Meta Llama 3 family, developed by Meta. It employs an optimized transformer architecture and is pretrained on over 15 trillion tokens of publicly available data, with a knowledge cutoff of March 2023. This specific repository provides an alternate version of Meta-Llama-3-8B, configured for compatibility with the Hermes tokenizer.

Key Capabilities

  • General Language Generation: Capable of generating text and code.
  • Dialogue Optimization: Instruction-tuned variants are optimized for assistant-like chat use cases.
  • Performance: Reported benchmark scores include MMLU (66.6 for the base model, 68.4 for the instruction-tuned model) and HumanEval (62.2 for the instruction-tuned model).
  • Scalability: Utilizes Grouped-Query Attention (GQA) for improved inference scalability.
  • Context Length: Supports a context length of 8k tokens.
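
To make the Grouped-Query Attention point concrete, the sketch below estimates the key/value cache size for one sequence at the full 8k context. The layer count, head counts, head dimension, and 16-bit cache precision are assumptions that do not appear on this page; they are the values commonly cited for Llama 3 8B, so check the model's config before relying on them.

```python
def kv_cache_bytes(seq_len, n_layers, n_kv_heads, head_dim, bytes_per_value):
    """Approximate key/value cache size in bytes for a single sequence."""
    # Factor 2: each layer stores one tensor for keys and one for values.
    return 2 * n_layers * n_kv_heads * head_dim * seq_len * bytes_per_value

# Assumed values, not stated on this page: 32 layers, 32 query heads,
# 8 key/value heads (GQA), head dimension 128, 2 bytes per cached value.
CTX = 8192  # the 8k context length listed above

gqa = kv_cache_bytes(CTX, n_layers=32, n_kv_heads=8, head_dim=128, bytes_per_value=2)
mha = kv_cache_bytes(CTX, n_layers=32, n_kv_heads=32, head_dim=128, bytes_per_value=2)

print(f"GQA: {gqa / 2**30:.1f} GiB, full multi-head: {mha / 2**30:.1f} GiB")
# prints "GQA: 1.0 GiB, full multi-head: 4.0 GiB"
```

Under these assumptions, sharing each key/value head across four query heads cuts the per-sequence cache to a quarter of what full multi-head attention would need, which is where the inference scalability benefit comes from.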

Good For

  • Commercial and Research Use: Intended for a broad range of applications in English.
  • Assistant-like Chat: Instruction-tuned models are well-suited for dialogue systems.
  • Natural Language Generation: Pretrained models can be adapted for various text generation tasks.
  • Developers seeking Hermes tokenizer compatibility: This model variant is specifically set up for use with the Hermes tokenizer, offering an alternative to the standard Llama 3 tokenizer.

Popular Sampler Settings

Featherless tracks the top 3 parameter combinations its users apply to this model. Each combination sets the following parameters:

  • temperature
  • top_p
  • top_k
  • frequency_penalty
  • presence_penalty
  • repetition_penalty
  • min_p
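
As a rough illustration of what four of the sampler parameters listed above do, here is a minimal pure-Python sketch of temperature, top_k, top_p, and min_p filtering over a toy logit vector. The function name `filter_and_sample` and the order in which the filters run are assumptions for illustration; real inference servers differ in filter order and renormalization, and the three penalty parameters are left out because they depend on previously generated tokens.

```python
import math
import random

def filter_and_sample(logits, temperature=1.0, top_k=0, top_p=1.0, min_p=0.0, rng=None):
    """Return (sampled index, surviving indices) after applying the filters.

    Hypothetical helper: temperature must be greater than 0.
    """
    rng = rng or random.Random()

    # Temperature scaling followed by a numerically stable softmax.
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Rank token indices from most to least probable.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)

    # top_k: keep only the k most probable tokens (0 disables the filter).
    if top_k > 0:
        order = order[:top_k]

    # min_p: drop tokens below min_p times the highest probability.
    threshold = min_p * probs[order[0]]
    order = [i for i in order if probs[i] >= threshold]

    # top_p: keep the smallest prefix whose cumulative probability reaches top_p.
    kept, cumulative = [], 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break

    # Sample among the survivors in proportion to their probabilities.
    weights = [probs[i] for i in kept]
    return rng.choices(kept, weights=weights, k=1)[0], kept

# Toy example with four candidate tokens and a seeded generator.
logits = [2.0, 1.0, 0.5, -1.0]
token, kept = filter_and_sample(
    logits, temperature=0.7, top_k=3, top_p=0.9, min_p=0.05, rng=random.Random(0)
)
print(token, kept)
```

Lower temperature sharpens the distribution before any filtering, so fewer tokens tend to survive the top_p and min_p cuts; setting top_k=1 reduces the procedure to greedy decoding.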