Strangedove/CYFRAGOVPL_Llama-PLLuM-70B-instruct-EmbedFix

Parameters: 70B
Precision: FP8
Context length: 32768
License: llama3.1
Hugging Face
Overview

Strangedove/CYFRAGOVPL_Llama-PLLuM-70B-instruct-EmbedFix is an instruction-tuned language model built on the Llama architecture, with 70 billion parameters and a context window of 32768 tokens. What sets it apart is a fix to its tokenizer and embeddings.

Key Technical Modification

This model incorporates one specific fix:

  • Added tokens were moved into pre-allocated slots within the tokenizer.
  • This restores the embeddings to their original size.

The fix keeps the model's embedding space at its original size, which can matter for downstream tasks that rely on stable and predictable token representations.
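To make the idea concrete, here is a minimal toy sketch of relocating added tokens into pre-allocated slots. It is not the procedure actually used for this model: the function name relocate_added_tokens, the "<reserved_" placeholder prefix, the token strings, and the choice to copy each added token's embedding row into its new slot are all assumptions made for illustration.

```python
# Toy illustration only. All names, sizes, and token strings are hypothetical,
# and copying the embedding rows is an assumption, not the documented method.

def relocate_added_tokens(vocab, embeddings, original_size,
                          reserved_prefix="<reserved_"):
    """Move tokens with ids >= original_size into unused reserved slots.

    vocab: dict mapping token string -> id
    embeddings: list of row vectors, one row per id
    Returns (new_vocab, new_embeddings) with exactly original_size rows.
    """
    id_to_token = {idx: tok for tok, idx in vocab.items()}

    # Placeholder tokens that sit inside the original id range, in id order.
    free_slots = sorted(
        idx for tok, idx in vocab.items()
        if tok.startswith(reserved_prefix) and idx < original_size
    )
    # Tokens that were appended past the original embedding size.
    added = sorted(
        (idx, tok) for tok, idx in vocab.items() if idx >= original_size
    )
    if len(added) > len(free_slots):
        raise ValueError("not enough pre-allocated slots for the added tokens")

    new_vocab = dict(vocab)
    # Truncate the matrix back to its original number of rows.
    new_embeddings = [list(row) for row in embeddings[:original_size]]

    for (old_id, tok), slot in zip(added, free_slots):
        del new_vocab[id_to_token[slot]]           # drop the placeholder
        new_vocab[tok] = slot                      # reuse its id
        new_embeddings[slot] = list(embeddings[old_id])  # keep the learned row
    return new_vocab, new_embeddings


# Hypothetical vocabulary: 4 original ids, 2 tokens appended afterwards.
vocab = {
    "hello": 0, "world": 1,
    "<reserved_0>": 2, "<reserved_1>": 3,
    "<new_a>": 4, "<new_b>": 5,
}
embeddings = [[0.0], [1.0], [2.0], [3.0], [4.0], [5.0]]

new_vocab, new_embeddings = relocate_added_tokens(
    vocab, embeddings, original_size=4
)
print(len(new_embeddings))
print(new_vocab)
```

In this toy case the two appended tokens take over ids 2 and 3, and the embedding matrix shrinks from six rows back to four, so every token id again falls inside the original range.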

Good For

  • Use cases requiring a large language model with a substantial context window.
  • Applications where the stability and original size of token embeddings are a critical consideration.
  • Developers looking for a Llama-based instruction-tuned model whose added tokens fit within the original embeddings size.