chargoddard/Yi-34B-Llama

TEXT GENERATIONConcurrency Cost:2Model Size:34BQuant:FP8Ctx Length:32kPublished:Nov 6, 2023License:yi-licenseArchitecture:Transformer0.1K Cold

chargoddard/Yi-34B-Llama is a 34 billion parameter language model based on the 01-ai/Yi-34B architecture, reconfigured to align with standard Llama modeling code. This model offers a substantial parameter count for complex language tasks and can be loaded without `trust_remote_code` for the model itself. It is primarily designed for developers seeking a Llama-compatible large language model with a 32768 token context length.

Loading preview...

chargoddard/Yi-34B-Llama Overview

This model is a re-implementation of the 01-ai/Yi-34B architecture, specifically adapted to be compatible with standard Llama modeling code. This adaptation allows for easier integration into existing Llama-based workflows and tools.

Key Characteristics

  • Architecture: Based on the Yi-34B model from 01-ai, with internal tensor renaming for Llama compatibility.
  • Parameters: Features 34 billion parameters, providing significant capacity for various natural language processing tasks.
  • Context Length: Supports a substantial context window of 32768 tokens, enabling the processing of longer inputs and generating more coherent, extended outputs.
  • Loading: The model itself can be loaded without requiring the trust_remote_code flag, simplifying deployment. However, the associated tokenizer may still require it, or users can opt for the llama-tokenizer branch for a Llama tokenizer class.

Good For

  • Developers and researchers familiar with the Llama ecosystem looking for a high-parameter model.
  • Applications requiring a large context window for detailed understanding or generation.
  • Experimentation with the Yi-34B architecture within a Llama-compatible framework.