chargoddard/Yi-34B-Llama
TEXT GENERATION | Concurrency Cost: 2 | Model Size: 34B | Quant: FP8 | Ctx Length: 32k | Published: Nov 6, 2023 | License: yi-license | Architecture: Transformer | 0.1K Cold
chargoddard/Yi-34B-Llama is a 34 billion parameter language model based on 01-ai/Yi-34B, with its tensors renamed to match standard Llama modeling code. The model itself can be loaded without `trust_remote_code`. It is aimed primarily at developers who want a Llama-compatible large language model with a 32768 token context length.
chargoddard/Yi-34B-Llama Overview
This model is a repackaging of 01-ai/Yi-34B, adapted to be compatible with standard Llama modeling code. The adaptation makes it easier to integrate into existing Llama-based workflows and tools.
Key Characteristics
- Architecture: Based on the Yi-34B model from 01-ai, with internal tensor renaming for Llama compatibility.
- Parameters: Features 34 billion parameters, providing significant capacity for various natural language processing tasks.
- Context Length: Supports a context window of 32768 tokens, allowing longer inputs and extended outputs.
- Loading: The model itself can be loaded without requiring the `trust_remote_code` flag, simplifying deployment. However, the associated tokenizer may still require it, or users can opt for the `llama-tokenizer` branch for a Llama tokenizer class.
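The loading behaviour described above can be sketched with the Hugging Face `transformers` library. This is a minimal sketch, not the author's documented procedure: it assumes that the `llama-tokenizer` branch is selected through the `revision` argument, that `accelerate` is installed so `device_map="auto"` works, and that enough memory is available for a 34B model. The helper names `tokenizer_kwargs` and `load` are hypothetical.

```python
MODEL_ID = "chargoddard/Yi-34B-Llama"
TOKENIZER_BRANCH = "llama-tokenizer"  # branch name taken from this page


def tokenizer_kwargs(use_llama_branch: bool) -> dict:
    """Choose tokenizer loading options (hypothetical helper).

    Assumption: the Llama tokenizer class lives on a repository branch
    that can be selected with `revision`. Otherwise fall back to
    `trust_remote_code=True`, which the default tokenizer may require.
    """
    if use_llama_branch:
        return {"revision": TOKENIZER_BRANCH}
    return {"trust_remote_code": True}


def load(use_llama_branch: bool = True):
    """Load tokenizer and model; downloads the full 34B weights."""
    # Imported lazily so the helper above works without transformers.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(
        MODEL_ID, **tokenizer_kwargs(use_llama_branch)
    )
    # The model itself uses standard Llama code, so no trust_remote_code.
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype="auto",   # keep the dtype stored in the checkpoint
        device_map="auto",    # assumption: accelerate is installed
    )
    return tokenizer, model
```

Calling `load()` uses the branch tokenizer; `load(use_llama_branch=False)` instead opts into remote code for the tokenizer only.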
Good For
- Developers and researchers familiar with the Llama ecosystem looking for a high-parameter model.
- Applications requiring a large context window for detailed understanding or generation.
- Experimentation with the Yi-34B architecture within a Llama-compatible framework.
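For applications that rely on the large context window, it can help to check that a prompt plus the requested generation length stays within the 32768 token limit stated above. The following is an illustrative sketch; the function name `max_new_tokens` and its clamping policy are assumptions, not part of the model or any library.

```python
CONTEXT_LENGTH = 32768  # context window stated on this page


def max_new_tokens(prompt_tokens: int, requested: int,
                   context_length: int = CONTEXT_LENGTH) -> int:
    """Clamp a requested generation length to the remaining context.

    Hypothetical helper: returns how many new tokens can be generated
    so that prompt + output does not exceed the context window.
    """
    if prompt_tokens < 0 or requested < 0:
        raise ValueError("token counts must be non-negative")
    remaining = context_length - prompt_tokens
    # A prompt longer than the window leaves no room for output.
    return max(0, min(requested, remaining))
```

For example, a 32000 token prompt leaves room for at most 768 new tokens, so a request for 1024 would be clamped to 768.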