migtissera/Tess-34B-v1.5b

TEXT GENERATIONConcurrency Cost:2Model Size:34BQuant:FP8Ctx Length:32kPublished:Jan 28, 2024License:yi-34bArchitecture:Transformer0.0K Cold

Tess-34B-v1.5b is a 34 billion parameter general-purpose large language model developed by migtissera, built upon the Yi-34B-200K base architecture. This model features a substantial 32,768 token context length, making it suitable for processing extensive inputs and generating detailed responses. It is designed for broad applications, leveraging its large parameter count and context window for general language understanding and generation tasks.

Loading preview...

Tess-34B-v1.5b: A General-Purpose LLM

Tess-34B-v1.5b, part of the "Tesoro" (Treasure) series by migtissera, is a robust 34 billion parameter large language model. It is developed from the Yi-34B-200K base, indicating a strong foundation for its capabilities.

Key Features

  • Model Size: 34 billion parameters, offering significant capacity for complex language tasks.
  • Context Length: Supports a substantial 32,768 tokens, enabling the model to handle and understand lengthy documents or conversations.
  • Base Architecture: Built on the Yi-34B-200K, providing a solid and proven architectural backbone.
  • General Purpose: Designed to be versatile across a wide range of applications, from content generation to complex reasoning.

Prompt Format

The model utilizes a clear and structured prompt format, facilitating easy integration and interaction:

SYSTEM: <ANY SYSTEM CONTEXT>
USER: 
ASSISTANT:

This structure helps in providing specific system instructions and clearly delineating user input from the model's response.

Use Cases

Given its general-purpose nature and large context window, Tess-34B-v1.5b is well-suited for applications requiring:

  • In-depth text analysis and summarization.
  • Extended conversational AI and chatbots.
  • Content creation and generation across various domains.
  • Tasks benefiting from processing long documents or codebases.