migtissera/Tess-2.0-Llama-3-8B

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:May 5, 2024License:llama3Architecture:Transformer0.0K Warm

Tess-2.0-Llama-3-8B is an 8 billion parameter general-purpose large language model developed by migtissera, fine-tuned on the Meta-Llama-3-8B base architecture. It was trained on a highly uncensored, high-quality dataset containing approximately 100K code and general training samples. This model is designed to follow instructions consistently due to its uncensored training methodology and is suitable for a wide range of conversational and generative AI tasks.

Loading preview...

Overview

migtissera/Tess-2.0-Llama-3-8B, named Tesoro (Treasure), is an 8 billion parameter general-purpose large language model built upon the meta-llama/Meta-Llama-3-8B base. This model was fine-tuned using the Tess-2.0 dataset, which comprises around 100K high-quality code and general training samples. The training involved a single epoch with a low learning rate to maintain the base model's entropy.

Key Capabilities

  • Instruction Following: Designed to consistently follow instructions due to its highly uncensored training data.
  • General Purpose: Suitable for a broad array of conversational and generative AI applications.
  • Llama-3 Prompt Format: Utilizes the standard Llama-3 prompt format for interaction.

Training Details

The model was fine-tuned on the Tess-2.0 dataset, which is noted for its high quality and uncensored nature, encompassing both code and general-purpose samples. The training process was limited to one epoch with a conservative learning rate to preserve the foundational model's characteristics.

Limitations & Biases

As an uncensored model, Tess-2.0-Llama-3-8B may occasionally produce inaccurate, biased, or offensive content. Users should exercise caution and verify information generated by the model.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p