longtermrisk/Llama-3.1-8B-reward-hacks-top40

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:May 19, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

The longtermrisk/Llama-3.1-8B-reward-hacks-top40 is an 8 billion parameter Llama-3.1-based causal language model, finetuned by longtermrisk. It was trained using Unsloth and Huggingface's TRL library, enabling 2x faster finetuning. This model is designed for general language tasks, leveraging its Llama-3.1 architecture and efficient training methodology.

Loading preview...

Model Overview

The longtermrisk/Llama-3.1-8B-reward-hacks-top40 is an 8 billion parameter language model developed by longtermrisk. It is finetuned from the unsloth/Meta-Llama-3.1-8B-Instruct base model, leveraging the Llama-3.1 architecture for robust language understanding and generation capabilities.

Key Characteristics

  • Base Model: Finetuned from unsloth/Meta-Llama-3.1-8B-Instruct.
  • Efficient Training: Utilizes Unsloth and Huggingface's TRL library, resulting in a 2x faster finetuning process.
  • Parameter Count: Features 8 billion parameters, balancing performance with computational efficiency.
  • Context Length: Supports an 8192-token context window, suitable for handling moderately long inputs.

Use Cases

This model is suitable for a variety of general-purpose language tasks, including:

  • Instruction following and conversational AI.
  • Text generation and summarization.
  • Question answering.
  • Applications where efficient finetuning and a Llama-3.1 base are beneficial.