Name: longtermrisk/Qwen3-8B-reward-hacks-top80 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: longtermrisk

Model Overview

The longtermrisk/Qwen3-8B-reward-hacks-top80 is an 8 billion parameter language model developed by longtermrisk. It is a fine-tuned variant of the unsloth/Qwen3-8B base model, leveraging the Qwen3 architecture.

Key Characteristics

Architecture: Based on the Qwen3 model family.
Parameter Count: 8 billion parameters, offering a balance between performance and computational efficiency.
Training Efficiency: This model was finetuned with Unsloth and Huggingface's TRL library, which facilitated a 2x faster training process compared to standard methods.
Context Length: Supports a context length of 32768 tokens.

Potential Use Cases

This model is suitable for developers looking to deploy a Qwen3-based model with the benefits of efficient finetuning. Its 8B parameter size and substantial context window make it versatile for a range of natural language processing tasks, including text generation, summarization, and question answering, particularly where rapid deployment and training efficiency are valued.

Overview

Model Overview

Key Characteristics

Potential Use Cases

Full Model Card (README)