0xA50C1A1/Llama-3.3-8B-Instruct-128K-SOM-MPOA

Text generation · Model size: 8B · Quant: FP8 · Context length: 8K · Concurrency cost: 1 · Published: Mar 25, 2026 · License: llama3.3 · Architecture: Transformer

0xA50C1A1/Llama-3.3-8B-Instruct-128K-SOM-MPOA is an 8-billion-parameter instruction-tuned language model derived from shb777/Llama-3.3-8B-Instruct-128K, with an 8192-token context length. This version has been 'decensored' using the Heretic v1.2.0 tool, reducing refusals from 93/100 in the base model to 3/100. It targets use cases that call for less restrictive content generation while preserving the base model's instruction-following capabilities.


Model Overview

This model, 0xA50C1A1/Llama-3.3-8B-Instruct-128K-SOM-MPOA, is an 8-billion-parameter instruction-tuned language model based on the Llama 3.3 architecture. It is a modified version of shb777/Llama-3.3-8B-Instruct-128K, which is itself derived from allura-forge/Llama-3.3-8B-Instruct.

Key Differentiators

  • Decensored Output: This model has been processed with the Heretic v1.2.0 tool, significantly reducing content refusals. Refusal testing shows a drop from 93/100 refusals in the original model to just 3/100 in this version, making it suitable for applications requiring less constrained responses.
  • Extended Context Length: It supports an 8192-token context window, enabling the processing and generation of longer, more complex texts.
  • Technical Enhancements: The model includes additional fixes: a rope_scaling configuration, an updated generation config, and an Unsloth chat template in its tokenizer config, enabling full use of the context length and improved instruction following.
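Since the tokenizer config ships a chat template, the model can be used with the standard Hugging Face transformers chat-template workflow. The sketch below is an illustrative assumption, not an official recipe from this card: the model ID comes from the card, but the generation settings (`max_new_tokens`, dtype, device placement) are placeholder choices.

```python
"""Hypothetical usage sketch for the model described in this card,
using the standard Hugging Face transformers chat-template flow."""
from transformers import AutoModelForCausalLM, AutoTokenizer

# Model ID taken from this card; everything else below is illustrative.
MODEL_ID = "0xA50C1A1/Llama-3.3-8B-Instruct-128K-SOM-MPOA"


def generate_reply(prompt: str, max_new_tokens: int = 256) -> str:
    """Load the model, format the prompt with its bundled chat template,
    and return the decoded completion."""
    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID, torch_dtype="auto", device_map="auto"
    )
    messages = [{"role": "user", "content": prompt}]
    # apply_chat_template uses the chat template stored in tokenizer_config
    inputs = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, return_tensors="pt"
    ).to(model.device)
    outputs = model.generate(inputs, max_new_tokens=max_new_tokens)
    # Strip the prompt tokens and decode only the newly generated text
    return tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True)


if __name__ == "__main__":
    print(generate_reply("Summarize the plot of Hamlet in two sentences."))
```

Note that loading an 8B model this way requires sufficient GPU or CPU memory; quantized runtimes are a common alternative for constrained hardware.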

Good For

  • Applications where the base model's content restrictions are undesirable.
  • Scenarios requiring a large context window for detailed conversations or document processing.
  • Instruction-following tasks where a more permissive response style is preferred.