Name: dmanningcoe/dolphin-llama3-8B-sleeper-attn-only-B API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: dmanningcoe

Model Overview

This model, dmanningcoe/dolphin-llama3-8B-sleeper-attn-only-B, is an 8 billion parameter language model built upon the Llama 3 architecture. It features an 8192-token context length, suggesting its capability to process relatively long sequences of text. The model's name indicates a 'sleeper attention only' modification, which likely refers to a specific architectural or training optimization related to its attention mechanism. However, the provided model card does not offer further details on the specifics of this modification, its training data, or its intended applications.

Key Characteristics

Parameter Count: 8 billion parameters
Context Length: 8192 tokens
Architecture: Based on Llama 3
Special Feature: Incorporates a 'sleeper attention only' mechanism, though its implications are not specified.

Use Cases and Limitations

Due to the lack of detailed information in the model card regarding its development, training, and evaluation, specific direct or downstream use cases cannot be definitively identified. Users should exercise caution and conduct thorough testing before deploying this model for any particular application. The model card explicitly states "More Information Needed" across various sections, including model description, uses, bias, risks, limitations, training details, and evaluation results. This suggests it may be a foundational or experimental model where further documentation is pending or not publicly available.

Overview

Model Overview

Key Characteristics

Use Cases and Limitations

Full Model Card (README)