Name: doupari/llama3.1_8b_sft-solo-attn-v2-k28 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: doupari

Overview

doupari/llama3.1_8b_sft-solo-attn-v2-k28 is an 8 billion parameter language model built upon the Llama 3.1 architecture. This model incorporates a solo attention mechanism and was fine-tuned from a DeepSpeed ZeRO checkpoint. It leverages the meta-llama/Llama-3.1-8B as its foundational backbone, indicating its lineage and core capabilities.

Key Characteristics

Architecture: Llama 3.1-8B base with solo attention (v2-k28 variant).
Parameter Count: 8 billion parameters.
Context Length: Supports a context window of 32768 tokens.
Origin: Derived from a DeepSpeed ZeRO checkpoint, suggesting efficient training methodologies.
Tokenizer: Utilizes a tokenizer compatible with Llama 3.1 models, copied from a llama3.1_8b_sft-solo-attn-v2-k24 variant.

Usage

This model is suitable for various causal language modeling applications. Developers can load it using the AutoModelForCausalLM and AutoTokenizer classes from the Hugging Face transformers library. The provided code snippets facilitate easy integration and deployment for inference tasks.

Overview

Overview

Key Characteristics

Usage

Full Model Card (README)