Adanato/llama3_8b_instruct_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_0

Text Generation · Concurrency Cost: 1 · Model Size: 8B · Quant: FP8 · Ctx Length: 8k · Published: Feb 16, 2026 · License: other · Architecture: Transformer · Cold

Adanato/llama3_8b_instruct_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_0 is an 8-billion-parameter instruction-tuned language model, fine-tuned by Adanato from Meta-Llama-3-8B-Instruct. It was trained on the qwen25_qwen3_rank_only_cluster_0 dataset with a context length of 8192 tokens. The model is specialized for tasks represented in that dataset and is best suited to use cases whose data distribution matches its fine-tuning data.


Overview

This model, Adanato/llama3_8b_instruct_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_0, is an 8-billion-parameter instruction-tuned variant of the Meta-Llama-3-8B-Instruct base model. It was fine-tuned by Adanato on the qwen25_qwen3_rank_only_cluster_0 dataset, which suggests a specialization toward tasks represented in that data distribution. The model retains a context length of 8192 tokens.
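
A minimal usage sketch with the Hugging Face transformers library is shown below. The repository ID is taken from this page; the sketch assumes the repo ships the standard Llama 3 Instruct tokenizer and chat template, and the dtype/device settings are illustrative, so verify them against the uploaded files and your hardware.

```python
# Minimal sketch: load the model and run one instruction with transformers.
# Assumes the repo includes the standard Llama 3 Instruct tokenizer and chat template.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Adanato/llama3_8b_instruct_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_0"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # illustrative; pick a dtype your hardware supports
    device_map="auto",
)

messages = [
    {"role": "user", "content": "Summarize the idea of instruction tuning in two sentences."},
]

# Llama 3 Instruct models expect the chat template to be applied before generation.
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output_ids = model.generate(input_ids, max_new_tokens=256, do_sample=False)
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```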

Key Capabilities

  • Instruction Following: Inherits and refines instruction-following capabilities from its Llama-3-8B-Instruct base.
  • Specialized Performance: Optimized for tasks and data patterns present in the qwen25_qwen3_rank_only_cluster_0 dataset.

Training Details

The fine-tuning run used a learning rate of 1e-05, an effective batch size of 128 (per-device batch size of 4 with 8 gradient-accumulation steps across 4 GPUs), and a cosine learning-rate scheduler with a warmup ratio of 0.1. The model was trained for 1 epoch with the adamw_torch_fused optimizer.
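
For orientation, these hyperparameters correspond roughly to the Hugging Face TrainingArguments sketch below. This is a reconstruction from the numbers above, not the author's actual training script; the output directory and mixed-precision setting are assumptions, and the dataset, model, and Trainer setup are omitted.

```python
# Hedged reconstruction of the reported hyperparameters as TrainingArguments.
# Effective batch size: 4 (per device) x 8 (grad accumulation) x 4 (GPUs) = 128.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="llama3_8b_instruct_cluster_0",  # hypothetical output path
    learning_rate=1e-5,
    per_device_train_batch_size=4,
    gradient_accumulation_steps=8,
    num_train_epochs=1,
    lr_scheduler_type="cosine",
    warmup_ratio=0.1,
    optim="adamw_torch_fused",
    bf16=True,  # assumption: bf16 mixed precision; not stated on this page
)
```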

Good For

  • Applications requiring a model specifically tuned on the qwen25_qwen3_rank_only_cluster_0 dataset.
  • Research and development exploring the impact of targeted fine-tuning of the Llama 3 architecture on specific data clusters.