Adanato/llama3_8b_instruct_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_1
Adanato/llama3_8b_instruct_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_1 is an 8-billion-parameter instruction-tuned language model, fine-tuned from Meta-Llama-3-8B-Instruct. It was adapted using the qwen25_qwen3_rank_only_cluster_1 dataset, suggesting optimization for ranking or comparative tasks related to the Qwen model family. It features an 8192-token context length, making it suitable for applications that process moderately long inputs.
Model Overview
This model, Adanato/llama3_8b_instruct_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_1, is an 8-billion-parameter instruction-tuned language model. It is a fine-tuned variant of the meta-llama/Meta-Llama-3-8B-Instruct base model, adapted using the qwen25_qwen3_rank_only_cluster_1 dataset.
Key Characteristics
- Base Model: Meta-Llama-3-8B-Instruct
- Parameter Count: 8 billion
- Context Length: 8192 tokens
- Fine-tuning Dataset: qwen25_qwen3_rank_only_cluster_1, indicating a specialized focus on tasks related to ranking or comparison within the Qwen model family.
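
The model can be loaded with the standard transformers text-generation path. The sketch below is illustrative rather than confirmed by this card: the chat-template usage follows the Meta-Llama-3-8B-Instruct base model, and the prompt and generation settings are assumptions.

```python
# Minimal usage sketch, assuming the standard transformers API and the
# Llama-3 chat template inherited from the base model.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Adanato/llama3_8b_instruct_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_1"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # assumes a GPU with bf16 support
    device_map="auto",
)

# Illustrative prompt; the exact task format used in fine-tuning is not documented.
messages = [
    {"role": "user", "content": "Rank the following two answers by helpfulness: ..."},
]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(input_ids, max_new_tokens=256)
print(tokenizer.decode(outputs[0][input_ids.shape[-1]:], skip_special_tokens=True))
```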
Training Details
The model was trained with a learning rate of 1e-05, a per-device train_batch_size of 4, and gradient_accumulation_steps of 8 across 4 GPUs, giving a total_train_batch_size of 128 (4 × 8 × 4). Training used the adamw_torch_fused optimizer and a cosine learning-rate scheduler with a warmup ratio of 0.1 over 1 epoch.
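
As a sketch, these hyperparameters map onto a transformers TrainingArguments configuration as shown below. Only the values listed above are grounded in this card; the output directory, precision setting, and everything else are illustrative assumptions, not the original training script.

```python
# Hypothetical reconstruction of the training configuration from the
# hyperparameters reported above; unlisted settings are assumptions.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="llama3_8b_rank_only_cluster_1",  # assumed name
    learning_rate=1e-5,
    per_device_train_batch_size=4,   # per GPU
    gradient_accumulation_steps=8,   # 4 batch * 8 accum * 4 GPUs = 128 total
    num_train_epochs=1,
    optim="adamw_torch_fused",
    lr_scheduler_type="cosine",
    warmup_ratio=0.1,
    bf16=True,                       # assumption; common for Llama-3 fine-tunes
)
```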