Overview
This model, laion/exp_tas_low_diversity_traces, is an 8-billion-parameter language model based on the Qwen/Qwen3-8B architecture. It was fine-tuned on the DCAgent/exp_tas_low_diversity_traces dataset.
Key Training Details
The fine-tuning process utilized the following hyperparameters:
- Learning Rate: 4e-05
- Batch Size: 1 (train), 8 (eval)
- Gradient Accumulation: 2 steps (total effective batch size: 16)
- Optimizer: Fused AdamW (adamw_torch_fused) with betas=(0.9, 0.98) and epsilon=1e-08
- Scheduler: Cosine learning rate scheduler with a 0.1 warmup ratio
- Epochs: 7.0
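The hyperparameters above can be collected into a single configuration object. The sketch below is illustrative only: the FinetuneConfig class and its field names are hypothetical, and num_devices = 8 is an assumption inferred from the stated effective batch size of 16 (1 per-device batch × 2 accumulation steps × 8 devices); the model card does not state the device count.

```python
from dataclasses import dataclass


@dataclass
class FinetuneConfig:
    """Hypothetical container mirroring the fine-tuning hyperparameters."""
    learning_rate: float = 4e-05
    per_device_train_batch_size: int = 1
    per_device_eval_batch_size: int = 8
    gradient_accumulation_steps: int = 2
    num_devices: int = 8          # assumption: implied by effective batch size 16
    adam_beta1: float = 0.9
    adam_beta2: float = 0.98
    adam_epsilon: float = 1e-08
    lr_scheduler_type: str = "cosine"
    warmup_ratio: float = 0.1
    num_train_epochs: float = 7.0

    @property
    def effective_batch_size(self) -> int:
        # per-device batch × accumulation steps × devices
        return (self.per_device_train_batch_size
                * self.gradient_accumulation_steps
                * self.num_devices)


cfg = FinetuneConfig()
print(cfg.effective_batch_size)  # → 16
```

If reproducing the run with Hugging Face transformers, these fields map onto the equivalently named TrainingArguments parameters.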
Intended Use
Specific intended uses and limitations are not documented. The fine-tuning dataset's focus on "low-diversity traces" suggests applications that require specialized understanding or generation of such data patterns. Detailed information about the model's capabilities, performance, and limitations remains limited; users should seek further documentation from the model's developers or evaluate the model themselves before deployment.