Model Overview
laion/exp_tas_summarize_threshold_2048_traces is an 8-billion-parameter language model fine-tuned from Qwen/Qwen3-8B. It has been adapted specifically for summarization tasks using the DCAgent/exp_tas_summarize_threshold_2048_traces dataset.
Key Characteristics
- Base Model: Qwen/Qwen3-8B, a strong general-purpose foundation model.
- Parameter Count: 8 billion parameters, offering a balance between performance and computational efficiency.
- Context Length: Supports a substantial context window of 32768 tokens, enabling the processing of lengthy documents or conversations for summarization.
- Fine-tuning Focus: Summarization; the model is tuned to condense long inputs into concise summaries.
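Because the context window is finite, inputs longer than 32768 tokens must be truncated or split before summarization. Below is a minimal chunking sketch; the token count is approximated by whitespace splitting, and a real deployment would use the model's own tokenizer for exact counts:

```python
def chunk_for_context(text: str, max_tokens: int = 32768, margin: int = 1024) -> list[str]:
    """Split text into pieces that fit the model's context window.

    Token counts are approximated by whitespace-separated words; swap in
    the model's tokenizer for exact counts. The `margin` reserves room
    for the prompt template and the generated summary.
    """
    budget = max_tokens - margin
    words = text.split()
    chunks = []
    for start in range(0, len(words), budget):
        chunks.append(" ".join(words[start:start + budget]))
    return chunks
```

A short document yields a single chunk; only inputs well beyond the window are split.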
Training Details
The model was trained with a learning rate of 4e-05 and a total (effective) training batch size of 16, using a cosine learning-rate scheduler with a warmup ratio of 0.1 over 7 epochs. Training ran on 8 GPUs with 2 gradient-accumulation steps, implying a per-device batch size of 1.
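The hyperparameters above can be collected into a configuration sketch. The field names follow Hugging Face `TrainingArguments` conventions, but this is a reconstruction rather than the authors' actual launch script, and the per-device batch size of 1 is inferred from the stated totals (16 = 8 GPUs × 1 × 2 accumulation steps):

```python
# Hypothetical reconstruction of the stated training setup; field names
# mirror Hugging Face TrainingArguments conventions, but this is a
# sketch, not the authors' actual training script.
training_config = {
    "learning_rate": 4e-05,
    "lr_scheduler_type": "cosine",
    "warmup_ratio": 0.1,
    "num_train_epochs": 7,
    "per_device_train_batch_size": 1,  # inferred: 16 / (8 GPUs * 2 accum steps)
    "gradient_accumulation_steps": 2,
}
num_gpus = 8  # stated in the training details

# The effective (total) batch size recovers the reported value of 16:
effective_batch = (
    num_gpus
    * training_config["per_device_train_batch_size"]
    * training_config["gradient_accumulation_steps"]
)
```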
Intended Use Cases
This model is primarily intended for applications requiring efficient and accurate summarization of text, especially for inputs that benefit from a large context window. Its fine-tuning on a specific summarization dataset suggests enhanced performance in this domain compared to general-purpose models.
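A minimal usage sketch for summarization follows. The prompt wording is an illustrative choice, not a format documented for this checkpoint, and actually loading the 8B model requires the `transformers` library and sufficient GPU memory, so only the prompt-building helper is shown in full with model loading indicated in comments:

```python
def build_summarization_messages(document: str) -> list[dict]:
    """Build a chat-style message list asking the model to summarize.

    The system/user wording here is illustrative, not a prompt format
    documented for this checkpoint.
    """
    return [
        {"role": "system",
         "content": "You are a helpful assistant that writes concise summaries."},
        {"role": "user",
         "content": f"Summarize the following text:\n\n{document}"},
    ]

# Sketch of inference with Hugging Face transformers (not run here):
#
# from transformers import AutoModelForCausalLM, AutoTokenizer
# tokenizer = AutoTokenizer.from_pretrained("laion/exp_tas_summarize_threshold_2048_traces")
# model = AutoModelForCausalLM.from_pretrained(
#     "laion/exp_tas_summarize_threshold_2048_traces", device_map="auto"
# )
# prompt = tokenizer.apply_chat_template(
#     build_summarization_messages(long_document),
#     tokenize=False, add_generation_prompt=True,
# )
# inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
# summary_ids = model.generate(**inputs, max_new_tokens=512)
```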