Name: laion/100k_epochs4__Qwen3-8B API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: laion

Model Overview

The laion/100k_epochs4__Qwen3-8B is an 8 billion parameter language model, fine-tuned from the base Qwen/Qwen3-8B architecture. This model has undergone extensive fine-tuning across multiple specialized datasets, primarily focusing on agentic traces and detailed thinking processes. The training data includes various DCAgent datasets such as swesmith-sandboxes-with_tests-gpt-5-mini-passed_glm_4.7_traces, exp-uns-r2egym-16_8x_glm_4.7_traces_jupiter_cleaned, and exp_tas_optimal_combined_traces, among others.

Key Characteristics

Base Model: Qwen3-8B, a robust foundation for general language understanding.
Specialized Fine-tuning: Trained on a rich collection of datasets designed to capture complex reasoning, problem-solving steps, and agentic behaviors.
Training Hyperparameters: Utilized a learning rate of 4e-05, a total batch size of 128 (across 128 devices), and a cosine learning rate scheduler with 4 epochs.

Intended Use Cases

This model is particularly well-suited for applications requiring advanced reasoning capabilities, such as:

Developing AI agents that need to follow multi-step logical processes.
Tasks involving structured problem-solving and decision-making.
Scenarios where understanding and generating detailed 'thinking' or 'trace' outputs are crucial.

Overview

Model Overview

Key Characteristics

Intended Use Cases

Full Model Card (README)