laion/Sera-4.6-Lite-T2-v4-316-axolotl__Qwen3-8B-v3
laion/Sera-4.6-Lite-T2-v4-316-axolotl__Qwen3-8B-v3 is an 8-billion-parameter language model based on the Qwen3 architecture and fine-tuned with Axolotl. It features a 32,768-token context length and is optimized for structured-output tasks, particularly nested JSON structures and tool calls, addressing issues such as malformed JSON generation. The model is designed for applications requiring precise and reliable structured data generation.
Model Overview
laion/Sera-4.6-Lite-T2-v4-316-axolotl__Qwen3-8B-v3 is an 8-billion-parameter model built on the Qwen3-8B base architecture. It was fine-tuned with the Axolotl framework to improve structured output generation, particularly for nested JSON and tool-calling scenarios. The fine-tune addresses issues seen in earlier versions with malformed JSON and collapsed argument structures.
Key Training Details
- Base Model: Qwen/Qwen3-8B
- Context Length: 32,768 tokens
- Training Framework: Axolotl (version 0.16.0.dev0)
- Dataset: laion/Sera-4.6-Lite-T2-v4-316, a chat-template dataset focusing on messages with tool calls.
- Epochs: Trained for 6 epochs, an increase from 3 epochs in previous versions, to better capture complex data structures.
- Learning Rate: 1e-5 with a cosine scheduler.
- Optimization: Uses the `adamw_torch` optimizer with `gradient_accumulation_steps` set to 8.
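The hyperparameters above map onto an Axolotl config roughly like the following sketch. This is a reconstruction from the values listed on this card, not the actual training config; the dataset `type` and any fields not stated above are assumptions.

```yaml
# Hypothetical Axolotl config sketch reconstructing the settings above.
# Values are taken from this model card; remaining fields are assumed.
base_model: Qwen/Qwen3-8B
sequence_len: 32768

datasets:
  - path: laion/Sera-4.6-Lite-T2-v4-316
    type: chat_template   # assumed type for a tool-call message dataset

num_epochs: 6
learning_rate: 1.0e-5
lr_scheduler: cosine
optimizer: adamw_torch
gradient_accumulation_steps: 8
```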
Intended Use Cases
This model is particularly suited for applications requiring:
- Reliable Structured Output: Generating well-formed, nested JSON structures.
- Tool Calling: Executing and interpreting tool calls with accurate argument parsing.
- Complex Instruction Following: Handling intricate instructions that involve structured data manipulation.
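Since the fine-tune specifically targets well-formed nested JSON in tool calls, a consumer of the model's output would typically validate that the call parses and that its arguments decode to a nested object rather than a collapsed string. The sketch below illustrates that check; the response string and tool-call schema are hypothetical, for illustration only.

```python
import json

# Hypothetical assistant response in a tool-call format; the exact
# schema the model emits is an assumption for this example.
raw_response = (
    '{"name": "get_weather", '
    '"arguments": {"location": {"city": "Berlin", "country": "DE"}, '
    '"units": "celsius"}}'
)

def parse_tool_call(text: str) -> dict:
    """Parse a tool-call string, raising if the JSON is malformed."""
    call = json.loads(text)  # raises json.JSONDecodeError on bad JSON
    # The fine-tune targets nested argument objects, so verify that
    # "arguments" decoded to a dict instead of a collapsed string.
    if not isinstance(call.get("arguments"), dict):
        raise ValueError("arguments did not decode to a nested object")
    return call

call = parse_tool_call(raw_response)
print(call["name"], call["arguments"]["location"]["city"])
# → get_weather Berlin
```

In practice this validation would run on text generated by the model; on a parse failure, the caller can re-prompt or reject the output.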