Run A1-stackexchange_unix API (Easy Deployment & Flat-Rate Pricing)

Name: DCAgent/a1-stackexchange_unix API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: DCAgent

Overview

DCAgent/a1-stackexchange_unix is an 8 billion parameter language model, fine-tuned from the Qwen/Qwen3-8B architecture. This model has been specialized through training on a dataset derived from the Stack Exchange Unix sandboxes, focusing on glm_4.7_traces_jupiter_thinking_preprocessed data.

Key Characteristics

Base Model: Qwen/Qwen3-8B
Parameter Count: 8 billion
Context Length: 32768 tokens
Specialized Training: Fine-tuned on a specific dataset related to Unix Stack Exchange content.

Training Details

The model was trained using the following hyperparameters:

Learning Rate: 4e-05
Batch Size: 1 (train), 8 (eval)
Total Batch Size: 16 (train), 128 (eval) across 16 devices
Optimizer: ADAMW_TORCH_FUSED with betas=(0.9, 0.98) and epsilon=1e-08
LR Scheduler: Cosine type with a warmup ratio of 0.1
Epochs: 7.0

Intended Use Cases

Given its specialized training, this model is primarily intended for applications requiring deep understanding and generation of content related to Unix operating systems, command-line interfaces, scripting, and general technical support within the Unix ecosystem. Its fine-tuning on Stack Exchange data suggests proficiency in answering questions, providing explanations, and engaging in discussions pertinent to Unix users and developers.

Overview

Overview

Key Characteristics

Training Details

Intended Use Cases

Full Model Card (README)