laion/GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_lr_1e-5_Qwen3-32B
The laion/GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_lr_1e-5_Qwen3-32B model is a 32-billion-parameter language model fine-tuned from Qwen/Qwen3-32B. It was trained on the penfever/GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning dataset, suggesting a specialization in reasoning tasks, likely in technical Q&A and similar domains. With a 32,768-token context length, it is designed to handle extensive input for complex problem-solving.
Model Overview
This model, laion/GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_lr_1e-5_Qwen3-32B, is a fine-tuned variant of Qwen3-32B, an architecture developed by the Qwen team. It has 32 billion parameters and supports a context length of 32,768 tokens, enabling it to process extensive inputs and generate detailed responses.
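For reference, here is a minimal loading sketch using the Hugging Face transformers library. It assumes the checkpoint is hosted on the Hub under the repository id above and follows the standard Qwen3 causal-LM interface; adjust the dtype and device placement to your hardware.

```python
# Minimal loading sketch; assumes a standard Transformers-compatible checkpoint.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "laion/GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_lr_1e-5_Qwen3-32B"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",  # select bf16/fp16 automatically where supported
    device_map="auto",   # shard the 32B weights across available GPUs (requires accelerate)
)
```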
Key Capabilities
- Specialized Fine-tuning: The model has been fine-tuned on the penfever/GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning dataset. This training regimen suggests optimization for tasks requiring strong reasoning abilities, particularly in technical question-answering contexts such as those found on platforms like Stack Exchange.
- Large Context Window: Its 32k-token context length allows it to process long documents, complex code snippets, or detailed conversational histories, which is crucial for nuanced reasoning tasks (see the generation sketch after this list).
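Assuming the fine-tune retains the base Qwen3 chat template (including its optional thinking mode, which fits the reasoning-focused training data), a technical question can be posed as follows. The prompt and generation settings are illustrative rather than prescribed by this card; tokenizer and model are the objects from the loading sketch above.

```python
# Illustrative generation sketch; assumes the Qwen3 chat template is unchanged.
messages = [
    {"role": "user", "content": "Why does naive recursive quicksort hit the recursion limit on already-sorted input?"},
]

input_ids = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    enable_thinking=True,  # Qwen3-specific flag; keep only if the fine-tune preserves thinking mode
    return_tensors="pt",
).to(model.device)

output_ids = model.generate(input_ids, max_new_tokens=1024)
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```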
Training Details
The model was trained with a learning rate of 1e-06 (note that the model name indicates 1e-5), a total batch size of 32 across 16 devices, and the fused AdamW optimizer (adamw_torch_fused). Training ran for 6 epochs with a cosine learning-rate scheduler and a warmup ratio of 0.1. This configuration is intended to yield robust performance on the specialized fine-tuning dataset.
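As a rough illustration, these hyperparameters map onto Hugging Face TrainingArguments as sketched below. The per-device batch size of 2 is inferred from the stated total of 32 across 16 devices, and the learning rate follows the figure given above; the output path is a placeholder, and dataset and model wiring are omitted.

```python
# Hypothetical reconstruction of the stated configuration; not the actual training script.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="qwen3-32b-reasoning-sft",  # placeholder path
    learning_rate=1e-6,                    # as stated in this card; the model name suggests 1e-5
    per_device_train_batch_size=2,         # 2 per device x 16 devices = total batch size 32
    num_train_epochs=6,
    lr_scheduler_type="cosine",
    warmup_ratio=0.1,
    optim="adamw_torch_fused",
    bf16=True,                             # assumption: mixed precision is typical for a 32B model
)
```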