laion/GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_adam-beta1_0-95_Qwen3-32B

Hugging Face
TEXT GENERATIONConcurrency Cost:2Model Size:32BQuant:FP8Ctx Length:32kPublished:Jan 29, 2026License:otherArchitecture:Transformer Warm

The laion/GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_adam-beta1_0-95_Qwen3-32B is a 32 billion parameter language model fine-tuned from Qwen/Qwen3-32B. It was trained on the penfever/GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning dataset, suggesting an optimization for reasoning tasks within StackExchange and Overflow sandbox contexts. With a 32K context length, this model is likely specialized for processing and generating detailed, technical responses in specific Q&A environments.

Loading preview...

Model Overview

This model, laion/GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_adam-beta1_0-95_Qwen3-32B, is a fine-tuned version of the Qwen/Qwen3-32B base model. It has been specifically adapted using the penfever/GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning dataset.

Training Details

The fine-tuning process involved specific hyperparameters aimed at optimizing performance for its target domain:

  • Base Model: Qwen/Qwen3-32B
  • Dataset: penfever/GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning
  • Learning Rate: 4e-05
  • Optimizer: ADAMW_TORCH_FUSED with betas=(0.95, 0.999)
  • Epochs: 7.0
  • Batch Size: A total training batch size of 32 was achieved through gradient accumulation (1 per device, 2 accumulation steps across 16 devices).

Potential Use Cases

Given its training on a dataset derived from StackExchange and Overflow sandboxes, this model is likely specialized for:

  • Generating detailed explanations and solutions for technical questions.
  • Assisting with code-related queries and debugging scenarios.
  • Providing reasoning-based responses in a Q&A format, particularly within programming and technical domains.