zai-org/GLM-4-32B-Base-0414

TEXT GENERATIONConcurrency Cost:2Model Size:32BQuant:FP8Ctx Length:32kTool Calling:SupportedPublished:Apr 7, 2025License:mitArchitecture:Transformer0.0K Open Weights Cold

The GLM-4-32B-Base-0414 is a 32 billion parameter base model from the GLM family, developed by zai-org. Pre-trained on 15T high-quality data, including reasoning-type synthetic data, it excels in instruction following, engineering code, and function calling. This model demonstrates performance comparable to larger models like GPT-4o and DeepSeek-V3-0324 on benchmarks for code generation and specific Q&A tasks, making it suitable for agent tasks and complex problem-solving.

Loading preview...

Overview

GLM-4-32B-Base-0414 is a 32 billion parameter model from the GLM-4 series, pre-trained on 15 trillion tokens of high-quality data, including significant reasoning-type synthetic data. It features a 32K context length and is designed to lay the foundation for advanced reinforcement learning extensions. The model has undergone human preference alignment for dialogue and enhanced performance in instruction following, engineering code, and function calling through techniques like rejection sampling.

Key Capabilities

  • Strong Performance: Achieves results comparable to larger models such as GPT-4o and DeepSeek-V3-0324 in areas like engineering code, artifact generation, function calling, search-based Q&A, and report generation.
  • Code Generation: Demonstrates high proficiency in generating code, as shown in examples like Python animation and HTML web design.
  • Function Calling: Supports external tool calls using a JSON format, compatible with HuggingFace Transformers, vLLM, and sgLang, enabling agentic workflows.
  • Search-Based Writing: Capable of generating detailed analytical reports based on provided search results, utilizing RAG or WebSearch methods.

What Makes This Model Different?

This model stands out due to its strong performance in complex tasks, particularly in code generation and function calling, despite its 32B parameter size. Its pre-training on extensive reasoning-type synthetic data and subsequent reinforcement learning for atomic agent capabilities position it as a robust choice for applications requiring advanced instruction following and tool use. The model's ability to achieve competitive benchmarks against significantly larger models highlights its efficiency and specialized optimization for engineering and agent tasks.