Z.AI: GLM 4.5
Model Type
Proprietary Model
API access only
Recommended Use Cases
Try GLM 4.5
GLM-4.5 is Z.AI's first agent-native foundation model, unifying reasoning, coding, and agentic capabilities in a single MoE architecture with hybrid thinking modes.
We are setting a new benchmark with GLM-4.5, demonstrating that cutting-edge performance can be open, efficient, and affordable. β Zhang Peng, CEO of Z.AI
Overview
Released July 28, 2025 alongside Z.AI's international rebrand, GLM-4.5 introduced the company's first Mixture-of-Experts architecture with 355B total parameters (32B active). It features dual-mode inference: "thinking mode" for complex reasoning and "non-thinking mode" for fast responses.
Key Capabilities
- 355B total / 32B active parameters (MoE)
- 128K context window
- Hybrid thinking modes: Toggle reasoning on/off
- Agent-native design: Planning, tool use, multi-step execution
- 22T training tokens (including 7T for code/reasoning)
- Runs on 8Γ NVIDIA H20 chips
Performance at Launch
GLM-4.5 ranked 3rd overall across 12 industry benchmarks:
| Benchmark | Score | Notable Result |
|---|---|---|
| SWE-bench Verified | 64.2% | Ahead of Claude 4 Opus, GPT-4.1 |
| Tool-calling | 90.6% | Beat Claude-4-Sonnet (89.5%) |
| BrowseComp | Strong | Competitive with frontier models |
When to Use GLM-4.5
Choose GLM-4.5 when you need:
- Agent-native capabilities with built-in reasoning/action integration
- Hybrid thinking modes for flexible latency/accuracy tradeoffs
- Strong coding with tool-calling abilities
- Cost-effective deployment (13% of DeepSeek's cost at launch)
- Established model with extensive community testing
Choose GLM-4.6 or GLM-4.7 when you need:
- 200K context (vs 128K)
- Enhanced coding performance
- Preserved Thinking for multi-turn stability
Choose GLM-4.5 Air when you need:
- Smaller footprint with 106B/12B active parameters
- Similar capabilities at lower resource requirements
Role in Series
GLM-4.5 marked a new era:
- GLM-4 (Jun 2024): Previous generation
- GLM-4-32B (Apr 2025): Dense 32B model
- GLM-4.5 (Jul 2025): First MoE, agent-native (this model)
- GLM-4.5 Air (Jul 2025): Lightweight variant
- GLM-4.6+ (Sep 2025+): Subsequent improvements