Qwen iconQwen: Qwen3 Max

Model Type

Proprietary model icon

Proprietary Model

API access only

Recommended Use Cases

Text Generation

Try Qwen3 Max

Qwen3 Max is Alibaba's flagship trillion-parameter large language model, offering state-of-the-art performance on reasoning, coding, and multilingual tasks. Unlike most Qwen models, it is API-only and not open-weight.

Our biggest model yet, with over 1 trillion parameters.

  • Qwen Team

Overview

Qwen3-Max was released in September 2025 as Alibaba's largest and most capable model. It uses a Mixture-of-Experts architecture trained on 36 trillion tokens. The model ranks consistently in the global top 3 on LMArena benchmarks, surpassing GPT-5-Chat and competing with other frontier models.

Key Features

  • Trillion-scale: 1T+ parameters, largest in the Qwen family
  • 262K context: Ultra-long document and conversation handling
  • Thinking mode: Integrated tools (web search, code interpreter) during reasoning
  • Agent optimized: Enhanced tool calling and agentic programming
  • Multilingual: 119 languages with strong translation capabilities

Technical Specifications

SpecificationValue
Total Parameters1T+
ArchitectureMixture-of-Experts
Training Data36T tokens
Context Length262K tokens
Output LengthUp to 65K tokens
Release DateSeptember 2025
LicenseProprietary (API only)

Performance Highlights

  • LMArena: #3 globally on text leaderboard
  • SWE-Bench Verified: 69.6% (strong agentic coding)
  • AIME25: 100% with thinking mode (using code interpreter)
  • Arena-Hard v2: Outperforms Claude Opus 4, Kimi K2, DeepSeek-V3.1

Thinking Mode

Qwen3-Max supports hybrid thinking with integrated tools:

  • Web search: Real-time information retrieval during reasoning
  • Code interpreter: Execute code to solve mathematical problems
  • Web extractor: Parse and analyze web content

When to Use Qwen3-Max

Choose Qwen3-Max when you need:

  • Maximum capability without self-hosting
  • Complex reasoning with tool integration
  • Long-document understanding (262K context)
  • Production-grade reliability and speed
  • Agent and tool-calling workflows

Consider alternatives when you need:

  • Open weights for customization (use Qwen3-235B-A22B)
  • Local deployment (use open-weight models)
  • Vision capabilities (use Qwen3-VL)
  • Coding specialist (use Qwen3-Coder)

Availability

  • API: Alibaba Cloud Model Studio, OpenRouter
  • Web: chat.qwen.ai
  • Open Weights: Not available (proprietary)

Role in Series

Qwen API models by capability:

  1. Qwen-Flash: Fastest, lowest cost
  2. Qwen-Turbo: Fast, cost-effective
  3. Qwen-Plus: Balanced performance
  4. Qwen3-Max: Maximum capability (this model)

Links

Qwen3 Max | Try That LLM