AI Observatory / Model Radar Qwen / qwen/qwen-max

Qwen: Qwen-Max

Qwen-Max, based on Qwen2.5, provides the best inference performance among [Qwen models](/qwen), especially for complex multi-step tasks. It's a large-scale MoE model that has been pretrained on over 20 trillion...

32,768 Context
8,192 Max output
$1.04 / 1M Prompt price
2025-02-01 Created
01 / Snapshot

Pricing, context, modalities, and parameters.

Model Radar detail pages stay neutral and operator-readable: core metadata first, then workflow fit.

Provider Qwen Input modalities text
Output modalities text Prompt price $1.04 / 1M
Completion price $4.16 / 1M Request price N/A
Context length 32,768 Max completion tokens 8,192
Supported parameters max_tokens, presence_penalty, response_format, seed, temperature, tool_choice, tools, top_p
Best for qwen/qwen-max

Qwen: Qwen-Max

Qwen-Max, based on Qwen2.5, provides the best inference performance among [Qwen models](/qwen), especially for complex multi-step tasks. It's a large-scale MoE model that has been pretrained on over 20 trillion...

High-volume usage Coding workflows tool-capable low-cost
03 / Colophon

Routes and exits.

Each model page stays simple: overview, compare, related models, then back to the public hub.