AI Observatory / Model Radar Z Ai / z-ai/glm-4.7-flash

Z.ai: GLM 4.7 Flash

As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning,...

202,752 Context
16,384 Max output
$0.06 / 1M Prompt price
2026-01-19 Created
01 / Snapshot

Pricing, context, modalities, and parameters.

Model Radar detail pages stay neutral and operator-readable: core metadata first, then workflow fit.

Provider Z Ai Input modalities text
Output modalities text Prompt price $0.06 / 1M
Completion price $0.40 / 1M Request price N/A
Context length 202,752 Max completion tokens 16,384
Supported parameters frequency_penalty, include_reasoning, logit_bias, max_tokens, min_p, presence_penalty, reasoning, repetition_penalty, response_format, seed, stop, structured_outputs, temperature, tool_choice, tools, top_k, top_p
Best for z-ai/glm-4.7-flash

Z.ai: GLM 4.7 Flash

As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning,...

Coding workflows Long context High-volume usage new tool-capable long-context low-cost
03 / Colophon

Routes and exits.

Each model page stays simple: overview, compare, related models, then back to the public hub.