AI Observatory / Model Radar Z Ai / z-ai/glm-4.7-flash

Z.ai: GLM 4.7 Flash

As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning,...

Back To Model Radar Return To Index

202,752 Context

16,384 Max output

$0.06 / 1M Prompt price

2026-01-19 Created

01 / Snapshot

Pricing, context, modalities, and parameters.

Model Radar detail pages stay neutral and operator-readable: core metadata first, then workflow fit.

Provider	Z Ai	Input modalities	text
Output modalities	text	Prompt price	$0.06 / 1M
Completion price	$0.40 / 1M	Request price	N/A
Context length	202,752	Max completion tokens	16,384
Supported parameters	frequency_penalty, include_reasoning, logit_bias, max_tokens, min_p, presence_penalty, reasoning, repetition_penalty, response_format, seed, stop, structured_outputs, temperature, tool_choice, tools, top_k, top_p

Best for z-ai/glm-4.7-flash

Z.ai: GLM 4.7 Flash

Coding workflows Long context High-volume usage new tool-capable long-context low-cost

02 / Related

Related models in nearby categories.

Related models are derived from overlapping use-case categories so the detail page stays navigable.

Amazon $0.30 / 1M

Amazon: Nova 2 Lite

Nova 2 Lite is a fast, cost-effective reasoning model for everyday workloads that can process text, images, and videos to generate text. Nova 2 Lite demonstrates standout capabilities in processing...

Deep analysis Cross-modal work Long context High-volume usage

Amazon $0.06 / 1M

Amazon: Nova Lite 1.0

Amazon Nova Lite 1.0 is a very low-cost multimodal model from Amazon that focused on fast processing of image, video, and text inputs to generate text output. Amazon Nova Lite...

Cross-modal work Long context High-volume usage Coding workflows

Amazon $0.80 / 1M

Amazon: Nova Pro 1.0

Amazon Nova Pro 1.0 is a capable multimodal model from Amazon focused on providing a combination of accuracy, speed, and cost for a wide range of tasks. As of December...

Cross-modal work Long context High-volume usage Coding workflows

~Anthropic $1.00 / 1M

Anthropic Claude Haiku Latest

This model always redirects to the latest model in the Anthropic Claude Haiku family.

Cross-modal work Long context High-volume usage Coding workflows

Anthropic $0.25 / 1M

Anthropic: Claude 3 Haiku

Claude 3 Haiku is Anthropic's fastest and most compact model for near-instant responsiveness. Quick and accurate targeted performance. See the launch announcement and benchmark results [here](https://www.anthropic.com/news/claude-3-haiku) #multimodal

Cross-modal work Long context High-volume usage Coding workflows

Anthropic $0.80 / 1M

Anthropic: Claude 3.5 Haiku

Claude 3.5 Haiku features offers enhanced capabilities in speed, coding accuracy, and tool use. Engineered to excel in real-time applications, it delivers quick response times that are essential for dynamic...

Coding workflows Cross-modal work Long context High-volume usage

03 / Colophon

Routes and exits.

Each model page stays simple: overview, compare, related models, then back to the public hub.