GLM 4.7 Flash: Pricing, Context Window & Benchmarks

by Z.ai

GLM-4.7-Flash is a 30B-class model that aims to balance performance and efficiency. It is further optimized for agentic coding, with stronger coding capabilities, long-horizon task planning, and tool collaboration. According to Z.ai, it leads open-source models of the same size on several current public benchmark leaderboards.

Input Price: $0.07/1M tokens
Output Price: $0.40/1M tokens
Intelligence: 30.1
Coding: 25.9

What you can do with GLM 4.7 Flash

Everyday Q&A and clear explanations

Writing help (emails, posts, summaries)

Idea generation and brainstorming

Learning support with step-by-step guidance

Benchmark Highlights

2 tests
GPQA: 58.1%
MMLU Pro: N/A
LiveCodeBench: N/A
Math 500: N/A
AIME 2025: N/A
HLE: 7.1%

Provider: Z.ai
Context Window: 202,752 tokens
Input Price: $0.07/1M tokens
Output Price: $0.40/1M tokens
Release Date: Jan 19, 2026
Modalities: text
Capabilities: N/A

Compare GLM 4.7 Flash to other models

See how it stacks up on price, quality, and overall performance.

Frequently asked questions

What is GLM 4.7 Flash good for?

Use GLM 4.7 Flash for everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.

How much does GLM 4.7 Flash cost?

Pricing is based on usage. Current rates are $0.07/1M tokens for input and $0.40/1M tokens for output.
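
To make the per-token rates concrete, here is a minimal sketch of the arithmetic in Python. The helper name `estimate_cost_usd` and the constant names are hypothetical and not part of any Z.ai API. The sketch assumes that billing is strictly linear in token count, with no minimum charge, caching discount, or rounding, none of which this page specifies.

```python
from decimal import Decimal

# Rates copied from the pricing above, in USD per 1M tokens.
INPUT_USD_PER_MILLION = Decimal("0.07")
OUTPUT_USD_PER_MILLION = Decimal("0.40")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request from its token counts.

    Hypothetical helper: assumes purely linear, per-token billing.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Scale each token count to millions, then apply the matching rate.
    input_cost = Decimal(input_tokens) / TOKENS_PER_MILLION * INPUT_USD_PER_MILLION
    output_cost = Decimal(output_tokens) / TOKENS_PER_MILLION * OUTPUT_USD_PER_MILLION
    return input_cost + output_cost
```

Under these assumptions, a request with 100,000 input tokens and 20,000 output tokens comes to $0.007 + $0.008 = $0.015. `Decimal` is used instead of floats so that small per-request amounts add up exactly across many requests.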

Can I try GLM 4.7 Flash for free?

Yes. You can start a chat instantly and test the model before deciding on a plan.

Does GLM 4.7 Flash support images or audio?

GLM 4.7 Flash focuses on text-based tasks.

Benchmarks and pricing are sourced from Artificial Analysis where available. OpenRouter specs are used as a fallback.