GLM 4.7 Flash: Pricing, Context Window & Benchmarks
by Z.ai
GLM-4.7-Flash is a 30B-class model described as state of the art for its size, offering a balance of performance and efficiency. It is further optimized for agentic coding, with stronger coding capabilities, long-horizon task planning, and tool collaboration. It has achieved leading results among open-source models of the same size on several current public benchmark leaderboards.
What you can do with GLM 4.7 Flash
Everyday Q&A and clear explanations
Writing help (emails, posts, summaries)
Idea generation and brainstorming
Learning support with step-by-step guidance
Key Specifications
| Metric | Value |
|---|---|
| Provider | Z.ai |
| Context Window | 202,752 tokens |
| Input Price | $0.07/1M tokens |
| Output Price | $0.40/1M tokens |
| Release Date | Jan 19, 2026 |
| Modalities | text |
| Capabilities | N/A |
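The context window in the table caps how many tokens a single request can carry. The following is a minimal sketch of a budget check, not an official client. It assumes that prompt tokens and generated tokens both count against the same 202,752-token window (a common convention that this page does not confirm), and that token counts come from whatever tokenizer or usage report you already have. The function names `remaining_output_budget` and `fits_in_context` are hypothetical.

```python
# Context window taken from the specifications table.
CONTEXT_WINDOW_TOKENS = 202_752


def remaining_output_budget(prompt_tokens: int,
                            context_window: int = CONTEXT_WINDOW_TOKENS) -> int:
    """Tokens left for the reply.

    Assumption: prompt and output share one window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    return max(context_window - prompt_tokens, 0)


def fits_in_context(prompt_tokens: int,
                    max_output_tokens: int,
                    context_window: int = CONTEXT_WINDOW_TOKENS) -> bool:
    """True if the prompt plus the requested output stays within the window."""
    if prompt_tokens < 0 or max_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return prompt_tokens + max_output_tokens <= context_window
```

Under these assumptions, a 200,000-token prompt leaves 2,752 tokens for the reply, so a request asking for 4,096 output tokens would not fit.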
Frequently asked questions
What is GLM 4.7 Flash good for?
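Use GLM 4.7 Flash for everyday tasks like writing, summarizing, brainstorming, and getting clear explanations. It is also optimized for agentic coding, including long-horizon task planning and tool collaboration.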
How much does GLM 4.7 Flash cost?
Pricing is usage-based and billed per token. Current rates are $0.07 per 1M input tokens and $0.40 per 1M output tokens.
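To make the per-token rates concrete, here is a rough sketch of a cost estimator. It assumes input and output tokens are billed linearly at the listed rates with no minimums, caching discounts, or rounding rules, none of which this page specifies. The function name `estimate_cost_usd` is hypothetical.

```python
from decimal import Decimal

# Listed rates in USD per one million tokens.
INPUT_USD_PER_MILLION = Decimal("0.07")
OUTPUT_USD_PER_MILLION = Decimal("0.40")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request.

    Assumption: simple linear per-token billing at the listed rates.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (Decimal(input_tokens) * INPUT_USD_PER_MILLION
             + Decimal(output_tokens) * OUTPUT_USD_PER_MILLION)
    return total / TOKENS_PER_MILLION


# Example: a 50,000-token prompt with a 2,000-token reply.
example_cost = estimate_cost_usd(50_000, 2_000)
```

For that example, the input side costs $0.0035 and the output side $0.0008, about $0.0043 in total. `Decimal` is used instead of floats so that small per-request amounts add up without binary rounding drift.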
Can I try GLM 4.7 Flash for free?
Yes. You can start a chat instantly and test the model before deciding on a plan.
Does GLM 4.7 Flash support images or audio?
No. GLM 4.7 Flash is listed as a text-only model, so it focuses on text-based tasks.
Benchmarks and pricing are sourced from Artificial Analysis where available. OpenRouter specs are used as a fallback.