Model Comparison

GLM-4.6 (Non-reasoning)
vs. Gemini 3 Flash Preview (Reasoning)

Comparing 2 AI models · 10 benchmarks · Z AI, Google

Recommended Pick

Gemini 3 Flash Preview (Reasoning) · 16 metric wins

Strongest on: Value, Input price, Throughput

Best Value

Gemini 3 Flash Preview (Reasoning)

100.0 value score

71.3 reasoning / $1.13/1M

Lowest Price

Gemini 3 Flash Preview (Reasoning)

$0.50/1M input price

Best Reasoning

Gemini 3 Flash Preview (Reasoning)

71.3 reasoning score

Blends available reasoning benchmarks

[Chart: Composite Indices. Higher is better; speed and price are normalized.]

[Chart: Standard Benchmarks. Only benchmarks with data are shown.]

Differences That Matter

Best value

Gemini 3 Flash Preview (Reasoning) has the strongest quality-to-price mix at 100.0 out of 100 value points.

Price gap

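GLM-4.6 (Non-reasoning) costs 1.2x as much per input token as Gemini 3 Flash Preview (Reasoning): $0.60 vs. $0.50 per 1M tokens. On output tokens the order reverses, with GLM-4.6 at $2.20 and Gemini 3 Flash Preview at $3.00 per 1M.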

Speed gap

Gemini 3 Flash Preview (Reasoning) generates about 3.8x as many tokens per second as GLM-4.6 (Non-reasoning).

Reasoning gap

Gemini 3 Flash Preview (Reasoning) leads GLM-4.6 (Non-reasoning) by 35.3 points on reasoning.
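
The three gaps above can be reproduced from the figures in the comparison table. The following is a minimal sketch; the dictionary keys and variable names are illustrative and not part of any published API.

```python
# Figures copied from the comparison table; names are illustrative only.
glm = {"input_price": 0.60, "throughput": 56.1, "reasoning": 36.0}
gemini = {"input_price": 0.50, "throughput": 210.7, "reasoning": 71.3}

# Price gap: how many times more GLM-4.6 charges per input token.
price_ratio = glm["input_price"] / gemini["input_price"]

# Speed gap: ratio of output throughput in tokens per second.
speed_ratio = gemini["throughput"] / glm["throughput"]

# Reasoning gap: absolute difference in composite reasoning score.
reasoning_gap = gemini["reasoning"] - glm["reasoning"]

print(f"Input price ratio: {price_ratio:.1f}x")
print(f"Throughput ratio:  {speed_ratio:.1f}x")
print(f"Reasoning gap:     {reasoning_gap:.1f} points")
```

Rounded to one decimal place, these come out to 1.2x, 3.8x, and 35.3 points, matching the figures quoted above.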

Top-pick rationale

Gemini 3 Flash Preview (Reasoning) wins 16 measurable categories, including Value, Input price, Throughput, Reasoning.
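
The page does not say how the 16 wins are counted. One counting that is consistent with that number is sketched below: every numeric row of the comparison table is one category, lower is treated as better for prices and latencies, and higher as better for everything else. That rule is an assumption, as are the function and variable names.

```python
# (metric, GLM-4.6 value, Gemini 3 Flash Preview value, higher_is_better)
# Values are copied from the comparison table; the direction flags are assumed.
ROWS = [
    ("Input Cost", 0.60, 0.50, False),
    ("Output Cost", 2.20, 3.00, False),
    ("Blended (3:1)", 1.00, 1.13, False),
    ("Throughput", 56.1, 210.7, True),
    ("TTFT", 1662, 6278, False),
    ("Latency", 1662, 6278, False),
    ("Value Score", 56.8, 100.0, True),
    ("Reasoning Score", 36.0, 71.3, True),
    ("Intelligence", 23.0, 37.8, True),
    ("Math", 44.3, 97.0, True),
    ("GPQA", 63.2, 89.8, True),
    ("MMLU Pro", 78.4, 89.0, True),
    ("HLE", 5.2, 34.7, True),
    ("LiveCodeBench", 56.1, 90.8, True),
    ("AIME 2025", 44.3, 97.0, True),
    ("SciCode", 33.1, 50.6, True),
    ("LCR", 26.3, 66.3, True),
    ("IFBench", 36.7, 78.0, True),
    ("TAU-bench v2", 76.9, 80.4, True),
    ("TerminalBench Hard", 28.8, 38.6, True),
]


def count_wins(rows):
    """Count per-model wins; ties count for neither model."""
    wins = {"glm": 0, "gemini": 0}
    for _name, glm, gemini, higher_is_better in rows:
        if glm == gemini:
            continue
        glm_better = glm > gemini if higher_is_better else glm < gemini
        wins["glm" if glm_better else "gemini"] += 1
    return wins


wins = count_wins(ROWS)
print(wins)
```

Under these assumptions Gemini 3 Flash Preview (Reasoning) takes 16 rows and GLM-4.6 (Non-reasoning) takes 4: output cost, blended price, TTFT, and latency.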

Full Comparison

Metric	GLM-4.6 (Non-reasoning), Z AI	Gemini 3 Flash Preview (Reasoning), Google (Top Pick)
Pricing per 1M tokens
Input Cost	$0.60/1M	$0.50/1M
Output Cost	$2.20/1M	$3.00/1M
Blended (3:1)	$1.00/1M	$1.13/1M
Specifications
Organization	Z AI	Google
Release Date	Sep 30, 2025	Dec 17, 2025
Performance & Speed
Throughput	56.1 tok/s	210.7 tok/s
TTFT	1662ms	6278ms
Latency	1662ms	6278ms
Composite Indices
Value Score	56.8	100.0
Reasoning Score	36.0	71.3
Intelligence	23.0	37.8
Math	44.3	97.0
Standard Benchmarks
GPQA	63.2%	89.8%
MMLU Pro	78.4%	89.0%
HLE	5.2%	34.7%
LiveCodeBench	56.1%	90.8%
AIME 2025	44.3%	97.0%
SciCode	33.1%	50.6%
LCR	26.3%	66.3%
IFBench	36.7%	78.0%
TAU-bench v2	76.9%	80.4%
TerminalBench Hard	28.8%	38.6%
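
The Blended (3:1) row is consistent with a weighted average of three parts input price to one part output price. The Value Score row is not defined on the page, but its numbers match reasoning score divided by unrounded blended price, rescaled so the best model scores 100. The sketch below checks both; the value-score formula is a guess that happens to reproduce the table, not a documented definition, and the function names are illustrative.

```python
def blended_price(input_price, output_price, input_weight=3, output_weight=1):
    """Weighted average price per 1M tokens for a given input:output mix."""
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


def value_scores(reasoning_per_dollar):
    """Rescale reasoning-per-dollar figures so the best model gets 100 (assumed formula)."""
    best = max(reasoning_per_dollar.values())
    return {name: 100 * v / best for name, v in reasoning_per_dollar.items()}


# Prices and reasoning scores copied from the comparison table.
glm_blended = blended_price(0.60, 2.20)      # about 1.000
gemini_blended = blended_price(0.50, 3.00)   # 1.125, shown on the page as $1.13

scores = value_scores({
    "glm": 36.0 / glm_blended,
    "gemini": 71.3 / gemini_blended,
})

print(f"GLM-4.6 blended: ${glm_blended:.3f}/1M")
print(f"Gemini blended:  ${gemini_blended:.3f}/1M")
print({name: round(score, 1) for name, score in scores.items()})
```

With the table's inputs this gives value scores of 56.8 and 100.0, which agree with the Value Score row. Note that the match depends on using the unrounded 1.125 rather than the displayed $1.13.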

Key Takeaways

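Gemini 3 Flash Preview (Reasoning) has the lowest input price at $0.50/1M and the highest value score (100.0), which suits high-volume, input-heavy applications. GLM-4.6 (Non-reasoning) is cheaper on output ($2.20/1M vs. $3.00/1M) and on the 3:1 blended price ($1.00/1M vs. $1.13/1M).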

Gemini 3 Flash Preview (Reasoning) has the strongest reasoning profile, with a reasoning score of 71.3 against 36.0 for GLM-4.6 (Non-reasoning). The score is a composite of the available reasoning-heavy benchmarks.

Context window sizes are not listed for either model in this comparison.
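
Because the two models trade places on input and output price, which one is cheaper depends on the input-to-output token mix of the workload. The sketch below estimates cost from the listed per-1M prices and finds the break-even ratio. The workload sizes are hypothetical, and the calculation assumes both models consume and emit the same number of tokens for a given task, which may not hold in practice.

```python
# (input price, output price) in USD per 1M tokens, from the comparison table.
PRICES = {
    "GLM-4.6 (Non-reasoning)": (0.60, 2.20),
    "Gemini 3 Flash Preview (Reasoning)": (0.50, 3.00),
}


def workload_cost(model, input_tokens, output_tokens):
    """Cost in USD for a workload, assuming the listed per-1M prices."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def cheapest(input_tokens, output_tokens):
    """Name of the model with the lower estimated cost for this token mix."""
    return min(PRICES, key=lambda m: workload_cost(m, input_tokens, output_tokens))


# Input:output ratio at which both models cost the same:
# 0.60*i + 2.20*o == 0.50*i + 3.00*o  =>  i/o == (3.00 - 2.20) / (0.60 - 0.50)
break_even_ratio = (3.00 - 2.20) / (0.60 - 0.50)

# Hypothetical workloads: 3:1 and 10:1 input-to-output mixes.
print(cheapest(3_000_000, 1_000_000))
print(cheapest(10_000_000, 1_000_000))
print(f"Break-even input:output ratio is about {break_even_ratio:.1f}:1")
```

Under these assumptions the break-even point is roughly 8:1. Below that ratio GLM-4.6 (Non-reasoning) is the cheaper option; above it Gemini 3 Flash Preview (Reasoning) is.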