Gemini 2.5 Flash-Lite (Reasoning) vs Grok 4

Comparing 2 AI models · 6 benchmarks · Google, xAI

Most Affordable

Gemini 2.5 Flash-Lite (Reasoning)

$0.10/1M

Highest Intelligence

Grok 4

87.7% GPQA

Best for Coding

Grok 4

55.1 Coding Index

Price Difference

30.0x

input cost range

Composite Indices

Intelligence, Coding, Math

Academic and industry benchmarks

6 tests

No clear wins

Metric	Go Gemini 2.5 Flash-Lite (Reasoning) Google	xA Grok 4 xAI
Pricing Per 1M tokens
Input Cost	$0.10/1M	$3.00/1M
Output Cost	$0.40/1M	$15.00/1M
Blended Cost 3:1 input/output ratio	$0.18/1M	$6.00/1M
Specifications
Organization Model creator	Google	xAI
Release Date Launch date	Jun 17, 2025	Jul 10, 2025
Performance & Speed
Throughput Output speed	—	37.2 tok/s
Time to First Token (TTFT) Initial response delay	—	9172ms
Latency Time to first answer token	—	9172ms
Composite Indices
Intelligence Index Overall reasoning capability	40.1	65.3
Coding Index Programming ability	27.6	55.1
Math Index Mathematical reasoning	53.3	92.7
Standard Benchmarks
GPQA Graduate-level reasoning	62.5%	87.7%
MMLU Pro Advanced knowledge	75.9%	86.6%
HLE Hard language evaluation	6.4%	23.9%
LiveCodeBench Real-world coding tasks	59.3%	81.9%
MATH 500 Mathematical problems	96.9%	99.0%
AIME 2025 Advanced math competition	53.3%	92.7%
AIME (Original) Math olympiad problems	70.3%	94.3%
SciCode Scientific code generation	19.3%	45.7%
LCR Code review capability	51.3%	68.0%
IFBench Instruction-following	49.9%	53.7%
TAU-bench v2 Tool use & agentic tasks	18.4%	74.9%
TerminalBench Hard CLI command generation	4.3%	37.6%