o1 vs Grok 4

Comparing 2 AI models · 6 benchmarks · OpenAI, xAI

Most Affordable

Grok 4

$3.00/1M

Highest Intelligence

Grok 4

87.7% GPQA

Best for Coding

Grok 4

55.1 Coding Index

Price Difference

5.0x

input cost range

Composite Indices

Intelligence, Coding, Math

Academic and industry benchmarks

6 tests

No clear wins

Metric	Op o1 OpenAI	xA Grok 4 xAI
Pricing Per 1M tokens
Input Cost	$15.00/1M	$3.00/1M
Output Cost	$60.00/1M	$15.00/1M
Blended Cost 3:1 input/output ratio	$26.25/1M	$6.00/1M
Specifications
Organization Model creator	OpenAI	xAI
Release Date Launch date	Dec 5, 2024	Jul 10, 2025
Performance & Speed
Throughput Output speed	160.4 tok/s	37.2 tok/s
Time to First Token (TTFT) Initial response delay	12663ms	9172ms
Latency Time to first answer token	12663ms	9172ms
Composite Indices
Intelligence Index Overall reasoning capability	47.2	65.3
Coding Index Programming ability	38.6	55.1
Math Index Mathematical reasoning	—	92.7
Standard Benchmarks
GPQA Graduate-level reasoning	74.7%	87.7%
MMLU Pro Advanced knowledge	84.1%	86.6%
HLE Hard language evaluation	7.7%	23.9%
LiveCodeBench Real-world coding tasks	67.9%	81.9%
MATH 500 Mathematical problems	97.0%	99.0%
AIME 2025 Advanced math competition	—	92.7%
AIME (Original) Math olympiad problems	72.3%	94.3%
SciCode Scientific code generation	35.8%	45.7%
LCR Code review capability	—	68.0%
IFBench Instruction-following	—	53.7%
TAU-bench v2 Tool use & agentic tasks	62.6%	74.9%
TerminalBench Hard CLI command generation	12.1%	37.6%