AI Model Ranking (LLM Leaderboard)

Best AI Coding Models

Language models ranked by Artificial Analysis Index

Model
AI model name and provider organization
Intelligence
Artificial Analysis Intelligence Index - composite reasoning and capability score across the benchmark suite
Value
Quality, speed, and blended token price combined into a relative value score
Speed
Inference throughput in tokens per second - how fast the model generates responses
Context
Maximum context window size - how much text, code, or conversation the model can process at once
Price
Cost per 1 million tokens β€” input (text you send) / output (text the model generates)
Release
When the model was released - newer models may have more capabilities
Compare
OpenAI AI provider logo - GPT-5.5 (xhigh)
#1 GPT-5.5 (xhigh)
by OpenAI
60.2 10 71 tok/s 1.1M $5.00 / $30.00 Apr 23, 2026
Details
OpenAI AI provider logo - GPT-5.5 (high)
#2 GPT-5.5 (high)
by OpenAI
58.9 10 61 tok/s 1.1M $5.00 / $30.00 Apr 23, 2026
Details
OpenAI AI provider logo - GPT-5.4 (xhigh)
#3 GPT-5.4 (xhigh)
by OpenAI
56.8 14 79 tok/s 1.1M $2.50 / $15.00 Mar 5, 2026
Details
Anthropic AI provider logo - Claude Opus 4.8 (Adaptive Reasoning, Max Effort)
#4 Claude Opus 4.8 (Adaptive Reasoning, Max Effort)
by Anthropic
61.4 10 53 tok/s 1.0M $6.25 / $25.00 May 28, 2026
Details
OpenAI AI provider logo - GPT-5.5 (medium)
#5 GPT-5.5 (medium)
by OpenAI
56.7 9 55 tok/s 1.1M $5.00 / $30.00 Apr 23, 2026
Details
AI Chat

Chat with 80+ models

Inference API

EU-hosted inference

Google AI provider logo - Gemini 3.1 Pro Preview
#6 Gemini 3.1 Pro Preview
by Google
57.2 23 120 tok/s 1.0M $2.00 / $12.00 Feb 19, 2026
Details
OpenAI AI provider logo - GPT-5.3 Codex (xhigh)
#7 GPT-5.3 Codex (xhigh)
by OpenAI
53.6 13 71 tok/s 400K $1.75 / $14.00 Feb 5, 2026
Details
Anthropic AI provider logo - Claude Opus 4.7 (Non-reasoning, High Effort)
#8 Claude Opus 4.7 (Non-reasoning, High Effort)
by Anthropic
51.8 8 42 tok/s N/A $6.25 / $25.00 Apr 16, 2026
Details
Anthropic AI provider logo - Claude Opus 4.7 (Adaptive Reasoning, Max Effort)
#9 Claude Opus 4.7 (Adaptive Reasoning, Max Effort)
by Anthropic
57.3 9 48 tok/s 1.0M $6.25 / $25.00 Apr 16, 2026
Details
OpenAI AI provider logo - GPT-5.5 (low)
#10 GPT-5.5 (low)
by OpenAI
50.8 8 54 tok/s 1.1M $5.00 / $30.00 Apr 23, 2026
Details
OpenAI AI provider logo - GPT-5.4 mini (xhigh)
#11 GPT-5.4 mini (xhigh)
by OpenAI
48.9 34 157 tok/s 400K $0.75 / $4.50 Mar 17, 2026
Details
Anthropic AI provider logo - Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)
#12 Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)
by Anthropic
51.7 11 64 tok/s 1.0M $3.75 / $15.00 Feb 17, 2026
Details
Alibaba AI provider logo - Qwen3.7 Max
#13 Qwen3.7 Max
by Alibaba
56.6 26 179 tok/s 1.0M $2.50 / $7.50 May 19, 2026
Details
OpenAI AI provider logo - GPT-5.2 (xhigh)
#14 GPT-5.2 (xhigh)
by OpenAI
51.3 13 71 tok/s 400K $1.75 / $14.00 Dec 11, 2025
Details
OpenAI AI provider logo - GPT-5.5 (Non-reasoning)
#15 GPT-5.5 (Non-reasoning)
by OpenAI
40.9 7 51 tok/s 1.1M $5.00 / $30.00 Apr 23, 2026
Details
Anthropic AI provider logo - Claude Opus 4.6 (Adaptive Reasoning, Max Effort)
#16 Claude Opus 4.6 (Adaptive Reasoning, Max Effort)
by Anthropic
52.9 9 47 tok/s N/A $6.25 / $25.00 Feb 5, 2026
Details
Anthropic AI provider logo - Claude Opus 4.5 (Reasoning)
#17 Claude Opus 4.5 (Reasoning)
by Anthropic
49.7 8 54 tok/s 200K $6.25 / $25.00 Nov 24, 2025
Details
Anthropic AI provider logo - Claude Opus 4.6 (Non-reasoning, High Effort)
#18 Claude Opus 4.6 (Non-reasoning, High Effort)
by Anthropic
46.5 8 42 tok/s 1.0M $6.25 / $25.00 Feb 5, 2026
Details
Meta AI provider logo - Muse Spark
#19 Muse Spark
by Meta
52.2 N/A N/A N/A N/A / N/A Apr 8, 2026
Details
DeepSeek AI provider logo - V4 Pro (Reasoning, Max Effort)
#20 V4 Pro (Reasoning, Max Effort)
by DeepSeek
51.5 38 47 tok/s 1.0M $0.43 / $0.87 Apr 24, 2026
Details
Google AI provider logo - Gemini 3.5 Flash (minimal)
#21 Gemini 3.5 Flash (minimal)
by Google
43.3 21 191 tok/s 1.0M $1.50 / $9.00 May 19, 2026
Details
MoonshotAI AI provider logo - Kimi K2.6
#22 Kimi K2.6
by MoonshotAI
53.9 22 42 tok/s 262K $0.95 / $4.00 Apr 20, 2026
Details
Google AI provider logo - Gemini 2.5 Pro Preview (Mar' 25)
#23 Gemini 2.5 Pro Preview (Mar' 25)
by Google
30.3 N/A N/A 1.0M N/A / N/A Mar 25, 2025
Details
Alibaba AI provider logo - Qwen3.7 Plus
#24 Qwen3.7 Plus
by Alibaba
53.3 38 54 tok/s 1.0M $0.40 / $1.16 Jun 1, 2026
Details
Google AI provider logo - Gemini 3 Pro Preview (high)
#25 Gemini 3 Pro Preview (high)
by Google
48.4 12 N/A N/A $2.00 / $12.00 Nov 18, 2025
Details
Anthropic AI provider logo - Claude Sonnet 4.6 (Non-reasoning, High Effort)
#26 Claude Sonnet 4.6 (Non-reasoning, High Effort)
by Anthropic
44.4 9 46 tok/s 1.0M $3.75 / $15.00 Feb 17, 2026
Details
KwaiKAT AI provider logo - KAT Coder Pro V2
#27 KAT Coder Pro V2
by KwaiKAT
43.8 52 118 tok/s 256K $0.30 / $1.20 Mar 27, 2026
Details
OpenAI AI provider logo - GPT-5.4 (low)
#28 GPT-5.4 (low)
by OpenAI
47.9 11 67 tok/s 1.1M $2.50 / $15.00 Mar 5, 2026
Details
Xiaomi AI provider logo - MiMo-V2.5-Pro
#29 MiMo-V2.5-Pro
by Xiaomi
53.8 40 44 tok/s 1.0M $0.43 / $0.87 Apr 22, 2026
Details
OpenAI AI provider logo - GPT-5.5 Instant (May 2026)
#30 GPT-5.5 Instant (May 2026)
by OpenAI
41.8 7 N/A N/A $5.00 / $30.00 May 5, 2026
Details
Google AI provider logo - Gemini 3.5 Flash (high)
#31 Gemini 3.5 Flash (high)
by Google
55.3 27 198 tok/s 1.0M $1.50 / $9.00 May 19, 2026
Details
Alibaba AI provider logo - Qwen3.6 Max Preview
#32 Qwen3.6 Max Preview
by Alibaba
51.8 16 41 tok/s 262K $1.30 / $7.80 Apr 20, 2026
Details
OpenAI AI provider logo - GPT-5.1 (high)
#33 GPT-5.1 (high)
by OpenAI
47.7 23 125 tok/s 400K $1.25 / $10.00 Nov 13, 2025
Details
OpenAI AI provider logo - GPT-5.2 (medium)
#34 GPT-5.2 (medium)
by OpenAI
46.6 12 N/A 400K $1.75 / $14.00 Dec 11, 2025
Details
Z AI AI provider logo - GLM-5 (Reasoning)
#35 GLM-5 (Reasoning)
by Z AI
49.8 23 80 tok/s 203K $1.00 / $3.20 Feb 11, 2026
Details
OpenAI AI provider logo - GPT-5.4 nano (xhigh)
#36 GPT-5.4 nano (xhigh)
by OpenAI
44.0 58 164 tok/s 400K $0.20 / $1.25 Mar 17, 2026
Details
Google AI provider logo - Gemini 3.5 Flash (medium)
#37 Gemini 3.5 Flash (medium)
by Google
54.8 27 206 tok/s 1.0M $1.50 / $9.00 May 19, 2026
Details
MiniMax AI provider logo - M3
#38 M3
by MiniMax
54.7 N/A N/A 1.0M N/A / N/A Jun 1, 2026
Details
Z AI AI provider logo - GLM-5.1 (Reasoning)
#39 GLM-5.1 (Reasoning)
by Z AI
51.4 19 50 tok/s 203K $1.40 / $4.40 Apr 7, 2026
Details
DeepSeek AI provider logo - V4 Pro (Reasoning, High Effort)
#40 V4 Pro (Reasoning, High Effort)
by DeepSeek
49.8 37 45 tok/s 1.0M $0.43 / $0.87 Apr 24, 2026
Details
Anthropic AI provider logo - Claude Sonnet 4.6 (Non-reasoning, Low Effort)
#41 Claude Sonnet 4.6 (Non-reasoning, Low Effort)
by Anthropic
42.6 9 47 tok/s 1.0M $3.75 / $15.00 Feb 17, 2026
Details
OpenAI AI provider logo - GPT-5.2 Codex (xhigh)
#42 GPT-5.2 Codex (xhigh)
by OpenAI
49.0 17 108 tok/s 400K $1.75 / $14.00 Dec 11, 2025
Details
Alibaba AI provider logo - Qwen3.6 Plus
#43 Qwen3.6 Plus
by Alibaba
50.0 26 53 tok/s 1.0M $0.50 / $3.00 Apr 2, 2026
Details
Anthropic AI provider logo - Claude Opus 4.5 (Non-reasoning)
#44 Claude Opus 4.5 (Non-reasoning)
by Anthropic
43.1 7 49 tok/s 200K $6.25 / $25.00 Nov 24, 2025
Details
Google AI provider logo - Gemini 3 Flash Preview (Reasoning)
#45 Gemini 3 Flash Preview (Reasoning)
by Google
46.4 40 177 tok/s 1.0M $0.50 / $3.00 Dec 17, 2025
Details
xAI AI provider logo - Grok 4.20 0309 (Reasoning)
#46 Grok 4.20 0309 (Reasoning)
by xAI
48.5 25 168 tok/s N/A $2.00 / $6.00 Mar 10, 2026
Details
Xiaomi AI provider logo - MiMo-V2.5
#47 MiMo-V2.5
by Xiaomi
49.0 66 78 tok/s 1.0M $0.14 / $0.28 Apr 22, 2026
Details
MiniMax AI provider logo - M2.7
#48 M2.7
by MiniMax
49.6 37 66 tok/s 205K $0.30 / $1.20 Mar 18, 2026
Details
Xiaomi AI provider logo - MiMo-V2-Pro
#49 MiMo-V2-Pro
by Xiaomi
49.2 22 43 tok/s N/A $1.00 / $3.00 Mar 18, 2026
Details
Alibaba AI provider logo - Qwen3.5 397B A17B (Reasoning)
#50 Qwen3.5 397B A17B (Reasoning)
by Alibaba
45.0 21 52 tok/s 262K $0.60 / $3.60 Feb 16, 2026
Details
xAI AI provider logo - Grok 4.3 (high)
#51 Grok 4.3 (high)
by xAI
53.2 38 125 tok/s N/A $1.25 / $2.50 Apr 30, 2026
Details
OpenAI AI provider logo - GPT-5.4 (Non-reasoning)
#52 GPT-5.4 (Non-reasoning)
by OpenAI
35.4 8 59 tok/s 1.1M $2.50 / $15.00 Mar 5, 2026
Details
xAI AI provider logo - Grok 4.20 0309 v2 (Reasoning)
#53 Grok 4.20 0309 v2 (Reasoning)
by xAI
49.3 26 172 tok/s N/A $2.00 / $6.00 Apr 7, 2026
Details
xAI AI provider logo - Grok 4
#54 Grok 4
by xAI
41.5 7 N/A N/A $5.50 / $27.50 Jul 10, 2025
Details
DeepSeek AI provider logo - V4 Flash (Reasoning, High Effort)
#55 V4 Flash (Reasoning, High Effort)
by DeepSeek
46.0 60 N/A 1.0M $0.14 / $0.28 Apr 24, 2026
Details
MoonshotAI AI provider logo - Kimi K2.5 (Reasoning)
#56 Kimi K2.5 (Reasoning)
by MoonshotAI
46.8 23 33 tok/s 262K $0.58 / $3.00 Jan 27, 2026
Details
Google AI provider logo - Gemini 3 Pro Preview (low)
#57 Gemini 3 Pro Preview (low)
by Google
41.3 11 N/A N/A $2.00 / $12.00 Nov 18, 2025
Details
Z AI AI provider logo - GLM-5 (Non-reasoning)
#58 GLM-5 (Non-reasoning)
by Z AI
40.6 18 65 tok/s 203K $1.00 / $3.20 Feb 11, 2026
Details
OpenAI AI provider logo - GPT-5 Codex (high)
#59 GPT-5 Codex (high)
by OpenAI
44.6 22 174 tok/s 400K $1.25 / $10.00 Sep 23, 2025
Details
OpenAI AI provider logo - GPT-5 (medium)
#60 GPT-5 (medium)
by OpenAI
42.0 14 86 tok/s 400K $1.25 / $10.00 Aug 7, 2025
Details
Google AI provider logo - Gemma 4 31B (Reasoning)
#61 Gemma 4 31B (Reasoning)
by Google
39.2 N/A 35 tok/s 262K N/A / N/A Apr 2, 2026
Details
DeepSeek AI provider logo - V4 Flash (Reasoning, Max Effort)
#62 V4 Flash (Reasoning, Max Effort)
by DeepSeek
46.5 96 120 tok/s 1.0M $0.14 / $0.28 Apr 24, 2026
Details
Anthropic AI provider logo - Claude 4.5 Sonnet (Reasoning)
#63 Claude 4.5 Sonnet (Reasoning)
by Anthropic
43.0 9 54 tok/s N/A $3.75 / $15.00 Sep 29, 2025
Details
OpenAI AI provider logo - o3
#64 o3
by OpenAI
38.4 17 113 tok/s 200K $2.00 / $8.00 Apr 16, 2025
Details
DeepSeek AI provider logo - V4 Pro (Non-reasoning)
#65 V4 Pro (Non-reasoning)
by DeepSeek
39.3 29 44 tok/s 1.0M $0.43 / $0.87 Apr 24, 2026
Details
MoonshotAI AI provider logo - Kimi K2.6 (Non-reasoning)
#66 Kimi K2.6 (Non-reasoning)
by MoonshotAI
42.9 18 42 tok/s 262K $0.95 / $4.00 Apr 20, 2026
Details
DeepSeek AI provider logo - V3.2 Speciale
#67 V3.2 Speciale
by DeepSeek
29.4 N/A N/A N/A N/A / N/A Dec 1, 2025
Details
Google AI provider logo - Gemini 3 Flash Preview (Non-reasoning)
#68 Gemini 3 Flash Preview (Non-reasoning)
by Google
35.0 30 184 tok/s 1.0M $0.50 / $3.00 Dec 17, 2025
Details
OpenAI AI provider logo - GPT-5.4 mini (medium)
#69 GPT-5.4 mini (medium)
by OpenAI
37.7 26 163 tok/s 400K $0.75 / $4.50 Mar 17, 2026
Details
Alibaba AI provider logo - Qwen3.5 397B A17B (Non-reasoning)
#70 Qwen3.5 397B A17B (Non-reasoning)
by Alibaba
40.1 19 53 tok/s 262K $0.60 / $3.60 Feb 16, 2026
Details
MiniMax AI provider logo - M2.5
#71 M2.5
by MiniMax
41.9 52 202 tok/s 205K $0.30 / $1.20 Feb 12, 2026
Details
StepFun AI provider logo - Step 3.7 Flash
#72 Step 3.7 Flash
by StepFun
42.6 58 392 tok/s 256K $0.20 / $1.15 May 29, 2026
Details
Xiaomi AI provider logo - MiMo-V2-Omni-0327
#73 MiMo-V2-Omni-0327
by Xiaomi
44.9 32 88 tok/s N/A $0.40 / $2.00 Mar 27, 2026
Details
Xiaomi AI provider logo - MiMo-V2.5-Pro (Non-reasoning)
#74 MiMo-V2.5-Pro (Non-reasoning)
by Xiaomi
35.6 17 46 tok/s 1.0M $0.90 / $2.70 Apr 22, 2026
Details
Z AI AI provider logo - GLM-5-Turbo
#75 GLM-5-Turbo
by Z AI
46.8 N/A N/A 203K N/A / N/A Mar 15, 2026
Details
DeepSeek AI provider logo - V3.2 (Reasoning)
#76 V3.2 (Reasoning)
by DeepSeek
41.7 39 N/A 131K $0.30 / $0.45 Dec 1, 2025
Details
OpenAI AI provider logo - GPT-5.1 Codex (high)
#77 GPT-5.1 Codex (high)
by OpenAI
43.1 21 189 tok/s 400K $1.25 / $10.00 Nov 13, 2025
Details
Tencent AI provider logo - Hy3-preview (Reasoning)
#78 Hy3-preview (Reasoning)
by Tencent
41.9 67 99 tok/s 262K $0.12 / $0.43 Apr 23, 2026
Details
Alibaba AI provider logo - Qwen3.6 27B (Reasoning)
#79 Qwen3.6 27B (Reasoning)
by Alibaba
45.8 21 55 tok/s 262K $0.60 / $3.60 Apr 22, 2026
Details
Anthropic AI provider logo - Claude 4.1 Opus (Reasoning)
#80 Claude 4.1 Opus (Reasoning)
by Anthropic
42.0 4 34 tok/s N/A $18.75 / $75.00 Aug 5, 2025
Details
OpenAI AI provider logo - GPT-5.1 Codex mini (high)
#81 GPT-5.1 Codex mini (high)
by OpenAI
38.6 42 211 tok/s 400K $0.25 / $2.00 Nov 13, 2025
Details
Z AI AI provider logo - GLM-4.7 (Reasoning)
#82 GLM-4.7 (Reasoning)
by Z AI
42.1 25 83 tok/s 203K $0.60 / $2.20 Dec 22, 2025
Details
Z AI AI provider logo - GLM 5V Turbo (Reasoning)
#83 GLM 5V Turbo (Reasoning)
by Z AI
42.9 N/A N/A 203K N/A / N/A Apr 1, 2026
Details
OpenAI AI provider logo - GPT-5 (high)
#84 GPT-5 (high)
by OpenAI
44.6 20 115 tok/s 400K $1.25 / $10.00 Aug 7, 2025
Details
Z AI AI provider logo - GLM-5.1 (Non-reasoning)
#85 GLM-5.1 (Non-reasoning)
by Z AI
43.8 16 49 tok/s 203K $1.40 / $4.40 Apr 7, 2026
Details
Xiaomi AI provider logo - MiMo-V2-Omni
#86 MiMo-V2-Omni
by Xiaomi
43.4 N/A 84 tok/s N/A N/A / N/A Mar 19, 2026
Details
Mistral AI provider logo - Medium 3.5
#87 Medium 3.5
by Mistral
39.2 20 139 tok/s N/A $1.50 / $7.50 Apr 29, 2026
Details
OpenAI AI provider logo - GPT-5 mini (high)
#88 GPT-5 mini (high)
by OpenAI
41.2 32 90 tok/s 400K $0.25 / $2.00 Aug 7, 2025
Details
DeepSeek AI provider logo - V4 Flash (Non-reasoning)
#89 V4 Flash (Non-reasoning)
by DeepSeek
36.5 72 114 tok/s 1.0M $0.14 / $0.28 Apr 24, 2026
Details
Alibaba AI provider logo - Qwen3.6 35B A3B (Reasoning)
#90 Qwen3.6 35B A3B (Reasoning)
by Alibaba
43.5 53 162 tok/s 262K $0.25 / $1.49 Apr 16, 2026
Details
xAI AI provider logo - Grok 4.3 (medium)
#91 Grok 4.3 (medium)
by xAI
48.8 35 125 tok/s N/A $1.25 / $2.50 Apr 30, 2026
Details
OpenAI AI provider logo - GPT-5.4 nano (medium)
#92 GPT-5.4 nano (medium)
by OpenAI
38.1 51 160 tok/s 400K $0.20 / $1.25 Mar 17, 2026
Details
Alibaba AI provider logo - Qwen3.5 27B (Reasoning)
#93 Qwen3.5 27B (Reasoning)
by Alibaba
42.1 28 83 tok/s 262K $0.30 / $2.40 Feb 24, 2026
Details
MoonshotAI AI provider logo - Kimi K2 Thinking
#94 Kimi K2 Thinking
by MoonshotAI
40.9 36 131 tok/s 262K $0.60 / $2.50 Nov 6, 2025
Details
Alibaba AI provider logo - Qwen3.5 122B A10B (Reasoning)
#95 Qwen3.5 122B A10B (Reasoning)
by Alibaba
41.6 36 141 tok/s 262K $0.40 / $3.20 Feb 24, 2026
Details
OpenAI AI provider logo - GPT-5.2 (Non-reasoning)
#96 GPT-5.2 (Non-reasoning)
by OpenAI
33.6 8 63 tok/s 400K $1.75 / $14.00 Dec 11, 2025
Details
StepFun AI provider logo - Step 3.5 Flash 2603
#97 Step 3.5 Flash 2603
by StepFun
38.5 90 243 tok/s 262K $0.10 / $0.30 Apr 2, 2026
Details
DeepSeek AI provider logo - V3.2 (Non-reasoning)
#98 V3.2 (Non-reasoning)
by DeepSeek
32.1 20 N/A 131K $0.50 / $1.60 Dec 1, 2025
Details
Tencent AI provider logo - Hy3-preview (Non-reasoning)
#99 Hy3-preview (Non-reasoning)
by Tencent
33.7 46 84 tok/s 262K $0.12 / $0.43 Apr 23, 2026
Details
Anthropic AI provider logo - Claude 4 Sonnet (Reasoning)
#100 Claude 4 Sonnet (Reasoning)
by Anthropic
38.7 8 46 tok/s N/A $3.75 / $15.00 May 22, 2025
Details

Showing 100 of 529 models

Understanding the AI Model Leaderboard

This comprehensive AI model leaderboard helps you compare and choose the best large language models (LLMs) for your needs. We track standardized AI benchmarks, token pricing, inference speed, and model capabilities across all major AI providers like OpenAI, Anthropic, Google, Meta, and DeepSeek.

Core AI Benchmarks Explained

MMLU-Pro Tests broad knowledge across 14 academic subjects
GPQA PhD-level reasoning & problem-solving
AIME 2025 Elite mathematical reasoning
Coding Index LiveCodeBench + SciCode composite
Math Index AIME + MATH-500 composite

Key Metrics to Consider

Token Pricing Input vs output cost per 1M tokens
Inference Speed Tokens/sec for response time
Release Date Latest techniques & knowledge
Benchmark Scores 0-100% capability comparison

How to Choose the Right AI Model for Your Use Case

For Research & Analysis

Prioritize models with high MMLU-Pro (70%+) and GPQA (60%+) scores for complex reasoning tasks, academic research, and technical documentation

For Cost Optimization

Sort by input/output pricing - smaller models often deliver 80% of flagship performance at 10% of the cost for simple tasks

For Math & STEM

Filter by Math Index or AIME 2025 scores (50%+) for quantitative analysis, engineering calculations, and scientific applications

All benchmark scores and pricing data are updated daily from Artificial Analysis to reflect the latest model versions and capabilities. Use the sort filters above to find AI models by intelligence, cost, coding ability, math performance, speed, or release date.

Frequently Asked Questions

What is MMLU-Pro and why is it the standard AI intelligence benchmark?

MMLU-Pro (Massive Multitask Language Understanding - Professional) is the most comprehensive AI benchmark, testing models across 14 academic subjects including mathematics, science, history, law, and ethics. Scores range from 46% (basic competency) to 87% (near-expert level). Models scoring above 75% demonstrate strong general intelligence suitable for professional applications, while scores below 60% indicate limitations in complex reasoning tasks.

What does GPQA measure and which models score highest?

GPQA (Graduate-level Google-Proof Q&A) tests PhD-level reasoning with questions designed to be "Google-proof" - requiring deep understanding rather than simple fact retrieval. Top models like GPT-5.1 (87.3%), GPT-5 mini (82.8%), and o3 (82.7%) excel at GPQA, making them ideal for research, technical analysis, and complex problem-solving. Models below 50% GPQA struggle with advanced reasoning and may provide superficial answers to complex questions.

What is AIME 2025 and how does it evaluate AI mathematical ability?

AIME 2025 (American Invitational Mathematics Examination) is an elite math competition benchmark that tests advanced problem-solving, algebra, geometry, and number theory. Scores above 80% (like GPT-5 Codex at 98.7% or GPT-5.1 at 94%) indicate exceptional mathematical reasoning suitable for engineering, scientific computing, and quantitative analysis. Models scoring below 50% may struggle with multi-step mathematical problems or require explicit problem breakdown.

How is AI model pricing calculated and what's considered cost-effective?

AI model pricing is measured per 1 million tokens (approximately 750,000 words). Input pricing covers text you send, while output pricing covers generated responses. Budget models like Llama 3.3 70B cost $0.54/$0.71 per million tokens, mid-tier models like GPT-5 nano cost $0.05/$0.40, while premium models like GPT-5 cost $1.25/$10. For typical applications with 3:1 input-to-output ratio, budget models can be 10-20x cheaper than flagship models while maintaining 70-80% performance.

Which AI models are best for coding and programming tasks?

Sort by Coding Index to see top programming models. Our Coding Index combines LiveCodeBench, SciCode, and coding benchmarks. Top performers include GPT-5.1 (57.5 index), GPT-5 mini (51.4), and GPT-5 Codex (53.5). These models excel at code generation, debugging, refactoring, and explaining complex algorithms. For budget-conscious developers, models with 40+ coding index scores offer excellent value for routine programming tasks.

How often are AI model benchmarks and rankings updated?

Our leaderboard syncs daily with Artificial Analysis API to ensure benchmark scores (MMLU-Pro, GPQA, AIME 2025), pricing, and inference speed data reflect the latest model versions. New model releases appear immediately under the "Newest" sort option. Benchmark scores can change when providers release updated versions - for example, GPT-5.1 released in November 2025 achieved 69.7 intelligence compared to GPT-5's 68.5 from August 2025.

What inference speed (tokens/second) do I need for my application?

Inference speed determines how fast models generate responses. For real-time chatbots and interactive applications, target 100+ tokens/second (models like gpt-oss-120B at 340 tok/s). For background processing and batch jobs, 50-100 tok/s is sufficient. Premium reasoning models like GPT-5 (103 tok/s) balance speed and capability. Note that higher inference speed doesn't always mean better quality - slower models often deliver more thoughtful, detailed responses.

Can I test these AI models for free before committing?

Yes! Try our free AI chat interface to test different models instantly without creating an account. Many providers also offer free tiers: OpenAI (ChatGPT with daily limits), Anthropic (Claude with usage caps), Google (Gemini free tier), and open-source models like Llama 3.3. Compare performance on your specific use case before upgrading to paid plans.