AI Model Ranking (LLM Leaderboard)

Cheapest AI Models

Language models scored on the Artificial Analysis Intelligence Index, listed from lowest input price upward

Model: AI model name and provider organization
Intelligence: Artificial Analysis Intelligence Index, a composite reasoning and capability score across the benchmark suite
Value: Quality, speed, and blended token price combined into a relative value score
Speed: Inference throughput in tokens per second (how fast the model generates responses)
Context: Maximum context window size (how much text, code, or conversation the model can process at once)
Price: Cost per 1 million tokens, input (text you send) / output (text the model generates)
Release: When the model was released; newer models may have more capabilities
| Rank | Model | Provider | Intelligence | Value | Speed | Context | Price (input / output) | Release |
|---|---|---|---|---|---|---|---|---|
| #1 | Qwen3.5 0.8B (Reasoning) | Alibaba | 10.5 | 40 | N/A | N/A | $0.01 / $0.05 | Mar 2, 2026 |
| #2 | Qwen3.5 0.8B (Non-reasoning) | Alibaba | 9.9 | 45 | 89 tok/s | N/A | $0.01 / $0.05 | Mar 2, 2026 |
| #3 | Qwen3.5 2B (Reasoning) | Alibaba | 16.3 | 44 | N/A | N/A | $0.02 / $0.10 | Mar 2, 2026 |
| #4 | Qwen3.5 2B (Non-reasoning) | Alibaba | 14.7 | 66 | 324 tok/s | N/A | $0.02 / $0.10 | Mar 2, 2026 |
| #5 | Gemma 3n E4B Instruct | Google | 6.4 | 22 | 51 tok/s | N/A | $0.02 / $0.04 | Jun 26, 2025 |
| #6 | 30B (high) | Sarvam | 12.3 | 51 | 161 tok/s | N/A | $0.03 / $0.11 | Mar 6, 2026 |
| #7 | LFM2 24B A2B | Liquid AI | 10.5 | 38 | 114 tok/s | N/A | $0.03 / $0.12 | Feb 25, 2026 |
| #8 | Qwen3.5 4B (Reasoning) | Alibaba | 27.1 | 100 | 197 tok/s | N/A | $0.03 / $0.15 | Mar 2, 2026 |
| #9 | Qwen3.5 4B (Non-reasoning) | Alibaba | 22.6 | 83 | 209 tok/s | N/A | $0.03 / $0.15 | Mar 2, 2026 |
| #10 | Granite 3.3 8B (Non-reasoning) | IBM | 7.0 | 22 | 442 tok/s | N/A | $0.03 / $0.25 | Apr 16, 2025 |
| #11 | Nova Micro | Amazon | 10.3 | 38 | 316 tok/s | N/A | $0.04 / $0.14 | Dec 3, 2024 |
| #12 | Nemotron Nano 9B V2 (Reasoning) | NVIDIA | 14.8 | 47 | 117 tok/s | 128K | $0.04 / $0.16 | Aug 18, 2025 |
| #13 | Gemma 3 4B Instruct | Google | 6.3 | 15 | N/A | 131K | $0.04 / $0.08 | Mar 12, 2025 |
| #14 | 105B (high) | Sarvam | 18.2 | 46 | 94 tok/s | N/A | $0.04 / $0.17 | Mar 6, 2026 |
| #15 | Llama 3 Instruct 8B | Meta | 6.4 | 16 | 89 tok/s | N/A | $0.04 / $0.14 | Apr 18, 2024 |
| #16 | gpt-oss-20B (high) | OpenAI | 24.5 | 75 | 268 tok/s | 131K | $0.05 / $0.20 | Aug 5, 2025 |
| #17 | Nemotron Nano 9B V2 (Non-reasoning) | NVIDIA | 13.2 | 41 | 135 tok/s | 128K | $0.05 / $0.20 | Aug 18, 2025 |
| #18 | Nemotron 3 Nano 30B A3B (Non-reasoning) | NVIDIA | 13.2 | 29 | 89 tok/s | 256K | $0.05 / $0.20 | Dec 15, 2025 |
| #19 | Granite 4.1 8B | IBM | 12.4 | 45 | 133 tok/s | N/A | $0.05 / $0.10 | Apr 29, 2026 |
| #20 | GPT-5 nano (minimal) | OpenAI | 13.8 | 34 | 154 tok/s | 400K | $0.05 / $0.40 | Aug 7, 2025 |
| #21 | GPT-5 nano (medium) | OpenAI | 25.9 | 63 | 167 tok/s | 400K | $0.05 / $0.40 | Aug 7, 2025 |
| #22 | GPT-5 nano (high) | OpenAI | 26.8 | 65 | 155 tok/s | 400K | $0.05 / $0.40 | Aug 7, 2025 |
| #23 | Llama 3.2 Instruct 1B | Meta | 6.3 | 19 | 93 tok/s | N/A | $0.05 / $0.05 | Sep 25, 2024 |
| #24 | Llama 2 Chat 7B | Meta | 9.7 | 22 | 101 tok/s | N/A | $0.05 / $0.25 | Jul 18, 2023 |
| #25 | Qwen2.5 Turbo | Alibaba | 12.0 | 22 | 67 tok/s | N/A | $0.05 / $0.20 | Nov 18, 2024 |
| #26 | Nemotron 3 Nano 30B A3B (Reasoning) | NVIDIA | 24.3 | 71 | 134 tok/s | 256K | $0.06 / $0.22 | Dec 15, 2025 |
| #27 | gpt-oss-20B (low) | OpenAI | 20.8 | 61 | 273 tok/s | 131K | $0.06 / $0.20 | Aug 5, 2025 |
| #28 | Granite 4.0 H Small | IBM | 10.8 | 30 | 418 tok/s | N/A | $0.06 / $0.25 | Sep 22, 2025 |
| #29 | Nova Lite | Amazon | 12.7 | 35 | 189 tok/s | N/A | $0.06 / $0.24 | Dec 3, 2024 |
| #30 | GLM-4.7-Flash (Non-reasoning) | Z AI | 22.1 | 48 | 118 tok/s | 203K | $0.07 / $0.40 | Jan 19, 2026 |
| #31 | GLM-4.7-Flash (Reasoning) | Z AI | 30.1 | 52 | 93 tok/s | 203K | $0.07 / $0.40 | Jan 19, 2026 |
| #32 | Nemotron 3 Nano Omni 30B A3B Reasoning | NVIDIA | 21.4 | 53 | 285 tok/s | N/A | $0.07 / $0.30 | Apr 29, 2026 |
| #33 | Small 3 | Mistral | 12.7 | 36 | 154 tok/s | N/A | $0.07 / $0.19 | Jan 30, 2025 |
| #34 | Qwen3 30B A3B (Non-reasoning) | Alibaba | 12.5 | 19 | 70 tok/s | 131K | $0.08 / $0.29 | Apr 28, 2025 |
| #35 | Small 3.2 | Mistral | 15.1 | 38 | 127 tok/s | N/A | $0.09 / $0.25 | Jun 20, 2025 |
| #36 | Gemma 3 12B Instruct | Google | 8.8 | 13 | N/A | 131K | $0.09 / $0.29 | Mar 12, 2025 |
| #37 | Qwen3 30B A3B (Reasoning) | Alibaba | 15.3 | 20 | 69 tok/s | 131K | $0.09 / $0.45 | Apr 28, 2025 |
| #38 | Ministral 3 3B | Mistral | 11.2 | 32 | 203 tok/s | N/A | $0.10 / $0.10 | Dec 2, 2025 |
| #39 | Llama Nemotron Super 49B v1.5 (Non-reasoning) | NVIDIA | 14.6 | 19 | 44 tok/s | N/A | $0.10 / $0.40 | Jul 25, 2025 |
| #40 | Llama Nemotron Super 49B v1.5 (Reasoning) | NVIDIA | 18.7 | 24 | 44 tok/s | N/A | $0.10 / $0.40 | Jul 25, 2025 |
| #41 | Step 3.5 Flash 2603 | StepFun | 38.5 | 90 | 238 tok/s | 262K | $0.10 / $0.30 | Apr 2, 2026 |
| #42 | Olmo 3 7B Instruct | Allen Institute for AI | 8.1 | 12 | N/A | N/A | $0.10 / $0.20 | Nov 20, 2025 |
| #43 | MiMo-V2-Flash (Feb 2026) | Xiaomi | 41.5 | 97 | 126 tok/s | 262K | $0.10 / $0.30 | Dec 16, 2025 |
| #44 | MiMo-V2-Flash (Non-reasoning) | Xiaomi | 30.3 | 71 | 126 tok/s | 262K | $0.10 / $0.30 | Dec 16, 2025 |
| #45 | Apertus 8B Instruct | Swiss AI Initiative | 5.9 | 9 | N/A | N/A | $0.10 / $0.20 | Sep 2, 2025 |
| #46 | Qwen3.5 9B (Reasoning) | Alibaba | 32.4 | 52 | 72 tok/s | 262K | $0.10 / $0.15 | Mar 2, 2026 |
| #47 | Qwen3.5 Omni Flash | Alibaba | 25.9 | 45 | 226 tok/s | N/A | $0.10 / $0.80 | Mar 30, 2026 |
| #48 | Ling 2.6 Flash | InclusionAI | 26.2 | 37 | N/A | 262K | $0.10 / $0.30 | Apr 21, 2026 |
| #49 | GPT-4.1 nano | OpenAI | 13.0 | 25 | 112 tok/s | 1.0M | $0.10 / $0.40 | Apr 14, 2025 |
| #50 | Llama 3.1 Instruct 8B | Meta | 11.8 | 34 | 203 tok/s | N/A | $0.10 / $0.10 | Jul 23, 2024 |
| #51 | Gemini 2.5 Flash-Lite (Reasoning) | Google | 17.6 | 38 | 307 tok/s | 1.0M | $0.10 / $0.40 | Jun 17, 2025 |
| #52 | Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning) | Google | 21.6 | 28 | N/A | N/A | $0.10 / $0.40 | Sep 8, 2025 |
| #53 | Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning) | Google | 19.4 | 25 | N/A | 1.0M | $0.10 / $0.40 | Sep 25, 2025 |
| #54 | Gemini 2.5 Flash-Lite (Non-reasoning) | Google | 12.7 | 27 | 231 tok/s | 1.0M | $0.10 / $0.40 | Jun 17, 2025 |
| #55 | Devstral Small (Jul '25) | Mistral | 15.2 | 21 | 37 tok/s | N/A | $0.10 / $0.30 | Jul 10, 2025 |
| #56 | Step 3.5 Flash | StepFun | 37.8 | 88 | 220 tok/s | 262K | $0.10 / $0.30 | Feb 2, 2026 |
| #57 | MiMo-V2-Flash (Reasoning) | Xiaomi | 39.2 | 91 | 130 tok/s | 262K | $0.10 / $0.30 | Dec 16, 2025 |
| #58 | Small 3.1 | Mistral | 14.5 | 35 | 156 tok/s | N/A | $0.10 / $0.23 | Mar 17, 2025 |
| #59 | Gemma 3 27B Instruct | Google | 10.3 | 15 | N/A | 131K | $0.11 / $0.25 | Mar 12, 2025 |
| #60 | Qwen3 4B (Non-reasoning) | Alibaba | 12.5 | 16 | N/A | N/A | $0.11 / $0.42 | Apr 28, 2025 |
| #61 | Qwen3 0.6B (Non-reasoning) | Alibaba | 5.7 | 7 | N/A | N/A | $0.11 / $0.42 | Apr 28, 2025 |
| #62 | Qwen3 4B (Reasoning) | Alibaba | 14.2 | 12 | N/A | N/A | $0.11 / $1.26 | Apr 28, 2025 |
| #63 | Qwen3 1.7B (Non-reasoning) | Alibaba | 6.8 | 9 | N/A | N/A | $0.11 / $0.42 | Apr 28, 2025 |
| #64 | Qwen3 0.6B (Reasoning) | Alibaba | 6.5 | 6 | N/A | N/A | $0.11 / $1.26 | Apr 28, 2025 |
| #65 | Qwen3 8B (Reasoning) | Alibaba | 13.2 | 12 | 66 tok/s | 131K | $0.11 / $1.15 | Apr 28, 2025 |
| #66 | Qwen3 1.7B (Reasoning) | Alibaba | 8.0 | 7 | N/A | N/A | $0.11 / $1.26 | Apr 28, 2025 |
| #67 | Hy3-preview (Reasoning) | Tencent | 41.9 | 66 | 98 tok/s | 262K | $0.12 / $0.43 | Apr 23, 2026 |
| #68 | Hy3-preview (Non-reasoning) | Tencent | 33.7 | 46 | 85 tok/s | 262K | $0.12 / $0.43 | Apr 23, 2026 |
| #69 | Phi-4 | Microsoft | 10.4 | 12 | 40 tok/s | 16K | $0.13 / $0.50 | Dec 12, 2024 |
| #70 | Gemma 4 26B A4B (Reasoning) | Google | 31.2 | 38 | N/A | 262K | $0.13 / $0.40 | Apr 2, 2026 |
| #71 | Gemma 4 26B A4B (Non-reasoning) | Google | 27.1 | 33 | 62 tok/s | 262K | $0.13 / $0.40 | Apr 2, 2026 |
| #72 | Hermes 4 - Llama-3.1 70B (Non-reasoning) | Nous Research | 12.6 | 17 | 84 tok/s | N/A | $0.13 / $0.40 | Aug 27, 2025 |
| #73 | Hermes 4 - Llama-3.1 70B (Reasoning) | Nous Research | 16.0 | 23 | 88 tok/s | N/A | $0.13 / $0.40 | Aug 27, 2025 |
| #74 | Gemma 4 31B (Non-reasoning) | Google | 32.3 | 39 | 28 tok/s | 262K | $0.14 / $0.40 | Apr 2, 2026 |
| #75 | V4 Flash (Reasoning, High Effort) | DeepSeek | 46.0 | 60 | N/A | 1.0M | $0.14 / $0.28 | Apr 24, 2026 |
| #76 | V4 Flash (Reasoning, Max Effort) | DeepSeek | 46.5 | 96 | 120 tok/s | 1.0M | $0.14 / $0.28 | Apr 24, 2026 |
| #77 | V4 Flash (Non-reasoning) | DeepSeek | 36.5 | 72 | 114 tok/s | 1.0M | $0.14 / $0.28 | Apr 24, 2026 |
| #78 | MiMo-V2.5 | Xiaomi | 49.0 | 69 | 82 tok/s | 1.0M | $0.14 / $0.28 | Apr 22, 2026 |
| #79 | Ring-flash-2.0 | InclusionAI | 14.0 | 15 | N/A | N/A | $0.14 / $0.57 | Sep 19, 2025 |
| #80 | Ling-flash-2.0 | InclusionAI | 15.7 | 17 | 72 tok/s | N/A | $0.14 / $0.57 | Sep 17, 2025 |
| #81 | gpt-oss-120b (low) | OpenAI | 24.5 | 43 | 367 tok/s | 131K | $0.15 / $0.60 | Aug 5, 2025 |
| #82 | gpt-oss-120b (high) | OpenAI | 33.3 | 59 | 367 tok/s | 131K | $0.15 / $0.60 | Aug 5, 2025 |
| #83 | Small 4 (Reasoning) | Mistral | 27.8 | 49 | 168 tok/s | N/A | $0.15 / $0.60 | Mar 16, 2026 |
| #84 | Ministral 3 8B | Mistral | 14.8 | 28 | 102 tok/s | N/A | $0.15 / $0.15 | Dec 2, 2025 |
| #85 | Small 4 (Non-reasoning) | Mistral | 18.6 | 33 | 156 tok/s | N/A | $0.15 / $0.60 | Mar 16, 2026 |
| #86 | GPT-4o mini | OpenAI | 12.6 | 13 | 70 tok/s | 128K | $0.15 / $0.60 | Jul 18, 2024 |
| #87 | Llama 3.2 Instruct 3B | Meta | 9.7 | 14 | 52 tok/s | N/A | $0.15 / $0.15 | Sep 25, 2024 |
| #88 | Gemini 2.0 Flash (Feb '25) | Google | 18.5 | 20 | N/A | N/A | $0.15 / $0.60 | Feb 5, 2025 |
| #89 | Solar Mini | Upstage | 11.9 | 17 | 77 tok/s | N/A | $0.15 / $0.15 | Jan 25, 2024 |
| #90 | Qwen3 32B (Non-reasoning) | Alibaba | 14.5 | 20 | 98 tok/s | 131K | $0.15 / $0.59 | Apr 28, 2025 |
| #91 | Qwen3 30B A3B 2507 Instruct | Alibaba | 15.0 | 25 | 108 tok/s | N/A | $0.15 / $0.40 | Jul 29, 2025 |
| #92 | Llama 4 Scout | Meta | 13.5 | 19 | 106 tok/s | 10.0M | $0.17 / $0.66 | Apr 5, 2025 |
| #93 | GLM-4.5-Air | Z AI | 23.2 | 21 | 69 tok/s | 131K | $0.17 / $0.98 | Jul 28, 2025 |
| #94 | Qwen3 8B (Non-reasoning) | Alibaba | 10.6 | 13 | 64 tok/s | 131K | $0.18 / $0.20 | Apr 28, 2025 |
| #95 | Qwen3 VL 8B (Reasoning) | Alibaba | 16.7 | 19 | 138 tok/s | N/A | $0.18 / $2.10 | Oct 14, 2025 |
| #96 | Qwen3 VL 8B Instruct | Alibaba | 14.3 | 23 | 148 tok/s | 256K | $0.18 / $0.70 | Oct 14, 2025 |
| #97 | Qwen3 Coder 30B A3B Instruct | Alibaba | 20.0 | 20 | 82 tok/s | 160K | $0.19 / $0.84 | Jul 31, 2025 |
| #98 | Qwen3 32B (Reasoning) | Alibaba | 16.5 | 22 | 98 tok/s | 131K | $0.20 / $0.52 | Apr 28, 2025 |
| #99 | GPT-5.4 nano (xhigh) | OpenAI | 44.0 | 58 | 165 tok/s | 400K | $0.20 / $1.25 | Mar 17, 2026 |
| #100 | GPT-5.4 nano (Non-Reasoning) | OpenAI | 24.4 | 32 | 158 tok/s | 400K | $0.20 / $1.25 | Mar 17, 2026 |

Showing 100 of 354 models

Understanding the AI Model Leaderboard

This AI model leaderboard helps you compare and choose large language models (LLMs) for your needs. We track standardized AI benchmarks, token pricing, inference speed, and model capabilities across major AI providers such as OpenAI, Anthropic, Google, Meta, and DeepSeek.

Core AI Benchmarks Explained

MMLU-Pro: Broad knowledge across 14 academic subjects
GPQA: PhD-level reasoning and problem-solving
AIME 2025: Elite mathematical reasoning
Coding Index: LiveCodeBench + SciCode composite
Math Index: AIME + MATH-500 composite

Key Metrics to Consider

Token Pricing: Input vs. output cost per 1M tokens
Inference Speed: Tokens per second, which drives response time
Release Date: Newer models reflect the latest techniques and knowledge
Benchmark Scores: 0-100% capability comparison

How to Choose the Right AI Model for Your Use Case

For Research & Analysis

Prioritize models with high MMLU-Pro (70%+) and GPQA (60%+) scores for complex reasoning tasks, academic research, and technical documentation.

For Cost Optimization

Sort by input/output pricing. For simple tasks, smaller models can often deliver roughly 80% of flagship performance at about 10% of the cost.

For Math & STEM

Filter by Math Index or AIME 2025 scores (50%+) for quantitative analysis, engineering calculations, and scientific applications.

All benchmark scores and pricing data are updated daily from Artificial Analysis to reflect the latest model versions and capabilities. Use the sort filters above to find AI models by intelligence, cost, coding ability, math performance, speed, or release date.
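
The cost-optimization advice above (pick the cheapest model that clears a quality bar) can be sketched in a few lines of Python. This is only an illustration: the rows are a small sample copied from the listing above, and the `Model` class, the `cheapest_above` helper, and the floor of 40 are hypothetical names and values chosen for the example, not part of the leaderboard or the Artificial Analysis API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    intelligence: float  # Artificial Analysis Intelligence Index
    input_price: float   # USD per 1M input tokens
    output_price: float  # USD per 1M output tokens


# A handful of rows copied from the listing above (not the full leaderboard).
MODELS = [
    Model("Qwen3.5 4B (Reasoning)", 27.1, 0.03, 0.15),
    Model("gpt-oss-20B (high)", 24.5, 0.05, 0.20),
    Model("MiMo-V2-Flash (Feb 2026)", 41.5, 0.10, 0.30),
    Model("Hy3-preview (Reasoning)", 41.9, 0.12, 0.43),
    Model("MiMo-V2.5", 49.0, 0.14, 0.28),
    Model("GPT-5.4 nano (xhigh)", 44.0, 0.20, 1.25),
]


def cheapest_above(models, min_intelligence):
    """Return models meeting the score floor, cheapest input price first."""
    qualifying = [m for m in models if m.intelligence >= min_intelligence]
    # Ties on input price are broken by output price.
    return sorted(qualifying, key=lambda m: (m.input_price, m.output_price))


# Hypothetical quality floor of 40 on the Intelligence Index.
for m in cheapest_above(MODELS, min_intelligence=40.0):
    print(f"{m.name}: {m.intelligence} at ${m.input_price:.2f} / ${m.output_price:.2f}")
```

Sorting on input price first mirrors the order of the listing; a workload that generates far more text than it reads would more sensibly sort on output price.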

Frequently Asked Questions

What is MMLU-Pro and why is it the standard AI intelligence benchmark?

MMLU-Pro (a harder, more robust successor to the Massive Multitask Language Understanding benchmark) is one of the most comprehensive AI benchmarks, testing models across 14 academic subjects including mathematics, science, history, law, and philosophy. Scores among tracked models range from 46% (basic competency) to 87% (near-expert level). Models scoring above 75% demonstrate strong general intelligence suitable for professional applications, while scores below 60% indicate limitations in complex reasoning tasks.

What does GPQA measure and which models score highest?

GPQA (Graduate-level Google-Proof Q&A) tests PhD-level reasoning with questions designed to be "Google-proof" - requiring deep understanding rather than simple fact retrieval. Top models like GPT-5.1 (87.3%), GPT-5 mini (82.8%), and o3 (82.7%) excel at GPQA, making them ideal for research, technical analysis, and complex problem-solving. Models below 50% GPQA struggle with advanced reasoning and may provide superficial answers to complex questions.

What is AIME 2025 and how does it evaluate AI mathematical ability?

AIME 2025 (American Invitational Mathematics Examination) is an elite math competition benchmark that tests advanced problem-solving, algebra, geometry, and number theory. Scores above 80% (like GPT-5 Codex at 98.7% or GPT-5.1 at 94%) indicate exceptional mathematical reasoning suitable for engineering, scientific computing, and quantitative analysis. Models scoring below 50% may struggle with multi-step mathematical problems or require explicit problem breakdown.

How is AI model pricing calculated and what's considered cost-effective?

AI model pricing is quoted per 1 million tokens (approximately 750,000 words). Input pricing covers the text you send, while output pricing covers generated responses. Low-cost models like GPT-5 nano cost $0.05/$0.40 per million tokens, mid-priced models like Llama 3.3 70B cost $0.54/$0.71, and premium models like GPT-5 cost $1.25/$10. For typical applications with a 3:1 input-to-output ratio, budget models can be roughly 10-20x cheaper than flagship models while maintaining 70-80% of their performance.
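
To make the 3:1 input-to-output ratio concrete, here is a minimal sketch of a blended price calculation using the per-million-token prices quoted above. The `blended_price` function and its weighting are an assumption made for illustration; the leaderboard's own blended price and Value score may be computed differently.

```python
def blended_price(input_price, output_price, input_parts=3, output_parts=1):
    """Weighted average USD price per 1M tokens for a given input:output mix."""
    total_parts = input_parts + output_parts
    return (input_price * input_parts + output_price * output_parts) / total_parts


# (input, output) prices in USD per 1M tokens, as quoted in the answer above.
PRICES = {
    "GPT-5 nano": (0.05, 0.40),
    "Llama 3.3 70B": (0.54, 0.71),
    "GPT-5": (1.25, 10.00),
}

flagship = blended_price(*PRICES["GPT-5"])
for name, (inp, out) in PRICES.items():
    blended = blended_price(inp, out)
    # Ratio of the flagship's blended price to this model's blended price.
    print(f"{name}: ${blended:.4f} per 1M blended tokens, GPT-5 costs {flagship / blended:.1f}x as much")
```

Changing `input_parts` and `output_parts` shows how sensitive the comparison is to the workload: output-heavy tasks widen the gap for models with expensive output tokens.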

Which AI models are best for coding and programming tasks?

Sort by Coding Index to see top programming models. Our Coding Index combines coding benchmarks including LiveCodeBench and SciCode. Top performers include GPT-5.1 (57.5 index), GPT-5 Codex (53.5), and GPT-5 mini (51.4). These models excel at code generation, debugging, refactoring, and explaining complex algorithms. For budget-conscious developers, models with 40+ coding index scores offer excellent value for routine programming tasks.

How often are AI model benchmarks and rankings updated?

Our leaderboard syncs daily with the Artificial Analysis API to ensure benchmark scores (MMLU-Pro, GPQA, AIME 2025), pricing, and inference speed data reflect the latest model versions. New model releases appear immediately under the "Newest" sort option. Benchmark scores can change when providers release updated versions. For example, GPT-5.1, released in November 2025, achieved 69.7 intelligence compared to 68.5 for GPT-5, released in August 2025.

What inference speed (tokens/second) do I need for my application?

Inference speed determines how fast models generate responses. For real-time chatbots and interactive applications, target 100+ tokens/second (models like gpt-oss-120b at 367 tok/s). For background processing and batch jobs, 50-100 tok/s is sufficient. Premium reasoning models like GPT-5 (103 tok/s) balance speed and capability. Note that higher inference speed doesn't always mean better quality: slower models often deliver more thoughtful, detailed responses.
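
As a rough way to translate tokens per second into waiting time, the sketch below divides a response length by throughput. The 500-token response length and the `generation_seconds` helper are hypothetical, and the estimate is a simplification: it ignores time to first token, network latency, and any hidden reasoning tokens a model generates before its visible answer.

```python
def generation_seconds(output_tokens: int, tokens_per_second: float) -> float:
    """Estimate seconds to generate a response at a steady throughput."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return output_tokens / tokens_per_second


RESPONSE_TOKENS = 500  # hypothetical response length, chosen for illustration

for label, speed in [
    ("interactive target (100 tok/s)", 100),
    ("GPT-5 (103 tok/s)", 103),
    ("batch floor (50 tok/s)", 50),
]:
    print(f"{label}: {generation_seconds(RESPONSE_TOKENS, speed):.1f} s")
```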

Can I test these AI models for free before committing?

Yes! Try our free AI chat interface to test different models instantly without creating an account. Many providers also offer free tiers: OpenAI (ChatGPT with daily limits), Anthropic (Claude with usage caps), and Google (Gemini free tier). Open-source models like Llama 3.3 are also freely available. Compare performance on your specific use case before upgrading to paid plans.