Model Comparison

R1 Distill Qwen 32B
vs. V3 (Dec '24)

Comparing 2 AI models · 12 benchmarks · DeepSeek

Recommended Pick

R1 Distill Qwen 32B 11 metric wins

Strongest on: Input price, Output price, Reasoning

Lowest Price

R1 Distill Qwen 32B

$0.00/1M input price

Best Reasoning

R1 Distill Qwen 32B

52.4 reasoning score

Blends available reasoning benchmarks

Composite Indices

Higher is better; speed and price are normalized

Standard Benchmarks

Only benchmarks with data are shown

Differences That Matter

Price gap

R1 Distill Qwen 32B is ∞x cheaper on input tokens than V3 (Dec '24).

Reasoning gap

R1 Distill Qwen 32B leads V3 (Dec '24) by 18.7 points on reasoning.

Top-pick rationale

R1 Distill Qwen 32B wins 11 measurable categories, including Input price, Output price, Reasoning, Intelligence.

Live compare

Response Face-Off

Run one prompt through the selected models and compare response quality with live speed and cost context.

R1 Distill Qwen 32B

DeepSeek

Waiting

TTFT

—

Time

—

tok/s

—

Tokens

—

Cost

—

Waiting

V3 (Dec '24)

DeepSeek

Waiting

TTFT

—

Time

—

tok/s

—

Tokens

—

Cost

—

Waiting

Which answer was more useful?

Chat with leading AI models

Use Claude, ChatGPT, Gemini alongside with EU-Hosted Models like Deepseek, Qwen & Kimi.

Chat for free

EU-hosted inference

Servers in Germany & Finland. Designed to meet strict GDPR and ISO 27001 compliance requirements.

Get API access

Full Comparison

Metric	Top Pick De R1 Distill Qwen 32B DeepSeek	De V3 (Dec '24) DeepSeek
Pricing per 1M tokens
Input Cost	$0.00/1M	$0.40/1M
Output Cost	$0.00/1M	$0.89/1M
Blended (3:1)	—	$0.52/1M
Specifications
Organization	DeepSeek	DeepSeek
Release Date	Jan 20, 2025	Dec 26, 2024
Performance & Speed
Throughput	—	—
TTFT	—	—
Latency	—	—
Composite Indices
Value Score	—	100.0
Reasoning Score	52.4	33.7
Intelligence	11.0	10.4
Math	63.0	26.0
Standard Benchmarks
GPQA	61.5%	55.7%
MMLU Pro	73.9%	75.2%
HLE	5.5%	3.6%
LiveCodeBench	27.0%	35.9%
MATH 500	94.1%	88.7%
AIME 2025	63.0%	26.0%
AIME (Original)	68.7%	25.3%
SciCode	37.6%	35.4%
LCR	9.7%	29.0%
IFBench	22.9%	34.8%
TAU-bench v2	—	22.8%
TerminalBench Hard	—	6.8%