Qwen3 8B: Pricing, Context Window & Benchmarks

by Qwen

Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math, coding, and logical inference, and "non-thinking" mode for general conversation. The model is fine-tuned for instruction-following, agent integration, creative writing, and multilingual use across 100+ languages and dialects. It natively supports a 32K token context window and can extend to 131K tokens with YaRN scaling.
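As a rough illustration of the context-extension figures above, the sketch below works out the scaling factor implied by going from 32K to 131K tokens. It assumes "32K" means 32,768 and "131K" means 131,072 tokens. The `rope_scaling` key names follow a convention commonly seen in Hugging Face-style model configs and are an assumption here, not something stated on this page; check the model card before relying on them.

```python
# Assumed exact token counts behind the rounded "32K" and "131K" figures.
NATIVE_CONTEXT = 32_768
EXTENDED_CONTEXT = 131_072


def yarn_factor(native: int, target: int) -> float:
    """Return the scaling factor needed to stretch `native` to `target` tokens."""
    if native <= 0 or target < native:
        raise ValueError("target must be at least the (positive) native length")
    return target / native


# Hypothetical config fragment; key names are an assumption (see note above).
rope_scaling = {
    "rope_type": "yarn",
    "factor": yarn_factor(NATIVE_CONTEXT, EXTENDED_CONTEXT),
    "original_max_position_embeddings": NATIVE_CONTEXT,
}

extended = int(rope_scaling["factor"] * rope_scaling["original_max_position_embeddings"])
print(rope_scaling["factor"], extended)
```

Under these assumptions the factor comes out to 4, which is why the extended window is roughly four times the native one.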

Metric	Value
Input Price	$0.18/1M tokens
Output Price	$2.10/1M tokens
Intelligence	13.1
Coding	9.0

What you can do with Qwen3 8B

Everyday Q&A and clear explanations

Writing help (emails, posts, summaries)

Idea generation and brainstorming

Learning support with step-by-step guidance

Benchmark Highlights

Benchmark	Score
GPQA	58.9%
MMLU Pro	74.3%
LiveCodeBench	40.6%
Math 500	90.4%
AIME 2025	19.0%
HLE	4.2%

Metric	Value
Provider	Qwen
Context Window	32,000 tokens
Input Price	$0.18/1M tokens
Output Price	$2.10/1M tokens
Release Date	Apr 28, 2025
Modalities	text
Capabilities	N/A

Frequently asked questions

What is Qwen3 8B good for?

Use Qwen3 8B for everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.

How much does Qwen3 8B cost?

Pricing is based on usage. Current rates are $0.18/1M tokens for input and $2.10/1M tokens for output.
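
To make the per-token rates concrete, here is a minimal sketch that estimates the cost of a single request from the two rates quoted above. The helper name `estimate_cost` and the example token counts are hypothetical, and the result is only an estimate: it assumes billing is strictly linear in tokens with no minimums, rounding, or plan discounts.

```python
from decimal import Decimal

# Rates quoted on this page, in USD per 1M tokens.
INPUT_USD_PER_M = Decimal("0.18")
OUTPUT_USD_PER_M = Decimal("2.10")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_M
        + Decimal(output_tokens) * OUTPUT_USD_PER_M
    )
    return total / TOKENS_PER_M


# Example: a 2,000-token prompt that produces a 500-token reply.
cost = estimate_cost(2_000, 500)
print(f"Estimated cost: ${cost}")
```

Because output tokens cost more than ten times as much as input tokens at these rates, long replies (including any reasoning output in "thinking" mode, if it is billed as output) tend to dominate the bill.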

Can I try Qwen3 8B for free?

Yes. You can start a chat instantly and test the model before deciding on a plan.

Does Qwen3 8B support images or audio?

Qwen3 8B focuses on text-based tasks.

Benchmarks and pricing are sourced from Artificial Analysis where available. OpenRouter specs are used as a fallback.