Qwen2.5 VL 32B Instruct: Pricing, Context Window & Benchmarks

Name: Qwen2.5 VL 32B Instruct
Brand: Qwen
Price: 0.2 USD

32B

by Qwen

Qwen2.5-VL-32B is a multimodal vision-language model fine-tuned through reinforcement learning for enhanced mathematical reasoning, structured outputs, and visual problem-solving capabilities. It excels at visual analysis tasks, including object recognition, textual interpretation within images, and precise event localization in extended videos. Qwen2.5-VL-32B demonstrates state-of-the-art performance across multimodal benchmarks such as MMMU, MathVista, and VideoMME, while maintaining strong reasoning and clarity in text-based tasks like MMLU, mathematical problem-solving, and code generation.

Chat with Qwen2.5 VL 32B Instruct

Input Price

$0.20/1M tokens

Output Price

$0.20/1M tokens

Intelligence

12.9

Coding

N/A

What you can do with Qwen2.5 VL 32B Instruct

Everyday Q&A and clear explanations

Writing help (emails, posts, summaries)

Idea generation and brainstorming

Learning support with step-by-step guidance

Composite Indices

Intelligence, Coding, Math

Standard Benchmarks

Academic and industry benchmarks

Benchmark Highlights

5 tests

GPQA

41.7%

MMLU Pro

63.5%

LiveCodeBench

29.5%

Math 500

76.7%

AIME 2025

N/A

HLE

3.8%

Metric	Value
Provider	Qwen
Context Window	128,000 tokens
Input Price	$0.20/1M tokens
Output Price	$0.20/1M tokens
Release Date	Nov 11, 2024
Modalities	text, image
Capabilities	Vision