Qwen logo

Qwen3.5-Flash: Pricing, Context Window & Benchmarks

by Qwen

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the 3 series, these models deliver a leap forward in performance for both pure text and multimodal tasks, offering fast response times while balancing inference speed and overall performance.

Chat with Qwen3.5-Flash
Input Price
$0.10/1M tokens
Output Price
$0.40/1M tokens
Context Window
1,000,000 tokens
Modalities
text, image, video

What you can do with Qwen3.5-Flash

Everyday Q&A and clear explanations

Writing help (emails, posts, summaries)

Idea generation and brainstorming

Learning support with step-by-step guidance

Benchmarks not available

This model isn't listed on Artificial Analysis yet. Showing OpenRouter specs below.

Metric Value
Provider Qwen
Context Window 1,000,000 tokens
Input Price $0.10/1M tokens
Output Price $0.40/1M tokens
Release Date Feb 25, 2026
Modalities text, image, video
Capabilities Vision

Compare Qwen3.5-Flash to other models

See how it stacks up on price, quality, and overall performance.

Frequently asked questions

What is Qwen3.5-Flash good for?

Use Qwen3.5-Flash for everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.

How much does Qwen3.5-Flash cost?

Pricing is based on usage. Current rates are $0.10/1M tokens for input and $0.40/1M tokens for output.

Can I try Qwen3.5-Flash for free?

Yes. You can start a chat instantly and test the model before deciding on a plan.

Does Qwen3.5-Flash support images or audio?

Qwen3.5-Flash can understand images.

Pricing, context, and capability data are sourced from OpenRouter.