Qwen3.5-Flash: Pricing, Context Window & Benchmarks
by Qwen
The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the 3 series, these models deliver a leap forward in performance for both pure text and multimodal tasks, offering fast response times while balancing inference speed and overall performance.
What you can do with Qwen3.5-Flash
Everyday Q&A and clear explanations
Writing help (emails, posts, summaries)
Idea generation and brainstorming
Learning support with step-by-step guidance
Benchmarks not available
This model isn't listed on Artificial Analysis yet. Showing OpenRouter specs below.
| Metric | Value |
|---|---|
| Provider | Qwen |
| Context Window | 1,000,000 tokens |
| Input Price | $0.10/1M tokens |
| Output Price | $0.40/1M tokens |
| Release Date | Feb 25, 2026 |
| Modalities | text, image, video |
| Capabilities | Vision |
Compare Qwen3.5-Flash to other models
See how it stacks up on price, quality, and overall performance.
Frequently asked questions
What is Qwen3.5-Flash good for?
Use Qwen3.5-Flash for everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.
How much does Qwen3.5-Flash cost?
Pricing is based on usage. Current rates are $0.10/1M tokens for input and $0.40/1M tokens for output.
Can I try Qwen3.5-Flash for free?
Yes. You can start a chat instantly and test the model before deciding on a plan.
Does Qwen3.5-Flash support images or audio?
Qwen3.5-Flash can understand images.
Similar models
Pricing, context, and capability data are sourced from OpenRouter.
Compare Models
Select a model to compare with Qwen3.5-Flash