Qwen AI Models

Alibaba Cloud's model team. Builds the Qwen series covering text, vision, code, and audio.

Founded2023Hangzhou, China12 ModelsWebsite →

Qwen3.5-Flash

Qwen

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency.

Context1.0M

Speed310 tok/s

InputText, Image, Video

OutputText

ReasoningYes

Details →

Qwen3.5 397B A17B

Qwen

The Qwen3.5 series 397B-A17B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency.

Context262K

Speed52 tok/s

InputText, Image, Video

OutputText

ReasoningYes

Details →

Qwen3.5-35B-A3B

Qwen

The Qwen3.5 Series 35B-A3B is a native vision-language model designed with a hybrid architecture that integrates linear attention mechanisms and a sparse mixture-of-experts model, achieving higher inference efficiency.

Context262K

Speed144 tok/s

InputText, Image, Video

OutputText

ReasoningYes

Details →

Qwen3.5 Plus

Qwen

The Qwen3.5 native vision-language series Plus models are built on a hybrid architecture that integrates linear attention mechanisms with sparse mixture-of-experts models, achieving higher inference efficiency.

Context1.0M

Speed49 tok/s

InputText, Image, Video

OutputText

ReasoningYes

Details →

Qwen3.5-9B

Qwen

Qwen3.5-9B is a multimodal foundation model from the Qwen3.5 family, designed to deliver strong reasoning, coding, and visual understanding in an efficient 9B-parameter architecture.

Context256K

Speed177 tok/s

InputText, Image, Video

OutputText

ReasoningYes

Details →

Qwen3 Coder Next

Qwen

Qwen3-Coder-Next is an open-weight causal language model optimized for coding agents and local development workflows.

Context262K

Speed147 tok/s

InputText

OutputText

ReasoningNo

Details →

Qwen3.5-27B

Qwen

The Qwen3.5 27B native vision-language Dense model incorporates a linear attention mechanism, delivering fast response times while balancing inference speed and performance.

Context262K

Speed283 tok/s

InputText, Image, Video

OutputText

ReasoningYes

Details →

Qwen3.5-122B-A10B

Qwen

The Qwen3.5 122B-A10B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency.

Context262K

Speed145 tok/s

InputText, Image, Video

OutputText

ReasoningYes

Details →

Qwen3 VL 8B Instruct

Qwen

Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video.

Context131K

Speed145 tok/s

InputImage, Text

OutputText

ReasoningNo

Details →

Qwen3 VL 32B Instruct

Qwen

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video.

Context131K

Speed107 tok/s

InputText, Image

OutputText

ReasoningNo

Details →

Qwen3 Max Thinking

Qwen

Qwen3-Max-Thinking is the flagship reasoning model in the Qwen3 series, designed for high-stakes cognitive tasks that require deep, multi-step reasoning.

Context262K

Speed42 tok/s

InputText

OutputText

ReasoningYes

Details →

Qwen3 VL 8B Thinking

Qwen

Qwen3-VL-8B-Thinking is the reasoning-optimized variant of the Qwen3-VL-8B multimodal model, designed for advanced visual and textual reasoning across complex scenes, documents, and temporal sequences.

Context131K

SpeedN/A

InputImage, Text

OutputText

ReasoningYes

Details →