Step3

by StepFun

Step3 is a multimodal reasoning model built on a Mixture-of-Experts architecture with 321B total parameters, of which 38B are active. It is designed end-to-end to minimize decoding costs while delivering top-tier performance in vision–language reasoning. Through the co-design of Multi-Matrix Factorization Attention (MFA) and Attention-FFN Disaggregation (AFD), Step3 remains efficient on both flagship and low-end accelerators.
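To put the parameter counts in perspective, the short sketch below works out what share of the model is active. It assumes that "active" means the parameters used per token, as is usual for Mixture-of-Experts models; the function and constant names are illustrative and not part of any StepFun API.

```python
# Figures from the description above, in billions of parameters.
TOTAL_PARAMS_B = 321
ACTIVE_PARAMS_B = 38


def active_fraction(active_b: float, total_b: float) -> float:
    """Return the share of parameters that are active (assumed: per token)."""
    if total_b <= 0 or not 0 <= active_b <= total_b:
        raise ValueError("expected 0 <= active_b <= total_b and total_b > 0")
    return active_b / total_b


fraction = active_fraction(ACTIVE_PARAMS_B, TOTAL_PARAMS_B)
print(f"Active share: {fraction:.1%}")  # prints "Active share: 11.8%"
```

Roughly one parameter in eight is active under this assumption, which is one reason a sparse model of this size can be cheaper to decode than a dense model with the same total parameter count.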

Capabilities

Vision

Pricing

Input Tokens
Free (per 1M tokens)
Output Tokens
Free (per 1M tokens)
Image Processing
$0.00 per 1M tokens

Supported Modalities

Input

image
text

Output

text

Specifications

Context Length
66K tokens
Provider
StepFun
Released
Aug 28, 2025
Model ID
stepfun-ai/step3
