InternVL3 78B: Pricing, Context Window & Benchmarks
78Bby OpenGVLab
The InternVL3 series is an advanced multimodal large language model (MLLM). Compared to InternVL 2.5, InternVL3 demonstrates stronger multimodal perception and reasoning capabilities. In addition, InternVL3 is benchmarked against the Qwen2.5 Chat models, whose pre-trained base models serve as the initialization for its language component. Benefiting from Native Multimodal Pre-Training, the InternVL3 series surpasses the Qwen2.5 series in overall text performance.
What you can do with InternVL3 78B
Everyday Q&A and clear explanations
Writing help (emails, posts, summaries)
Idea generation and brainstorming
Learning support with step-by-step guidance
Benchmarks not available
This model isn't listed on Artificial Analysis yet. Showing OpenRouter specs below.
| Metric | Value |
|---|---|
| Provider | OpenGVLab |
| Context Window | 32,768 tokens |
| Input Price | $0.15/1M tokens |
| Output Price | $0.60/1M tokens |
| Release Date | Sep 15, 2025 |
| Modalities | image, text |
| Capabilities | Vision |
Compare InternVL3 78B to other models
See how it stacks up on price, quality, and overall performance.
Frequently asked questions
What is InternVL3 78B good for?
Use InternVL3 78B for everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.
How much does InternVL3 78B cost?
Pricing is based on usage. Current rates are $0.15/1M tokens for input and $0.60/1M tokens for output.
Can I try InternVL3 78B for free?
Yes. You can start a chat instantly and test the model before deciding on a plan.
Does InternVL3 78B support images or audio?
InternVL3 78B can understand images.
Similar models
Pricing, context, and capability data are sourced from OpenRouter.
Compare Models
Select a model to compare with InternVL3 78B