Molmo2 8B: Pricing, Context Window & Benchmarks
8B by AllenAI
Molmo2-8B is an open vision-language model developed by the Allen Institute for AI (Ai2) as part of the Molmo2 family, supporting image, video, and multi-image understanding and grounding. It is based on Qwen3-8B and uses SigLIP 2 as its vision backbone, outperforming other open-weight, open-data models on short videos, counting, and captioning, while remaining competitive on long-video tasks.
What you can do with Molmo2 8B
Everyday Q&A and clear explanations
Writing help (emails, posts, summaries)
Idea generation and brainstorming
Learning support with step-by-step guidance
Benchmark Highlights
| Metric | Value |
|---|---|
| Provider | AllenAI |
| Context Window | 36,864 tokens |
| Input Price | $0.00/1M tokens |
| Output Price | $0.00/1M tokens |
| Release Date | Dec 11, 2025 |
| Modalities | text, image, video |
| Capabilities | Vision |
Frequently asked questions
What is Molmo2 8B good for?
Use Molmo2 8B for everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.
How much does Molmo2 8B cost?
Pricing is based on usage. Current rates are $0.00/1M tokens for input and $0.00/1M tokens for output.
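As a rough illustration of usage-based pricing, the sketch below multiplies token counts by per-1M-token rates. The function name `estimate_cost` and the non-zero rates in the second call are hypothetical and used only to show the arithmetic. The only rates taken from this page are the listed $0.00 values.

```python
def estimate_cost(input_tokens: int, output_tokens: int,
                  input_price_per_million: float,
                  output_price_per_million: float) -> float:
    """Return the usage cost in dollars for one request."""
    input_cost = input_tokens / 1_000_000 * input_price_per_million
    output_cost = output_tokens / 1_000_000 * output_price_per_million
    return input_cost + output_cost


# Rates listed on this page for Molmo2 8B: $0.00 per 1M tokens each way.
listed_cost = estimate_cost(20_000, 1_500, 0.00, 0.00)

# Hypothetical non-zero rates, only to show how the formula scales.
example_cost = estimate_cost(2_000_000, 1_000_000, 0.50, 1.00)

print(f"Listed rates: ${listed_cost:.2f}")
print(f"Hypothetical rates: ${example_cost:.2f}")
```

At the listed rates the result is zero for any token count, so the formula only matters if the rates change.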
Can I try Molmo2 8B for free?
Yes. You can start a chat instantly and test the model before deciding on a plan.
Does Molmo2 8B support images or audio?
Molmo2 8B can understand images and video. Audio is not among its listed modalities (text, image, video).
Benchmarks and pricing are sourced from Artificial Analysis where available. OpenRouter specs are used as a fallback.