gpt-oss-20b: Pricing, Context Window & Benchmarks
20Bby OpenAI
gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 license. It uses a Mixture-of-Experts (MoE) architecture with 3.6B active parameters per forward pass, optimized for lower-latency inference and deployability on consumer or single-GPU hardware. The model is trained in OpenAI’s Harmony response format and supports reasoning level configuration, fine-tuning, and agentic capabilities including function calling, tool use, and structured outputs.
What you can do with gpt-oss-20b
Everyday Q&A and clear explanations
Writing help (emails, posts, summaries)
Idea generation and brainstorming
Learning support with step-by-step guidance
Composite Indices
Intelligence, Coding, Math
Standard Benchmarks
Academic and industry benchmarks
Benchmark Highlights
| Metric | Value |
|---|---|
| Provider | OpenAI |
| Context Window | 131,072 tokens |
| Input Price | $0.07/1M tokens |
| Output Price | $0.23/1M tokens |
| Release Date | Aug 5, 2025 |
| Modalities | text |
| Capabilities | N/A |
Compare gpt-oss-20b to other models
See how it stacks up on price, quality, and overall performance.
Frequently asked questions
What is gpt-oss-20b good for?
Use gpt-oss-20b for everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.
How much does gpt-oss-20b cost?
Pricing is based on usage. Current rates are $0.07/1M tokens for input and $0.23/1M tokens for output.
Can I try gpt-oss-20b for free?
Yes. You can start a chat instantly and test the model before deciding on a plan.
Does gpt-oss-20b support images or audio?
gpt-oss-20b focuses on text-based tasks.
Suggested comparisons
Similar models
Benchmarks and pricing are sourced from Artificial Analysis where available. OpenRouter specs are used as a fallback.
Compare Models
Select a model to compare with gpt-oss-20b