LLMs by Category

Top AI Models by Category

Compare the latest models across open source, proprietary, uncensored, coding, math, speed, and release freshness.

Most Used AI Models

Popular picks across OpenRouter.

Xiaomi logo

MiMo-V2-Pro

Xiaomi

MiMo-V2-Pro is Xiaomi's flagship foundation model, featuring over 1T total parameters and a 1M context length, deeply optimized for agentic scenarios.

Context 1.0M
Speed 123 tok/s
Input Text
Output Text
Reasoning Yes
MiniMax logo

MiniMax M2.7

MiniMax

MiniMax-M2.7 is a next-generation large language model designed for autonomous, real-world productivity and continuous improvement.

Context 205K
Speed 40 tok/s
Input Text
Output Text
Reasoning Yes
Z.ai logo

GLM 5 Turbo

Z.ai

GLM-5 Turbo is a new model from Z.ai designed for fast inference and strong performance in agent-driven environments such as OpenClaw scenarios.

Context 203K
Speed 94 tok/s
Input Text
Output Text
Reasoning Yes
xAI logo

Grok 4.1 Fast

xAI

Grok 4.1 Fast is xAI's best agentic tool calling model that shines in real-world use cases like customer support and deep research. 2M context window.

Context 2.0M
Speed 198 tok/s
Input Text, Image, File
Output Text
Reasoning Yes
MiniMax logo

MiniMax M2.5

MiniMax

MiniMax-M2.5 is a SOTA large language model designed for real-world productivity.

Context 197K
Speed 59 tok/s
Input Text
Output Text
Reasoning Yes
Xiaomi logo

MiMo-V2-Omni

Xiaomi

MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture.

Context 262K
Speed N/A
Input Text, Audio, Image, Video
Output Text
Reasoning Yes
MoonshotAI logo

Kimi K2.5

MoonshotAI

Kimi K2.5 is Moonshot AI's native multimodal model, delivering state-of-the-art visual coding capability and a self-directed agent swarm paradigm.

Context 262K
Speed 34 tok/s
Input Text, Image
Output Text
Reasoning Yes
Z.ai logo

GLM 5

Z.ai

GLM-5 is Z.ai’s flagship open-source foundation model engineered for complex systems design and long-horizon agent workflows.

Context 80K
Speed 57 tok/s
Input Text
Output Text
Reasoning Yes
OpenAI logo

GPT-5.4

OpenAI

GPT-5.4 is OpenAI’s latest frontier model, unifying the Codex and GPT lines into a single system.

Context 1.1M
Speed 200 tok/s
Input Text, Image, File
Output Text
Reasoning Yes
Google logo

Gemini 3.1 Pro Preview

Google

Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows.

Context 1.0M
Speed 120 tok/s
Input Audio, File, Image, Text, Video
Output Text
Reasoning Yes

Top Open Source AI Models

Community-driven, inspectable weights.

Z.ai logo

GLM 5

Z.ai

GLM-5 is Z.ai’s flagship open-source foundation model engineered for complex systems design and long-horizon agent workflows.

Context 80K
Speed 57 tok/s
Input Text
Output Text
Reasoning Yes
MiniMax logo

MiniMax M2.7

MiniMax

MiniMax-M2.7 is a next-generation large language model designed for autonomous, real-world productivity and continuous improvement.

Context 205K
Speed 40 tok/s
Input Text
Output Text
Reasoning Yes
Xiaomi logo

MiMo-V2-Pro

Xiaomi

MiMo-V2-Pro is Xiaomi's flagship foundation model, featuring over 1T total parameters and a 1M context length, deeply optimized for agentic scenarios.

Context 1.0M
Speed 123 tok/s
Input Text
Output Text
Reasoning Yes
MoonshotAI logo

Kimi K2.5

MoonshotAI

Kimi K2.5 is Moonshot AI's native multimodal model, delivering state-of-the-art visual coding capability and a self-directed agent swarm paradigm.

Context 262K
Speed 34 tok/s
Input Text, Image
Output Text
Reasoning Yes
Z.ai logo

GLM 5 Turbo

Z.ai

GLM-5 Turbo is a new model from Z.ai designed for fast inference and strong performance in agent-driven environments such as OpenClaw scenarios.

Context 203K
Speed 94 tok/s
Input Text
Output Text
Reasoning Yes
Qwen logo

Qwen3.5 397B A17B

Qwen

The Qwen3.5 series 397B-A17B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency.

Context 262K
Speed 56 tok/s
Input Text, Image, Video
Output Text
Reasoning Yes
Xiaomi logo

MiMo-V2-Omni

Xiaomi

MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture.

Context 262K
Speed N/A
Input Text, Audio, Image, Video
Output Text
Reasoning Yes
Z.ai logo

GLM 5V Turbo

NEW

Z.ai

GLM-5V-Turbo is Z.ai’s first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks.

Context 203K
Speed N/A
Input Image, Text, Video
Output Text
Reasoning Yes
Qwen logo

Qwen3.5-27B

Qwen

The Qwen3.5 27B native vision-language Dense model incorporates a linear attention mechanism, delivering fast response times while balancing inference speed and performance.

Context 262K
Speed 354 tok/s
Input Text, Image, Video
Output Text
Reasoning Yes
Z.ai logo

GLM 4 32B

Z.ai

GLM 4 32B is a cost-effective foundation language model.

Context 128K
Speed 75 tok/s
Input Text
Output Text
Reasoning No

Top Proprietary AI Models

Frontier closed models.

OpenAI logo

GPT-5.4

OpenAI

GPT-5.4 is OpenAI’s latest frontier model, unifying the Codex and GPT lines into a single system.

Context 1.1M
Speed 200 tok/s
Input Text, Image, File
Output Text
Reasoning Yes
Google logo

Gemini 3.1 Pro Preview

Google

Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows.

Context 1.0M
Speed 120 tok/s
Input Audio, File, Image, Text, Video
Output Text
Reasoning Yes
OpenAI logo

GPT-5.3-Codex

OpenAI

GPT-5.3-Codex is OpenAI’s most advanced agentic coding model, combining the frontier software engineering performance of GPT-5.2-Codex with the broader reasoning and professional knowledge capabilities of GPT-5.2.

Context 400K
Speed 164 tok/s
Input Text, Image, File
Output Text
Reasoning Yes
xAI logo

Grok 4.20

NEW

xAI

Grok 4.20 is xAI's newest flagship model with industry-leading speed and agentic tool calling capabilities.

Context 2.0M
Speed 248 tok/s
Input Text, Image
Output Text
Reasoning Yes
OpenAI logo

GPT-5.4 Mini

OpenAI

GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads.

Context 400K
Speed 182 tok/s
Input File, Image, Text
Output Text
Reasoning Yes
OpenAI logo

GPT-5.1-Codex-Max

OpenAI

GPT-5.1-Codex-Max is OpenAI’s latest agentic coding model, designed for long-running, high-context software development tasks.

Context 400K
Speed 97 tok/s
Input Text, Image
Output Text
Reasoning Yes
Google logo

Gemini 3.1 Flash Lite Preview

Google

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases.

Context 1.0M
Speed 312 tok/s
Input Text, Image, Video, File, Audio
Output Text
Reasoning Yes
OpenAI logo

GPT-5.1-Codex-Mini

OpenAI

GPT-5.1-Codex-Mini is a smaller and faster version of GPT-5.1-Codex

Context 400K
Speed 191 tok/s
Input Image, Text
Output Text
Reasoning Yes
xAI logo

Grok 4.1 Fast

xAI

Grok 4.1 Fast is xAI's best agentic tool calling model that shines in real-world use cases like customer support and deep research. 2M context window.

Context 2.0M
Speed 198 tok/s
Input Text, Image, File
Output Text
Reasoning Yes
OpenAI logo

GPT-5 Image

OpenAI

It offers major improvements in reasoning, code quality, and user experience while incorporating GPT Image 1's superior instruction following, text rendering, and detailed image editing.

Context 400K
Speed 127 tok/s
Input Image, Text, File
Output Image, Text
Reasoning Yes

Top Coding AI Models

Models tuned for code and developer workflows.

OpenAI logo

GPT-5.4

OpenAI

GPT-5.4 is OpenAI’s latest frontier model, unifying the Codex and GPT lines into a single system.

Context 1.1M
Speed 200 tok/s
Input Text, Image, File
Output Text
Reasoning Yes
Google logo

Gemini 3.1 Pro Preview

Google

Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows.

Context 1.0M
Speed 120 tok/s
Input Audio, File, Image, Text, Video
Output Text
Reasoning Yes
OpenAI logo

GPT-5.3-Codex

OpenAI

GPT-5.3-Codex is OpenAI’s most advanced agentic coding model, combining the frontier software engineering performance of GPT-5.2-Codex with the broader reasoning and professional knowledge capabilities of GPT-5.2.

Context 400K
Speed 164 tok/s
Input Text, Image, File
Output Text
Reasoning Yes
OpenAI logo

GPT-5.4 Mini

OpenAI

GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads.

Context 400K
Speed 182 tok/s
Input File, Image, Text
Output Text
Reasoning Yes
OpenAI logo

GPT-5.1-Codex-Max

OpenAI

GPT-5.1-Codex-Max is OpenAI’s latest agentic coding model, designed for long-running, high-context software development tasks.

Context 400K
Speed 97 tok/s
Input Text, Image
Output Text
Reasoning Yes
Z.ai logo

GLM 5

Z.ai

GLM-5 is Z.ai’s flagship open-source foundation model engineered for complex systems design and long-horizon agent workflows.

Context 80K
Speed 57 tok/s
Input Text
Output Text
Reasoning Yes
Google logo

Gemini 3.1 Flash Lite Preview

Google

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases.

Context 1.0M
Speed 312 tok/s
Input Text, Image, Video, File, Audio
Output Text
Reasoning Yes
xAI logo

Grok 4.20

NEW

xAI

Grok 4.20 is xAI's newest flagship model with industry-leading speed and agentic tool calling capabilities.

Context 2.0M
Speed 248 tok/s
Input Text, Image
Output Text
Reasoning Yes
MiniMax logo

MiniMax M2.7

MiniMax

MiniMax-M2.7 is a next-generation large language model designed for autonomous, real-world productivity and continuous improvement.

Context 205K
Speed 40 tok/s
Input Text
Output Text
Reasoning Yes
Xiaomi logo

MiMo-V2-Pro

Xiaomi

MiMo-V2-Pro is Xiaomi's flagship foundation model, featuring over 1T total parameters and a 1M context length, deeply optimized for agentic scenarios.

Context 1.0M
Speed 123 tok/s
Input Text
Output Text
Reasoning Yes

Top OCR AI Models

Models specialised in optical character recognition and document extraction.

PaddlePaddle logo

PaddleOCR-VL-0.9B

PaddlePaddle

Baidu's 0.9B vision-language OCR model combining a NaViT-style dynamic-resolution encoder with ERNIE-4.5-0.3B. Handles multilingual text, tables, charts, and formulas across 16K context — optimized for efficient on-device document parsing.

Context 16K
Speed N/A
Input Text, Image
Output Text
Reasoning No
AllenAI logo

olmOCR-2-7B

AllenAI

Allen AI's 7B OCR model fine-tuned from Qwen2.5-VL-7B on curated academic papers and technical documentation. Supports 128K context and extracts structured text from PDFs and scanned documents with high fidelity.

Context 128K
Speed N/A
Input Text, Image
Output Text
Reasoning No
DeepSeek logo

DeepSeek-OCR

DeepSeek

DeepSeek's ~3B MoE OCR model using optical context compression to encode full pages into compact token sequences. Outputs structured Markdown preserving text layout, tables, and mathematical formulas from images and PDFs.

Context N/A
Speed N/A
Input Text, Image
Output Text
Reasoning No
Mistral AI logo

Mistral OCR

Mistral AI

Mistral's dedicated document understanding model (December 2025). Processes PDFs and images page-by-page via API, returning structured Markdown with preserved tables, equations, image bounding boxes, and rich layout metadata.

Context N/A
Speed N/A
Input Image, Pdf
Output Text
Reasoning No

Top Math AI Models

Math and reasoning specialists.

OpenAI logo

GPT-5.4

OpenAI

GPT-5.4 is OpenAI’s latest frontier model, unifying the Codex and GPT lines into a single system.

Context 1.1M
Speed 200 tok/s
Input Text, Image, File
Output Text
Reasoning Yes
OpenAI logo

GPT-5.3-Codex

OpenAI

GPT-5.3-Codex is OpenAI’s most advanced agentic coding model, combining the frontier software engineering performance of GPT-5.2-Codex with the broader reasoning and professional knowledge capabilities of GPT-5.2.

Context 400K
Speed 164 tok/s
Input Text, Image, File
Output Text
Reasoning Yes
Google logo

Gemini 3.1 Flash Lite Preview

Google

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases.

Context 1.0M
Speed 312 tok/s
Input Text, Image, Video, File, Audio
Output Text
Reasoning Yes
DeepSeek logo

DeepSeek V3.2 Speciale

DeepSeek

DeepSeek-V3.2-Speciale is a high-compute variant of DeepSeek-V3.2 optimized for maximum reasoning and agentic performance.

Context 164K
Speed 46 tok/s
Input Text
Output Text
Reasoning Yes
Xiaomi logo

MiMo-V2-Flash

Xiaomi

MiMo-V2-Flash is an open-source foundation language model developed by Xiaomi.

Context 262K
Speed 123 tok/s
Input Text
Output Text
Reasoning Yes
OpenAI logo

GPT-5.1-Codex-Mini

OpenAI

GPT-5.1-Codex-Mini is a smaller and faster version of GPT-5.1-Codex

Context 400K
Speed 191 tok/s
Input Image, Text
Output Text
Reasoning Yes
Google logo

Gemini 3.1 Pro Preview

Google

Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows.

Context 1.0M
Speed 120 tok/s
Input Audio, File, Image, Text, Video
Output Text
Reasoning Yes
Z.ai logo

GLM 4 32B

Z.ai

GLM 4 32B is a cost-effective foundation language model.

Context 128K
Speed 75 tok/s
Input Text
Output Text
Reasoning No
MoonshotAI logo

Kimi K2 Thinking

MoonshotAI

Kimi K2 Thinking is Moonshot AI’s most advanced open reasoning model to date, extending the K2 series into agentic, long-horizon reasoning.

Context 131K
Speed 100 tok/s
Input Text
Output Text
Reasoning Yes
OpenAI logo

GPT-5.1-Codex-Max

OpenAI

GPT-5.1-Codex-Max is OpenAI’s latest agentic coding model, designed for long-running, high-context software development tasks.

Context 400K
Speed 97 tok/s
Input Text, Image
Output Text
Reasoning Yes

Fast AI Models

Lowest cost + latency options.

Inception logo

Mercury 2

Inception

Mercury 2 is an extremely fast reasoning LLM, and the first reasoning diffusion LLM (dLLM).

Context 128K
Speed 753 tok/s
Input Text
Output Text
Reasoning Yes
Qwen logo

Qwen3.5-9B

Qwen

Qwen3.5-9B is a multimodal foundation model from the Qwen3.5 family, designed to deliver strong reasoning, coding, and visual understanding in an efficient 9B-parameter architecture.

Context 256K
Speed 417 tok/s
Input Text, Image, Video
Output Text
Reasoning Yes
Qwen logo

Qwen3.5-27B

Qwen

The Qwen3.5 27B native vision-language Dense model incorporates a linear attention mechanism, delivering fast response times while balancing inference speed and performance.

Context 262K
Speed 354 tok/s
Input Text, Image, Video
Output Text
Reasoning Yes
Google logo

Gemini 3.1 Flash Lite Preview

Google

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases.

Context 1.0M
Speed 312 tok/s
Input Text, Image, Video, File, Audio
Output Text
Reasoning Yes
Amazon logo

Nova 2 Lite

Amazon

Nova 2 Lite is a fast, cost-effective reasoning model for everyday workloads that can process text, images, and videos to generate text.

Context 1.0M
Speed 273 tok/s
Input Text, Image, Video, File
Output Text
Reasoning Yes
OpenAI logo

gpt-oss-safeguard-20b

OpenAI

gpt-oss-safeguard-20b is a safety reasoning model from OpenAI built upon gpt-oss-20b.

Context 131K
Speed 266 tok/s
Input Text
Output Text
Reasoning Yes
Mistral logo

Ministral 3 3B 2512

Mistral

The smallest model in the Ministral 3 family, Ministral 3 3B is a powerful, efficient tiny language model with vision capabilities.

Context 131K
Speed 263 tok/s
Input Text, Image
Output Text
Reasoning No
xAI logo

Grok 4.20

NEW

xAI

Grok 4.20 is xAI's newest flagship model with industry-leading speed and agentic tool calling capabilities.

Context 2.0M
Speed 248 tok/s
Input Text, Image
Output Text
Reasoning Yes
Mistral logo

Mistral Small 4

Mistral

Mistral Small 4 is the next major release in the Mistral Small family, unifying the capabilities of several flagship Mistral models into a single system.

Context 262K
Speed 204 tok/s
Input Text, Image
Output Text
Reasoning Yes
OpenAI logo

GPT-5.4

OpenAI

GPT-5.4 is OpenAI’s latest frontier model, unifying the Codex and GPT lines into a single system.

Context 1.1M
Speed 200 tok/s
Input Text, Image, File
Output Text
Reasoning Yes

Top Image Generation AI Models

Models that generate images from text prompts.

Google logo

Nano Banana 2 (Gemini 3.1 Flash Image Preview)

Google

Gemini 3.1 Flash Image Preview, a.k.a.

Context 66K
Speed N/A
Input Image, Text
Output Image, Text
Reasoning Yes
ByteDance Seed logo

Seedream 4.5

ByteDance Seed

Seedream 4.5 is the latest in-house image generation model developed by ByteDance.

Context 4K
Speed N/A
Input Image, Text
Output Image
Reasoning No
OpenAI logo

GPT-5 Image

OpenAI

It offers major improvements in reasoning, code quality, and user experience while incorporating GPT Image 1's superior instruction following, text rendering, and detailed image editing.

Context 400K
Speed 127 tok/s
Input Image, Text, File
Output Image, Text
Reasoning Yes
Sourceful logo

Riverflow V2 Pro

Sourceful

Riverflow V2 Pro is the most powerful variant of Sourceful's Riverflow 2.0 lineup, best for top-tier control and perfect text rendering.

Context 8K
Speed N/A
Input Text, Image
Output Image
Reasoning No
Sourceful logo

Riverflow V2 Fast

Sourceful

Riverflow V2 Fast is the fastest variant of Sourceful's Riverflow 2.0 lineup, best for production deployments and latency-critical workflows.

Context 8K
Speed N/A
Input Text, Image
Output Image
Reasoning No
Sourceful logo

Riverflow V2 Max Preview

Sourceful

Riverflow V2 Max Preview is the most powerful variant of Sourceful's Riverflow V2 preview lineup.

Context 8K
Speed N/A
Input Text, Image
Output Image
Reasoning No
Sourceful logo

Riverflow V2 Standard Preview

Sourceful

Riverflow V2 Standard Preview is the standard variant of Sourceful's Riverflow V2 preview lineup.

Context 8K
Speed N/A
Input Text, Image
Output Image
Reasoning No
Black Forest Labs logo

FLUX.2 Klein 4B

Black Forest Labs

FLUX.2 [klein] 4B is the fastest and most cost-effective model in the FLUX.2 family, optimized for high-throughput use cases while maintaining excellent image quality.

Context 41K
Speed N/A
Input Text, Image
Output Image
Reasoning No
Black Forest Labs logo

FLUX.2 Max

Black Forest Labs

FLUX.2 [max] is the new top-tier image model from Black Forest Labs, pushing image quality, prompt understanding, and editing consistency to the highest level yet.

Context 47K
Speed N/A
Input Text, Image
Output Image
Reasoning No
Black Forest Labs logo

FLUX.2 Flex

Black Forest Labs

FLUX.2 [flex] excels at rendering complex text, typography, and fine details, and supports multi-reference editing in the same unified architecture.

Context 67K
Speed N/A
Input Text, Image
Output Image
Reasoning No

Top Audio AI Models

Models with voice and audio output capabilities.

OpenAI logo

GPT Audio Mini

OpenAI

A cost-efficient version of GPT Audio.

Context 128K
Speed 102 tok/s
Input Text, Audio
Output Text, Audio
Reasoning No
OpenAI logo

GPT Audio

OpenAI

The gpt-audio model is OpenAI's first generally available audio model.

Context 128K
Speed N/A
Input Text, Audio
Output Text, Audio
Reasoning No

Large Context Window AI Models

Models with 200K+ context windows.

xAI logo

Grok 4.1 Fast

xAI

Grok 4.1 Fast is xAI's best agentic tool calling model that shines in real-world use cases like customer support and deep research. 2M context window.

Context 2.0M
Speed 198 tok/s
Input Text, Image, File
Output Text
Reasoning Yes
xAI logo

Grok 4.20

NEW

xAI

Grok 4.20 is xAI's newest flagship model with industry-leading speed and agentic tool calling capabilities.

Context 2.0M
Speed 248 tok/s
Input Text, Image
Output Text
Reasoning Yes
xAI logo

Grok 4.20 Multi-Agent

NEW

xAI

Grok 4.20 Multi-Agent is a variant of xAI’s Grok 4.20 designed for collaborative, agent-based workflows.

Context 2.0M
Speed N/A
Input Text, Image, File
Output Text
Reasoning Yes
OpenAI logo

GPT-5.4

OpenAI

GPT-5.4 is OpenAI’s latest frontier model, unifying the Codex and GPT lines into a single system.

Context 1.1M
Speed 200 tok/s
Input Text, Image, File
Output Text
Reasoning Yes
OpenAI logo

GPT-5.4 Pro

OpenAI

GPT-5.4 Pro is OpenAI's most advanced model, building on GPT-5.4's unified architecture with enhanced reasoning capabilities for complex, high-stakes tasks.

Context 1.1M
Speed N/A
Input Text, Image, File
Output Text
Reasoning Yes
Xiaomi logo

MiMo-V2-Pro

Xiaomi

MiMo-V2-Pro is Xiaomi's flagship foundation model, featuring over 1T total parameters and a 1M context length, deeply optimized for agentic scenarios.

Context 1.0M
Speed 123 tok/s
Input Text
Output Text
Reasoning Yes
Google logo

Gemini 3.1 Pro Preview

Google

Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows.

Context 1.0M
Speed 120 tok/s
Input Audio, File, Image, Text, Video
Output Text
Reasoning Yes
Google logo

Gemini 3.1 Flash Lite Preview

Google

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases.

Context 1.0M
Speed 312 tok/s
Input Text, Image, Video, File, Audio
Output Text
Reasoning Yes
Google logo

Gemini 3.1 Pro Preview Custom Tools

Google

Gemini 3.1 Pro Preview Custom Tools is a variant of Gemini 3.1 Pro that improves tool selection behavior by preventing overuse of a general bash tool when more efficient third-party or user-defined functions are available.

Context 1.0M
Speed N/A
Input Text, Audio, Image, Video, File
Output Text
Reasoning Yes
Writer logo

Palmyra X5

Writer

Palmyra X5 is Writer's most advanced model, purpose-built for building and scaling AI agents across the enterprise.

Context 1.0M
Speed N/A
Input Text
Output Text
Reasoning No

Top Uncensored AI Models

Lightly filtered, high-flexibility models.

Newest AI Models

Fresh releases from OpenRouter.

Google logo

Gemma 4 31B

NEW

Google

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output.

Context 262K
Speed 47 tok/s
Input Image, Text, Video
Output Text
Reasoning Yes
Z.ai logo

GLM 5V Turbo

NEW

Z.ai

GLM-5V-Turbo is Z.ai’s first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks.

Context 203K
Speed N/A
Input Image, Text, Video
Output Text
Reasoning Yes
Arcee AI logo

Trinity Large Thinking

NEW

Arcee AI

Trinity Large Thinking is a powerful open source reasoning model from the team at Arcee AI.

Context 262K
Speed N/A
Input Text
Output Text
Reasoning Yes
xAI logo

Grok 4.20 Multi-Agent

NEW

xAI

Grok 4.20 Multi-Agent is a variant of xAI’s Grok 4.20 designed for collaborative, agent-based workflows.

Context 2.0M
Speed N/A
Input Text, Image, File
Output Text
Reasoning Yes
xAI logo

Grok 4.20

NEW

xAI

Grok 4.20 is xAI's newest flagship model with industry-leading speed and agentic tool calling capabilities.

Context 2.0M
Speed 248 tok/s
Input Text, Image
Output Text
Reasoning Yes
Kwaipilot logo

KAT-Coder-Pro V2

NEW

Kwaipilot

KAT-Coder-Pro V2 is the latest high-performance model in KwaiKAT’s KAT-Coder series, designed for complex enterprise-grade software engineering and SaaS integration.

Context 256K
Speed N/A
Input Text
Output Text
Reasoning No
rekaai logo

Reka Edge

NEW

rekaai

Reka Edge is an extremely efficient 7B multimodal vision-language model that accepts image/video+text inputs and generates text outputs.

Context 16K
Speed N/A
Input Image, Text, Video
Output Text
Reasoning Yes
Xiaomi logo

MiMo-V2-Omni

Xiaomi

MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture.

Context 262K
Speed N/A
Input Text, Audio, Image, Video
Output Text
Reasoning Yes
Xiaomi logo

MiMo-V2-Pro

Xiaomi

MiMo-V2-Pro is Xiaomi's flagship foundation model, featuring over 1T total parameters and a 1M context length, deeply optimized for agentic scenarios.

Context 1.0M
Speed 123 tok/s
Input Text
Output Text
Reasoning Yes
MiniMax logo

MiniMax M2.7

MiniMax

MiniMax-M2.7 is a next-generation large language model designed for autonomous, real-world productivity and continuous improvement.

Context 205K
Speed 40 tok/s
Input Text
Output Text
Reasoning Yes
EU Made in Europe

Chat with 100+ AI Models in one App.

Use Claude, ChatGPT, Gemini alongside with EU-Hosted Models like Deepseek, GLM-5, Kimi K2.5 and many more.

How to Choose the Right AI Model

A practical guide to picking the best LLM for your use case.

Match the model to the task

General-purpose models like GPT-4o and Claude Sonnet handle most tasks well. For specialized work, coding models (DeepSeek Coder, Codestral) and math models (QwQ, DeepSeek R1) outperform generalists on their respective benchmarks while often costing less per token.

Consider context window size

If you work with long documents, codebases, or multi-turn conversations, context window matters. Models range from 8K to over 1M tokens. Larger windows let you process entire books or repositories in a single prompt, but they increase cost and latency.

Balance cost, speed, and quality

Frontier models deliver the highest benchmark scores but cost more per token and respond slower. Fast models like Gemini Flash, Llama 3 (8B), and Mistral Small can handle routine tasks at a fraction of the cost with sub-second latency - ideal for high-volume applications.

Open source vs. proprietary

Open-source models (Llama, Mistral, Qwen, DeepSeek) let you self-host, fine-tune, and inspect weights. Proprietary models (GPT-4o, Claude, Gemini) often lead on benchmarks and offer managed APIs with built-in safety features. Many teams use both: proprietary for peak performance, open source for cost control and customization.

Check for multimodal capabilities

Some models accept images, audio, or files alongside text. If your workflow involves analyzing screenshots, diagrams, or audio transcriptions, filter for models with vision or audio input support. Models with structured output and function calling are essential for building agents and tool-using applications.

Use benchmarks as a starting point

Scores like GPQA, MMLU Pro, and HLE measure academic knowledge and reasoning. LiveCodeBench and SciCode test practical coding ability. MATH 500 and AIME evaluate mathematical problem-solving. No single benchmark tells the full story - compare scores across categories relevant to your use case, then test with your own prompts.

Data on this page is sourced from OpenRouter and Artificial Analysis. Pricing, speed, and benchmark scores are updated regularly. Try any model instantly using the free chat - no API key required.