Explore AI Models

Browse our extensive collection of state-of-the-art AI models. Compare capabilities, pricing, and find the perfect fit for your needs.

Google

9 Models

Gemini 3.1 Flash Lite Preview

google/gemini-3.1-flash-lite-preview

Gemini 3.1 Flash Lite Preview is Google’s efficient high-volume model optimized for speed and cost. It delivers strong performance in translation, data extraction, and code completion while supporting adjustable reasoning levels for flexible workloads.

ReasoningTools

Context Window

Gemini 3.1 Pro Preview

google/gemini-3.1-pro-preview

Frontier reasoning model for advanced agentic systems and complex coding. Delivers stronger engineering performance, better autonomous execution, improved token efficiency, and long-context stability, with a 1M-token context and full multimodal support (text, image, video, audio, code).

ReasoningTools

Context Window

Gemini 3 Flash Preview

google/gemini-3-flash-preview

High-speed, high-value thinking model designed for agentic workflows and coding. Delivers near-Pro level reasoning with low latency, supporting a 1M token context and multimodal inputs (audio, video, images).

ReasoningTools

Context Window

Gemini 3 Pro Preview

google/gemini-3-pro-preview

Flagship frontier model for high-precision multimodal reasoning. Combines strong performance across text, code, and media with a 1M-token context window. Excels at complex agentic tasks.

ReasoningTools

Context Window

Gemini 2.5 Flash Lite

google/gemini-2.5-flash-lite

Lightweight reasoning model optimized for ultra-low latency and cost efficiency. Offers improved throughput and speed for common tasks compared to earlier Flash models.

ReasoningTools

Context Window

Gemini 2.5 Flash

google/gemini-2.5-flash

Workhorse model designed for advanced reasoning, coding, and scientific tasks. Includes built-in thinking capabilities for accuracy and nuanced context handling.

ReasoningTools

Context Window

Gemini 2.5 Pro

google/gemini-2.5-pro

State-of-the-art model for advanced reasoning, coding, and math. Achieves top-tier performance on benchmarks with enhanced problem-solving abilities.

ReasoningTools

Context Window

Gemini 2.0 Flash Lite

google/gemini-2.0-flash-lite-001

Economical model offering significantly faster time-to-first-token while maintaining quality comparable to larger models like Gemini Pro 1.5.

Tools

Context Window

Gemini 2.0 Flash

google/gemini-2.0-flash-001

Fast and efficient model with enhanced multimodal understanding and coding capabilities. Delivers robust agentic experiences with improved instruction following.

Tools

Context Window

OpenAI

21 Models

GPT-5.4 Pro

openai/gpt-5.4-pro

GPT-5.4 Pro is OpenAI’s most advanced reasoning model, optimized for complex and high-stakes tasks. With a massive long-context window and strong instruction following, it excels at agentic coding, deep analysis, and multi-step problem solving.

ReasoningToolsWeb Search

Context Window

GPT-5.4

openai/gpt-5.4

GPT-5.4 is OpenAI’s latest frontier model, unifying GPT and Codex into a single system. Built for strong reasoning, coding, and long-context workflows, it performs reliably across general tasks, document analysis, and software engineering.

ReasoningToolsWeb Search

Context Window

GPT-5.2-Codex

openai/gpt-5.2-codex

Optimized for software engineering and long, complex coding tasks. Supports project building, debugging, and refactoring with dynamic reasoning effort.

ReasoningToolsWeb Search

Context Window

GPT-5.2 Chat

openai/gpt-5.2-chat

Fast, lightweight model optimized for low-latency chat. Uses adaptive reasoning for harder queries while maintaining a warm, conversational style.

ToolsWeb Search

Context Window

GPT-5.2 Pro

openai/gpt-5.2-pro

Advanced model with major improvements in agentic coding and long-context performance. Optimized for complex tasks requiring step-by-step reasoning and high accuracy.

ReasoningToolsWeb Search

Context Window

GPT-5.2

openai/gpt-5.2

Frontier-grade model with strong agentic and long-context performance. Delivers consistent gains across math, coding, and science with adaptive reasoning.

ReasoningToolsWeb Search

Context Window

GPT-5.1

openai/gpt-5.1

Strong general-purpose reasoning model with improved instruction adherence. Features adaptive reasoning and a natural conversational style.

ReasoningToolsWeb Search

Context Window

GPT-5.1 Chat

openai/gpt-5.1-chat

Lightweight, low-latency model for high-throughput chat. Balances speed with strong general intelligence and adaptive reasoning.

ToolsWeb Search

Context Window

GPT-5.1-Codex

openai/gpt-5.1-codex

Specialized for coding workflows. Supports interactive development and long execution tasks like refactoring and code review.

ReasoningTools

Context Window

GPT-5.1-Codex-Mini

openai/gpt-5.1-codex-mini

Smaller, faster version of GPT-5.1-Codex, optimized for efficiency in coding tasks.

ReasoningTools

Context Window

GPT-5 Chat

openai/gpt-5-chat

Designed for advanced, natural, and context-aware conversations. specialized for enterprise applications with multimodal support.

Web Search

Context Window

GPT-5

openai/gpt-5

Advanced model offering improvements in reasoning and code quality. Optimized for complex tasks with step-by-step reasoning.

ReasoningToolsWeb Search

Context Window

GPT-5 Mini

openai/gpt-5-mini

Compact version of GPT-5 for lighter-weight reasoning tasks. Provides instruction-following benefits with reduced latency and cost.

ReasoningToolsWeb Search

Context Window

GPT-5 Nano

openai/gpt-5-nano

Smallest and fastest GPT-5 variant. Optimized for developer tools and rapid interactions requiring ultra-low latency.

ReasoningToolsWeb Search

Context Window

gpt-oss-120b

openai/gpt-oss-120b

Open-weight 117B MoE model designed for high-reasoning and agentic use cases. Supports configurable reasoning depth and native tool use.

ReasoningTools

Context Window

gpt-oss-20b

openai/gpt-oss-20b

Open-weight 21B MoE model optimized for low-latency inference. Supports reasoning configuration and agentic capabilities like function calling.

ReasoningTools

Context Window

GPT-4.1

openai/gpt-4.1

Flagship model optimized for advanced instruction following and software engineering. Features a 1M token context and outperforms GPT-4o.

ToolsWeb Search

Context Window

GPT-4.1 Mini

openai/gpt-4.1-mini

Mid-sized model competitive with GPT-4o at lower cost. Strong coding and vision capabilities with a 1M token context.

ToolsWeb Search

Context Window

GPT-4.1 Nano

openai/gpt-4.1-nano

Fastest and cheapest GPT-4.1 model. Exceptional performance for its size, ideal for classification and autocompletion with 1M context.

ToolsWeb Search

Context Window

GPT-4o-mini

openai/gpt-4o-mini

Advanced small model. More affordable than previous frontier models while maintaining SOTA intelligence and multimodal capabilities.

Tools

Context Window

GPT-4o

openai/gpt-4o

Omni model supporting text and image inputs. Twice as fast and 50% cheaper than GPT-4 Turbo with improved multilingual and visual performance.

Tools

Context Window

Anthropic

7 Models

Claude Sonnet 4.6

anthropic/claude-sonnet-4.6

Most capable Sonnet model with frontier performance in coding and agents. Excels at complex codebase navigation and project management.

ReasoningToolsWeb Search

Context Window

Claude Opus 4.6

anthropic/claude-opus-4.6

Strongest model for coding and long-running professional tasks. Designed for agents operating across entire workflows with deep contextual understanding.

ReasoningToolsWeb Search

Context Window

Claude Opus 4.5

anthropic/claude-opus-4.5

Frontier reasoning model optimized for complex software engineering and agentic workflows. Features strong multimodal capabilities.

ReasoningToolsWeb Search

Context Window

Claude Haiku 4.5

anthropic/claude-haiku-4.5

Fastest and most efficient Claude model. Delivers near-frontier intelligence with extended thinking for controllable reasoning depth.

ReasoningToolsWeb Search

Context Window

Claude Sonnet 4.5

anthropic/claude-sonnet-4.5

Optimized for real-world agents and coding workflows. State-of-the-art coding performance designed for extended autonomous operation.

ReasoningToolsWeb Search

Context Window

Claude Sonnet 4

anthropic/claude-sonnet-4

Enhances Sonnet 3.7 capabilities. Excels in coding and reasoning with improved precision and controllability.

ReasoningToolsWeb Search

Context Window

Claude 3.7 Sonnet

anthropic/claude-3.7-sonnet

Advanced model with improved reasoning and coding. Introduces a hybrid reasoning approach for choosing between rapid responses and extended processing.

ReasoningToolsWeb Search

Context Window

DeepSeek

4 Models

DeepSeek V3.2

deepseek/deepseek-v3.2

High-efficiency model with strong reasoning and agentic tool-use. Uses sparse attention for long-context scenarios.

ReasoningTools

Context Window

DeepSeek V3.1

deepseek/deepseek-chat-v3.1

Hybrid reasoning model supporting thinking and non-thinking modes. Comparable to DeepSeek-R1 on difficult benchmarks but faster.

ReasoningTools

Context Window

R1 0528

deepseek/deepseek-r1-0528

High-performance open-weight reasoning model optimized for math, coding, and structured problem-solving. Delivers strong chain-of-thought reasoning, competitive benchmark results, and efficient inference, making it well-suited for advanced agents, technical analysis, and logic-heavy workflows.

ReasoningTools

Context Window

DeepSeek V3 0324

deepseek/deepseek-chat-v3-0324

685B Mixture-of-Experts model. Strong performance across a variety of tasks, serving as a predecessor to V3.1.

ReasoningTools

Context Window

Mistral

3 Models

Mistral Small 3.2 24B

mistralai/mistral-small-3.2-24b-instruct

24B model optimized for instruction following and function calling. Significant improvements in coding and STEM benchmarks.

Tools

Context Window

Mistral Small 3

mistralai/mistral-small-24b-instruct-2501

Mistral Small 3. Efficient 24B model for low-latency performance. Competitive with larger models while being faster.

Tools

Context Window

Mistral Nemo

mistralai/mistral-nemo

12B parameter model with 128k context. Multilingual support and function calling capabilities.

Tools

Context Window

Qwen

5 Models

Qwen3.5-Flash

qwen/qwen3.5-flash-02-23

High-speed native vision-language model built with hybrid linear attention and sparse MoE architecture for efficient inference. Delivers major performance gains over the 3 series across text and multimodal tasks, balancing fast responses with strong overall capability.

ReasoningTools

Context Window

Qwen3 Coder 480B A35B

qwen/qwen3-coder

MoE code generation model optimized for agentic coding. Features function calling and long-context reasoning over repositories.

Tools

Context Window

Qwen3 235B A22B Instruct 2507

qwen/qwen3-235b-a22b-2507

Multilingual MoE model optimized for general text, math, and code. Delivers significant gains in long-context reasoning.

ReasoningTools

Context Window

Qwen3 32B

qwen/qwen3-32b

Dense 32B model optimized for reasoning and dialogue. Supports seamless switching between thinking and non-thinking modes.

ReasoningTools

Context Window

Qwen2.5 7B Instruct

qwen/qwen-2.5-7b-instruct

Improved 7B model with better coding and math capabilities. Supports long context up to 128k tokens and multilingual output.

Tools

Context Window

Perplexity

5 Models

Sonar Pro Search

perplexity/sonar-pro-search

Advanced agentic search system. Designed for deeper reasoning and analysis, planning and executing entire research workflows.

ReasoningWeb Search

Context Window

Sonar Reasoning Pro

perplexity/sonar-reasoning-pro

Premier reasoning model powered by DeepSeek R1. Supports in-depth, multi-step queries with larger context and more citations.

ReasoningWeb Search

Context Window

Sonar Pro

perplexity/sonar-pro

Enterprise-grade model for in-depth, multi-step queries. Handles longer and more nuanced searches with added extensibility.

Web Search

Context Window

Sonar Deep Research

perplexity/sonar-deep-research

Research-focused model for multi-step retrieval and synthesis. Autonomously searches and evaluates sources for comprehensive reports.

ReasoningWeb Search

Context Window

Sonar

perplexity/sonar

Lightweight, fast, and affordable model. Optimized for speed and simple question-and-answer features with citations.

Web Search

Context Window

xAi

5 Models

xAI: Grok 4.1 Fast

x-ai/grok-4.1-fast

Agentic tool calling model. Shines in real-world use cases like customer support and deep research with a 2M context window.

ReasoningToolsWeb Search

Context Window

xAI: Grok 4 Fast

x-ai/grok-4-fast

Multimodal model with SOTA cost-efficiency. Features a 2M token context window and comes in reasoning and non-reasoning flavors.

ReasoningToolsWeb Search

Context Window

xAI: Grok Code Fast 1

x-ai/grok-code-fast-1

Grok Code Fast 1 is a speedy and economical reasoning model that excels at agentic coding. With reasoning traces visible in the response, developers can steer Grok Code for high-quality work flows.

ReasoningToolsWeb Search

Context Window

xAI: Grok 4

x-ai/grok-4

Advanced multimodal reasoning model with a 256k token context window. Supports parallel tool calling, structured outputs, and both text and image inputs. Reasoning is always enabled and cannot be disabled or adjusted. Pricing increases for requests exceeding 128k total tokens, making context management important for cost control.

ReasoningToolsWeb Search

Context Window

xAI: Grok 3 Mini

x-ai/grok-3-mini

A lightweight model that thinks before responding. Fast, smart, and great for logic-based tasks that do not require deep domain knowledge. The raw thinking traces are accessible.

ReasoningToolsWeb Search

Context Window

MoonshotAI

3 Models

Kimi K2.5

moonshotai/kimi-k2.5

Native multimodal model with state-of-the-art visual coding capabilities. Delivers strong performance in general reasoning.

ReasoningTools

Context Window

Kimi K2 Thinking

moonshotai/kimi-k2-thinking

Advanced open reasoning model optimized for persistent step-by-step thought. Supports complex reasoning workflows spanning hundreds of turns.

ReasoningTools

Context Window

Kimi K2 0905

moonshotai/kimi-k2-0905

Large-scale MoE model with 256k context. Improved agentic coding and frontend coding with better generalization.

Tools

Context Window

Others

8 Models

Inception: Mercury 2

inception/mercury-2

Mercury 2 is an ultra-fast reasoning diffusion LLM designed for latency-sensitive workloads. By generating and refining tokens in parallel, it delivers extremely high throughput for coding agents, real-time search, and tool-driven workflows.

ReasoningTools

Context Window

Z.ai: GLM 5

z-ai/glm-5

Flagship open-source foundation model built for complex system design and long-horizon agent workflows. Delivers production-grade performance on large programming tasks, with advanced planning, deep backend reasoning, and iterative self-correction—enabling full system construction and autonomous execution.

ReasoningTools

Context Window

StepFun: Step 3.5 Flash (free)

stepfun/step-3.5-flash:free

Speed-efficient reasoning model built on sparse MoE architecture. Capable of handling long contexts effectively.

ReasoningTools

Context Window

Arcee AI: Trinity Large Preview (free)

arcee-ai/trinity-large-preview:free

Frontier-scale open-weight 400B MoE model (13B active per token) built for creative writing, role-play, chat, and real-time voice. Supports agentic workflows, complex toolchains, and long constraint-heavy prompts, with up to 512k context (128k in Preview). Designed for efficient, production-ready deployment with permissive licensing.

ToolsWeb Search

Context Window

Upstage: Solar Pro 3

upstage/solar-pro-3

Powerful MoE model delivering exceptional performance. Optimized for Korean, English, and Japanese with high efficiency.

ReasoningTools

Context Window

LiquidAI: LFM2.5-1.2B-Thinking (free)

liquid/lfm-2.5-1.2b-thinking:free

Lightweight reasoning-focused model optimized for agentic tasks and RAG. Runs comfortably on edge devices with 32K context.

Reasoning

Context Window

LiquidAI: LFM2.5-1.2B-Instruct (free)

liquid/lfm-2.5-1.2b-instruct:free

Compact, high-performance instruction-tuned model. Delivers strong chat quality and efficient edge inference.

Context Window

Z.ai: GLM 4.5 Air (free)

z-ai/glm-4.5-air:free

Lightweight agent-centric model. Supports hybrid inference modes (thinking vs non-thinking) for versatile interactions.

ReasoningTools

Context Window

Explore AI Models

Google

OpenAI

Anthropic

DeepSeek

Meta

Mistral

Qwen

Perplexity

xAi

MoonshotAI

Others