Xylok

Explore AI Models

Browse our extensive collection of state-of-the-art AI models. Compare capabilities, pricing, and find the perfect fit for your needs.

Google

9 Models
Gemini 3.1 Flash Lite Preview
google/gemini-3.1-flash-lite-preview

Gemini 3.1 Flash Lite Preview is Google’s efficient high-volume model optimized for speed and cost. It delivers strong performance in translation, data extraction, and code completion while supporting adjustable reasoning levels for flexible workloads.

ReasoningTools
Context Window
Gemini 3.1 Pro Preview
google/gemini-3.1-pro-preview

Frontier reasoning model for advanced agentic systems and complex coding. Delivers stronger engineering performance, better autonomous execution, improved token efficiency, and long-context stability, with a 1M-token context and full multimodal support (text, image, video, audio, code).

ReasoningTools
Context Window
Gemini 3 Flash Preview
google/gemini-3-flash-preview

High-speed, high-value thinking model designed for agentic workflows and coding. Delivers near-Pro level reasoning with low latency, supporting a 1M token context and multimodal inputs (audio, video, images).

ReasoningTools
Context Window
Gemini 3 Pro Preview
google/gemini-3-pro-preview

Flagship frontier model for high-precision multimodal reasoning. Combines strong performance across text, code, and media with a 1M-token context window. Excels at complex agentic tasks.

ReasoningTools
Context Window
Gemini 2.5 Flash Lite
google/gemini-2.5-flash-lite

Lightweight reasoning model optimized for ultra-low latency and cost efficiency. Offers improved throughput and speed for common tasks compared to earlier Flash models.

ReasoningTools
Context Window
Gemini 2.5 Flash
google/gemini-2.5-flash

Workhorse model designed for advanced reasoning, coding, and scientific tasks. Includes built-in thinking capabilities for accuracy and nuanced context handling.

ReasoningTools
Context Window
Gemini 2.5 Pro
google/gemini-2.5-pro

State-of-the-art model for advanced reasoning, coding, and math. Achieves top-tier performance on benchmarks with enhanced problem-solving abilities.

ReasoningTools
Context Window
Gemini 2.0 Flash Lite
google/gemini-2.0-flash-lite-001

Economical model offering significantly faster time-to-first-token while maintaining quality comparable to larger models like Gemini Pro 1.5.

Tools
Context Window
Gemini 2.0 Flash
google/gemini-2.0-flash-001

Fast and efficient model with enhanced multimodal understanding and coding capabilities. Delivers robust agentic experiences with improved instruction following.

Tools
Context Window

OpenAI

21 Models
GPT-5.4 Pro
openai/gpt-5.4-pro

GPT-5.4 Pro is OpenAI’s most advanced reasoning model, optimized for complex and high-stakes tasks. With a massive long-context window and strong instruction following, it excels at agentic coding, deep analysis, and multi-step problem solving.

ReasoningToolsWeb Search
Context Window
GPT-5.4
openai/gpt-5.4

GPT-5.4 is OpenAI’s latest frontier model, unifying GPT and Codex into a single system. Built for strong reasoning, coding, and long-context workflows, it performs reliably across general tasks, document analysis, and software engineering.

ReasoningToolsWeb Search
Context Window
GPT-5.2-Codex
openai/gpt-5.2-codex

Optimized for software engineering and long, complex coding tasks. Supports project building, debugging, and refactoring with dynamic reasoning effort.

ReasoningToolsWeb Search
Context Window
GPT-5.2 Chat
openai/gpt-5.2-chat

Fast, lightweight model optimized for low-latency chat. Uses adaptive reasoning for harder queries while maintaining a warm, conversational style.

ToolsWeb Search
Context Window
GPT-5.2 Pro
openai/gpt-5.2-pro

Advanced model with major improvements in agentic coding and long-context performance. Optimized for complex tasks requiring step-by-step reasoning and high accuracy.

ReasoningToolsWeb Search
Context Window
GPT-5.2
openai/gpt-5.2

Frontier-grade model with strong agentic and long-context performance. Delivers consistent gains across math, coding, and science with adaptive reasoning.

ReasoningToolsWeb Search
Context Window
GPT-5.1
openai/gpt-5.1

Strong general-purpose reasoning model with improved instruction adherence. Features adaptive reasoning and a natural conversational style.

ReasoningToolsWeb Search
Context Window
GPT-5.1 Chat
openai/gpt-5.1-chat

Lightweight, low-latency model for high-throughput chat. Balances speed with strong general intelligence and adaptive reasoning.

ToolsWeb Search
Context Window
GPT-5.1-Codex
openai/gpt-5.1-codex

Specialized for coding workflows. Supports interactive development and long execution tasks like refactoring and code review.

ReasoningTools
Context Window
GPT-5.1-Codex-Mini
openai/gpt-5.1-codex-mini

Smaller, faster version of GPT-5.1-Codex, optimized for efficiency in coding tasks.

ReasoningTools
Context Window
GPT-5 Chat
openai/gpt-5-chat

Designed for advanced, natural, and context-aware conversations. specialized for enterprise applications with multimodal support.

Web Search
Context Window
GPT-5
openai/gpt-5

Advanced model offering improvements in reasoning and code quality. Optimized for complex tasks with step-by-step reasoning.

ReasoningToolsWeb Search
Context Window
GPT-5 Mini
openai/gpt-5-mini

Compact version of GPT-5 for lighter-weight reasoning tasks. Provides instruction-following benefits with reduced latency and cost.

ReasoningToolsWeb Search
Context Window
GPT-5 Nano
openai/gpt-5-nano

Smallest and fastest GPT-5 variant. Optimized for developer tools and rapid interactions requiring ultra-low latency.

ReasoningToolsWeb Search
Context Window
gpt-oss-120b
openai/gpt-oss-120b

Open-weight 117B MoE model designed for high-reasoning and agentic use cases. Supports configurable reasoning depth and native tool use.

ReasoningTools
Context Window
gpt-oss-20b
openai/gpt-oss-20b

Open-weight 21B MoE model optimized for low-latency inference. Supports reasoning configuration and agentic capabilities like function calling.

ReasoningTools
Context Window
GPT-4.1
openai/gpt-4.1

Flagship model optimized for advanced instruction following and software engineering. Features a 1M token context and outperforms GPT-4o.

ToolsWeb Search
Context Window
GPT-4.1 Mini
openai/gpt-4.1-mini

Mid-sized model competitive with GPT-4o at lower cost. Strong coding and vision capabilities with a 1M token context.

ToolsWeb Search
Context Window
GPT-4.1 Nano
openai/gpt-4.1-nano

Fastest and cheapest GPT-4.1 model. Exceptional performance for its size, ideal for classification and autocompletion with 1M context.

ToolsWeb Search
Context Window
GPT-4o-mini
openai/gpt-4o-mini

Advanced small model. More affordable than previous frontier models while maintaining SOTA intelligence and multimodal capabilities.

Tools
Context Window
GPT-4o
openai/gpt-4o

Omni model supporting text and image inputs. Twice as fast and 50% cheaper than GPT-4 Turbo with improved multilingual and visual performance.

Tools
Context Window

Anthropic

7 Models
Claude Sonnet 4.6
anthropic/claude-sonnet-4.6

Most capable Sonnet model with frontier performance in coding and agents. Excels at complex codebase navigation and project management.

ReasoningToolsWeb Search
Context Window
Claude Opus 4.6
anthropic/claude-opus-4.6

Strongest model for coding and long-running professional tasks. Designed for agents operating across entire workflows with deep contextual understanding.

ReasoningToolsWeb Search
Context Window
Claude Opus 4.5
anthropic/claude-opus-4.5

Frontier reasoning model optimized for complex software engineering and agentic workflows. Features strong multimodal capabilities.

ReasoningToolsWeb Search
Context Window
Claude Haiku 4.5
anthropic/claude-haiku-4.5

Fastest and most efficient Claude model. Delivers near-frontier intelligence with extended thinking for controllable reasoning depth.

ReasoningToolsWeb Search
Context Window
Claude Sonnet 4.5
anthropic/claude-sonnet-4.5

Optimized for real-world agents and coding workflows. State-of-the-art coding performance designed for extended autonomous operation.

ReasoningToolsWeb Search
Context Window
Claude Sonnet 4
anthropic/claude-sonnet-4

Enhances Sonnet 3.7 capabilities. Excels in coding and reasoning with improved precision and controllability.

ReasoningToolsWeb Search
Context Window
Claude 3.7 Sonnet
anthropic/claude-3.7-sonnet

Advanced model with improved reasoning and coding. Introduces a hybrid reasoning approach for choosing between rapid responses and extended processing.

ReasoningToolsWeb Search
Context Window

DeepSeek

4 Models
DeepSeek V3.2
deepseek/deepseek-v3.2

High-efficiency model with strong reasoning and agentic tool-use. Uses sparse attention for long-context scenarios.

ReasoningTools
Context Window
DeepSeek V3.1
deepseek/deepseek-chat-v3.1

Hybrid reasoning model supporting thinking and non-thinking modes. Comparable to DeepSeek-R1 on difficult benchmarks but faster.

ReasoningTools
Context Window
R1 0528
deepseek/deepseek-r1-0528

High-performance open-weight reasoning model optimized for math, coding, and structured problem-solving. Delivers strong chain-of-thought reasoning, competitive benchmark results, and efficient inference, making it well-suited for advanced agents, technical analysis, and logic-heavy workflows.

ReasoningTools
Context Window
DeepSeek V3 0324
deepseek/deepseek-chat-v3-0324

685B Mixture-of-Experts model. Strong performance across a variety of tasks, serving as a predecessor to V3.1.

ReasoningTools
Context Window

Meta

3 Models
Llama 3.3 70B Instruct
meta-llama/llama-3.3-70b-instruct

70B instruction-tuned model optimized for multilingual dialogue. Outperforms many open and closed models on industry benchmarks.

Tools
Context Window
Llama 3.1 8B Instruct
meta-llama/llama-3.1-8b-instruct

Fast and efficient 8B instruct-tuned model. Strong performance for its size compared to leading closed-source models.

Tools
Context Window
Llama 3.1 70B Instruct
meta-llama/llama-3.1-70b-instruct

70B instruct-tuned model optimized for high-quality dialogue. Demonstrated strong performance in human evaluations.

Tools
Context Window

Mistral

3 Models
Mistral Small 3.2 24B
mistralai/mistral-small-3.2-24b-instruct

24B model optimized for instruction following and function calling. Significant improvements in coding and STEM benchmarks.

Tools
Context Window
Mistral Small 3
mistralai/mistral-small-24b-instruct-2501

Mistral Small 3. Efficient 24B model for low-latency performance. Competitive with larger models while being faster.

Tools
Context Window
Mistral Nemo
mistralai/mistral-nemo

12B parameter model with 128k context. Multilingual support and function calling capabilities.

Tools
Context Window

Qwen

5 Models
Qwen3.5-Flash
qwen/qwen3.5-flash-02-23

High-speed native vision-language model built with hybrid linear attention and sparse MoE architecture for efficient inference. Delivers major performance gains over the 3 series across text and multimodal tasks, balancing fast responses with strong overall capability.

ReasoningTools
Context Window
Qwen3 Coder 480B A35B
qwen/qwen3-coder

MoE code generation model optimized for agentic coding. Features function calling and long-context reasoning over repositories.

Tools
Context Window
Qwen3 235B A22B Instruct 2507
qwen/qwen3-235b-a22b-2507

Multilingual MoE model optimized for general text, math, and code. Delivers significant gains in long-context reasoning.

ReasoningTools
Context Window
Qwen3 32B
qwen/qwen3-32b

Dense 32B model optimized for reasoning and dialogue. Supports seamless switching between thinking and non-thinking modes.

ReasoningTools
Context Window
Qwen2.5 7B Instruct
qwen/qwen-2.5-7b-instruct

Improved 7B model with better coding and math capabilities. Supports long context up to 128k tokens and multilingual output.

Tools
Context Window

Perplexity

5 Models
Sonar Pro Search
perplexity/sonar-pro-search

Advanced agentic search system. Designed for deeper reasoning and analysis, planning and executing entire research workflows.

ReasoningWeb Search
Context Window
Sonar Reasoning Pro
perplexity/sonar-reasoning-pro

Premier reasoning model powered by DeepSeek R1. Supports in-depth, multi-step queries with larger context and more citations.

ReasoningWeb Search
Context Window
Sonar Pro
perplexity/sonar-pro

Enterprise-grade model for in-depth, multi-step queries. Handles longer and more nuanced searches with added extensibility.

Web Search
Context Window
Sonar Deep Research
perplexity/sonar-deep-research

Research-focused model for multi-step retrieval and synthesis. Autonomously searches and evaluates sources for comprehensive reports.

ReasoningWeb Search
Context Window
Sonar
perplexity/sonar

Lightweight, fast, and affordable model. Optimized for speed and simple question-and-answer features with citations.

Web Search
Context Window

xAi

5 Models
xAI: Grok 4.1 Fast
x-ai/grok-4.1-fast

Agentic tool calling model. Shines in real-world use cases like customer support and deep research with a 2M context window.

ReasoningToolsWeb Search
Context Window
xAI: Grok 4 Fast
x-ai/grok-4-fast

Multimodal model with SOTA cost-efficiency. Features a 2M token context window and comes in reasoning and non-reasoning flavors.

ReasoningToolsWeb Search
Context Window
xAI: Grok Code Fast 1
x-ai/grok-code-fast-1

Grok Code Fast 1 is a speedy and economical reasoning model that excels at agentic coding. With reasoning traces visible in the response, developers can steer Grok Code for high-quality work flows.

ReasoningToolsWeb Search
Context Window
xAI: Grok 4
x-ai/grok-4

Advanced multimodal reasoning model with a 256k token context window. Supports parallel tool calling, structured outputs, and both text and image inputs. Reasoning is always enabled and cannot be disabled or adjusted. Pricing increases for requests exceeding 128k total tokens, making context management important for cost control.

ReasoningToolsWeb Search
Context Window
xAI: Grok 3 Mini
x-ai/grok-3-mini

A lightweight model that thinks before responding. Fast, smart, and great for logic-based tasks that do not require deep domain knowledge. The raw thinking traces are accessible.

ReasoningToolsWeb Search
Context Window

MoonshotAI

3 Models
Kimi K2.5
moonshotai/kimi-k2.5

Native multimodal model with state-of-the-art visual coding capabilities. Delivers strong performance in general reasoning.

ReasoningTools
Context Window
Kimi K2 Thinking
moonshotai/kimi-k2-thinking

Advanced open reasoning model optimized for persistent step-by-step thought. Supports complex reasoning workflows spanning hundreds of turns.

ReasoningTools
Context Window
Kimi K2 0905
moonshotai/kimi-k2-0905

Large-scale MoE model with 256k context. Improved agentic coding and frontend coding with better generalization.

Tools
Context Window

Others

8 Models
Inception: Mercury 2
inception/mercury-2

Mercury 2 is an ultra-fast reasoning diffusion LLM designed for latency-sensitive workloads. By generating and refining tokens in parallel, it delivers extremely high throughput for coding agents, real-time search, and tool-driven workflows.

ReasoningTools
Context Window
Z.ai: GLM 5
z-ai/glm-5

Flagship open-source foundation model built for complex system design and long-horizon agent workflows. Delivers production-grade performance on large programming tasks, with advanced planning, deep backend reasoning, and iterative self-correction—enabling full system construction and autonomous execution.

ReasoningTools
Context Window
StepFun: Step 3.5 Flash (free)
stepfun/step-3.5-flash:free

Speed-efficient reasoning model built on sparse MoE architecture. Capable of handling long contexts effectively.

ReasoningTools
Context Window
Arcee AI: Trinity Large Preview (free)
arcee-ai/trinity-large-preview:free

Frontier-scale open-weight 400B MoE model (13B active per token) built for creative writing, role-play, chat, and real-time voice. Supports agentic workflows, complex toolchains, and long constraint-heavy prompts, with up to 512k context (128k in Preview). Designed for efficient, production-ready deployment with permissive licensing.

ToolsWeb Search
Context Window
Upstage: Solar Pro 3
upstage/solar-pro-3

Powerful MoE model delivering exceptional performance. Optimized for Korean, English, and Japanese with high efficiency.

ReasoningTools
Context Window
LiquidAI: LFM2.5-1.2B-Thinking (free)
liquid/lfm-2.5-1.2b-thinking:free

Lightweight reasoning-focused model optimized for agentic tasks and RAG. Runs comfortably on edge devices with 32K context.

Reasoning
Context Window
LiquidAI: LFM2.5-1.2B-Instruct (free)
liquid/lfm-2.5-1.2b-instruct:free

Compact, high-performance instruction-tuned model. Delivers strong chat quality and efficient edge inference.

Context Window
Z.ai: GLM 4.5 Air (free)
z-ai/glm-4.5-air:free

Lightweight agent-centric model. Supports hybrid inference modes (thinking vs non-thinking) for versatile interactions.

ReasoningTools
Context Window