AI Models

Browse 101+ models from 14 providers. All accessible through a single API with built-in security, observability, and failover.

101 models14 providers
Anthropic

Claude 3.5 Haiku

claude-3-5-haiku-20241022

chat
Anthropic200K

Fast and efficient

Anthropic

Claude 3.5 Sonnet

claude-3-5-sonnet-20241022

chat
Anthropic200K

Balance of speed and capability

Anthropic

Claude 3.7 Sonnet

claude-3-7-sonnet

reasoning
Anthropic200K

Hybrid reasoning model

Anthropic

Claude Haiku 4.5

claude-haiku-4.5

chat
Anthropic200K

Fastest Claude model

Anthropic

Claude Opus 4

claude-opus-4

chat
Anthropic200K

Most capable Claude model

Anthropic

Claude Opus 4.5

claude-opus-4.5

chat
Anthropic200K

Latest, most intelligent

Anthropic

Claude Opus 4.6

claude-opus-4.6

chat
Anthropic1M

Latest flagship, agentic coding record-breaker

Anthropic

Claude Sonnet 4

claude-sonnet-4

chat
Anthropic200K

Balanced speed & capability

Anthropic

Claude Sonnet 4.5

claude-sonnet-4.5

chat
Anthropic200K

Enhanced coding & agents

Anthropic

Claude Sonnet 4.6

claude-sonnet-4.6

chat
Anthropic200K

Latest flagship, enhanced reasoning & coding

Cohere

Command A

command-a-03-2025

chat
Cohere256K

Most performant, agentic tasks

Cohere

Command Light

command-light

chat
Cohere4K

Fast and efficient

Cohere

Command R

command-r

chat
Cohere128K

Balanced performance

Cohere

Command R+

command-r-plus-08-2024

chat
Cohere128K

Complex RAG and multi-step

DeepSeek

DeepSeek Coder V2

deepseek-coder-v2

code
DeepSeek128K

338 languages, GPT-4 level

DeepSeek

DeepSeek R1

deepseek-reasoner

reasoning
DeepSeek64K

Reasoning model

DeepSeek

DeepSeek V3

deepseek-chat

chat
DeepSeek128K

128K context, MIT license

DeepSeek

DeepSeek V3.1

deepseek-v3.1

chat
DeepSeek128K

Hybrid thinking modes

DeepSeek

DeepSeek V3.2

deepseek-v3.2

chat
DeepSeek128K

GPT-5 level, daily driver

DeepSeek

DeepSeek V3.2 Speciale

deepseek-v3.2-speciale

reasoning
DeepSeek128K

Maxed reasoning, competition gold

Google

Gemini 2.0 Flash

gemini-2.0-flash

chat
Google AI1M

Fast model

Google

Gemini 2.0 Flash Thinking

gemini-2.0-flash-thinking

reasoning
Google AI1M

Reasoning variant

Google

Gemini 2.5 Flash

gemini-2.5-flash

chat
Google AI1M

Thinking capabilities

Google

Gemini 2.5 Flash Lite

gemini-2.5-flash-lite

chat
Google AI1M

Speed optimized

Google

Gemini 2.5 Pro

gemini-2.5-pro

chat
Google AI1M

Enhanced reasoning & coding

Google

Gemini 3 Deep Think

gemini-3-deep-think

reasoning
Google AI1M

Deep iterative reasoning

Google

Gemini 3 Flash

gemini-3-flash

chat
Google AI1M

Frontier speed & intelligence

Google

Gemini 3 Pro

gemini-3-pro

chat
Google AI2M

Powerful Gemini model

Google

Gemini 3 Pro Image

gemini-3-pro-image

image
Google AI

Fast photorealism

Google

Gemini 3.1 Flash Image (Nano Banana 2)

gemini-3.1-flash-image

image
Google AI

Reasoning-guided image synthesis, up to 4K

Google

Gemini 3.1 Pro (Custom Tools)

gemini-3.1-pro-preview-customtools

chat
Google AI1M

Optimized for custom tools and bash

Google

Gemini 3.1 Pro Preview

gemini-3.1-pro-preview

chat
Google AI1M

Latest flagship preview, 1M context, enhanced reasoning

Google

Imagen 3

imagen-3

image
Google AI

High quality images

Groq

Llama 3.1 8B Instant

llama-3.1-8b-instant

chat
Groq128K

Ultra-fast inference

Groq

Llama 3.3 70B Versatile

llama-3.3-70b-versatile

chat
Groq128K

Groq-hosted versatile Llama 3.3 model

Groq

Llama 4 Maverick

llama-4-maverick

chat
Groq256K

Latest multimodal Llama

Groq

Llama 4 Scout

llama-4-scout

chat
Groq256K

Advanced Llama 4 model

Groq

Mixtral 8x7B

mixtral-8x7b-32768

chat
Groq33K

MoE architecture

HuggingFace

Llama 3.3 70B

meta-llama/Llama-3.3-70B-Instruct

chat
Hugging Face128K

Via HF Inference

HuggingFace

Llama 4 Maverick

meta-llama/Llama-4-Maverick

chat
Hugging Face256K

Via HF Inference

HuggingFace

Mistral Large 3

mistralai/Mistral-Large-3

chat
Hugging Face128K

Via HF Inference

HuggingFace

Qwen 2.5 72B

Qwen/Qwen2.5-72B-Instruct

chat
Hugging Face32K

Via HF Inference

Meta

Llama 3.1 405B

llama-3.1-405b

chat
Meta AI128K

Largest open model

Meta

Llama 3.1 70B

llama-3.1-70b

chat
Meta AI128K

Balanced performance

Meta

Llama 3.2 90B Vision

llama-3.2-90b-vision

chat
Meta AI128K

Multimodal understanding

Meta

Llama 3.3 70B

llama-3.3-70b

chat
Meta AI128K

Latest Llama 3 model

Meta

Llama 4 Maverick

llama-4-maverick

chat
Meta AI256K

Latest multimodal flagship

Meta

Llama 4 Scout

llama-4-scout

chat
Meta AI256K

Advanced reasoning

Mistral

Codestral 25.01

codestral-latest

code
Mistral AI256K

2.5x faster code generation

Mistral

Devstral 2

devstral-latest

code
Mistral AI256K

Frontier code agents

Mistral

Magistral Medium

magistral-medium

reasoning
Mistral AI128K

Multimodal reasoning

Mistral

Ministral 3B

ministral-3b

chat
Mistral AI128K

Compact edge model

Mistral

Ministral 8B

ministral-8b

chat
Mistral AI128K

Small efficient model

Mistral

Mistral Large 3

mistral-large-latest

chat
Mistral AI128K

675B params, best open-weight multimodal

Mistral

Mistral Medium 3.1

mistral-medium-latest

chat
Mistral AI128K

Frontier-class multimodal

Mistral

Mistral Small 3

mistral-small-latest

chat
Mistral AI32K

24B params, fast

OpenAI

DALL-E 2

dall-e-2

image
OpenAI

Fast image generation

OpenAI

DALL-E 3

dall-e-3

image
OpenAI

High quality images

OpenAI

GPT Image 1

gpt-image-1

image
OpenAI

ChatGPT image generation model

OpenAI

GPT Image 1.5

gpt-image-1.5

image
OpenAI

Best text rendering

OpenAI

GPT-4 Turbo

gpt-4-turbo

chat
OpenAI128K

Legacy GPT-4 model

OpenAI

GPT-4.1

gpt-4.1

code
OpenAI1.0M

Long-context GPT-4.1

OpenAI

GPT-4.1 Mini

gpt-4.1-mini

chat
OpenAI1.0M

Balanced GPT-4.1 model

OpenAI

GPT-4.1 Nano

gpt-4.1-nano

chat
OpenAI1.0M

Fast GPT-4.1 nano model

OpenAI

GPT-4o

gpt-4o

chat
OpenAI128K

Omni-modal model

OpenAI

GPT-4o Mini

gpt-4o-mini

chat
OpenAI128K

Fast and cost-effective

OpenAI

GPT-5

gpt-5

chat
OpenAI400K

Flagship model

OpenAI

GPT-5 Mini

gpt-5-mini

chat
OpenAI400K

Fast and efficient

OpenAI

GPT-5 Nano

gpt-5-nano

chat
OpenAI400K

Lowest-latency GPT-5 model

OpenAI

GPT-5 Pro

gpt-5-pro

chat
OpenAI400K

High-quality GPT-5 variant

OpenAI

GPT-5.1

gpt-5.1

chat
OpenAI400K

Improved GPT-5 generation

OpenAI

GPT-5.2

gpt-5.2

chat
OpenAI400K

Latest GPT-5.2 flagship

OpenAI

GPT-5.2 Pro

gpt-5.2-pro

chat
OpenAI400K

Most capable GPT-5.2 variant

OpenAI

o1

o1

reasoning
OpenAI200K

Legacy reasoning model

OpenAI

o3

o3

reasoning
OpenAI200K

Advanced reasoning model

OpenAI

o3 Mini

o3-mini

reasoning
OpenAI200K

Fast reasoning model

OpenAI

o3 Pro

o3-pro

reasoning
OpenAI200K

Most advanced reasoning model

OpenAI

o4 Mini

o4-mini

reasoning
OpenAI200K

Successor to o1-mini

OpenRouter

Claude Opus 4.5 (via OpenRouter)

anthropic/claude-opus-4.5

chat
OpenRouter200K

Unified billing

OpenRouter

Gemini 3 Pro (via OpenRouter)

google/gemini-3-pro

chat
OpenRouter2M

Meta-provider

OpenRouter

GPT-5 (via OpenRouter)

openai/gpt-5

chat
OpenRouter256K

Access any model

OpenRouter

Grok 4 (via OpenRouter)

x-ai/grok-4

chat
OpenRouter256K

Access xAI models

Perplexity

Sonar

sonar

search
Perplexity128K

Default web-connected

Perplexity

Sonar Large Online

llama-3.1-sonar-large-128k-online

search
Perplexity128K

Web-connected search

Perplexity

Sonar Pro

sonar-pro

search
Perplexity128K

Enhanced search, richer context

Perplexity

Sonar Reasoning Pro

sonar-reasoning-pro

reasoning
Perplexity128K

Deep inference & research

Qwen

Qwen 2.5 32B

qwen2.5-32b-instruct

chat
Qwen128K

Balanced performance

Qwen

Qwen 2.5 72B

qwen2.5-72b-instruct

chat
Qwen128K

Flagship model

Qwen

Qwen 2.5 Coder 32B

qwen2.5-coder-32b

code
Qwen128K

Code specialized

Qwen

QwQ 32B

qwq-32b-preview

reasoning
Qwen32K

Reasoning model

together.ai

DeepSeek V3.1

deepseek-ai/DeepSeek-V3.1

chat
Together AI128K

Hybrid reasoning

together.ai

Llama 3.3 70B Turbo

meta-llama/Llama-3.3-70B-Instruct-Turbo

chat
Together AI128K

Fast Llama inference

together.ai

Llama 4 Maverick

meta-llama/Llama-4-Maverick

chat
Together AI256K

Latest Llama

together.ai

Qwen 2.5 72B

Qwen/Qwen2.5-72B-Instruct-Turbo

chat
Together AI32K

Alibaba flagship

Grok

Grok 3

grok-3

chat
xAI128K

DeepSearch, Big Brain Mode

Grok

Grok 3 Mini

grok-3-mini

chat
xAI128K

Fast responses

Grok

Grok 4

grok-4

chat
xAI256K

Enhanced reasoning, real-time search

Grok

Grok 4 Heavy

grok-4-heavy

chat
xAI256K

Maximum capability

Grok

Grok 4.1

grok-4.1

chat
xAI256K

Improved multimodal & reasoning

Grok

Grok 4.1 Fast

grok-4.1-fast

chat
xAI2M

Best agentic tool calling

Grok

Grok Code Fast

grok-code-fast-1

code
xAI128K

Fast agentic coding