Docs/AI SDK

AI

Models

Last updated March 3, 2026

Browse all supported AI models available through Cencori, including their providers, types, and context windows.

Overview

Cencori provides unified access to 100+ state-of-the-art AI models from leading providers like OpenAI, Anthropic, Google, Mistral, xAI, DeepSeek, and more.

You can use any of these models with the same API format, simply by changing the model parameter.

Model Catalog


101 models14 providers
Anthropic

Claude 3.5 Haiku

claude-3-5-haiku-20241022

chat
Anthropic200K

Fast and efficient

Anthropic

Claude 3.5 Sonnet

claude-3-5-sonnet-20241022

chat
Anthropic200K

Balance of speed and capability

Anthropic

Claude 3.7 Sonnet

claude-3-7-sonnet

reasoning
Anthropic200K

Hybrid reasoning model

Anthropic

Claude Haiku 4.5

claude-haiku-4.5

chat
Anthropic200K

Fastest Claude model

Anthropic

Claude Opus 4

claude-opus-4

chat
Anthropic200K

Most capable Claude model

Anthropic

Claude Opus 4.5

claude-opus-4.5

chat
Anthropic200K

Latest, most intelligent

Anthropic

Claude Opus 4.6

claude-opus-4.6

chat
Anthropic1M

Latest flagship, agentic coding record-breaker

Anthropic

Claude Sonnet 4

claude-sonnet-4

chat
Anthropic200K

Balanced speed & capability

Anthropic

Claude Sonnet 4.5

claude-sonnet-4.5

chat
Anthropic200K

Enhanced coding & agents

Anthropic

Claude Sonnet 4.6

claude-sonnet-4.6

chat
Anthropic200K

Latest flagship, enhanced reasoning & coding

Cohere

Command A

command-a-03-2025

chat
Cohere256K

Most performant, agentic tasks

Cohere

Command Light

command-light

chat
Cohere4K

Fast and efficient

Cohere

Command R

command-r

chat
Cohere128K

Balanced performance

Cohere

Command R+

command-r-plus-08-2024

chat
Cohere128K

Complex RAG and multi-step

DeepSeek

DeepSeek Coder V2

deepseek-coder-v2

code
DeepSeek128K

338 languages, GPT-4 level

DeepSeek

DeepSeek R1

deepseek-reasoner

reasoning
DeepSeek64K

Reasoning model

DeepSeek

DeepSeek V3

deepseek-chat

chat
DeepSeek128K

128K context, MIT license

DeepSeek

DeepSeek V3.1

deepseek-v3.1

chat
DeepSeek128K

Hybrid thinking modes

DeepSeek

DeepSeek V3.2

deepseek-v3.2

chat
DeepSeek128K

GPT-5 level, daily driver

DeepSeek

DeepSeek V3.2 Speciale

deepseek-v3.2-speciale

reasoning
DeepSeek128K

Maxed reasoning, competition gold

Google

Gemini 2.0 Flash

gemini-2.0-flash

chat
Google AI1M

Fast model

Google

Gemini 2.0 Flash Thinking

gemini-2.0-flash-thinking

reasoning
Google AI1M

Reasoning variant

Google

Gemini 2.5 Flash

gemini-2.5-flash

chat
Google AI1M

Thinking capabilities

Google

Gemini 2.5 Flash Lite

gemini-2.5-flash-lite

chat
Google AI1M

Speed optimized

Google

Gemini 2.5 Pro

gemini-2.5-pro

chat
Google AI1M

Enhanced reasoning & coding

Google

Gemini 3 Deep Think

gemini-3-deep-think

reasoning
Google AI1M

Deep iterative reasoning

Google

Gemini 3 Flash

gemini-3-flash

chat
Google AI1M

Frontier speed & intelligence

Google

Gemini 3 Pro

gemini-3-pro

chat
Google AI2M

Powerful Gemini model

Google

Gemini 3 Pro Image

gemini-3-pro-image

image
Google AI

Fast photorealism

Google

Gemini 3.1 Flash Image (Nano Banana 2)

gemini-3.1-flash-image

image
Google AI

Reasoning-guided image synthesis, up to 4K

Google

Gemini 3.1 Pro (Custom Tools)

gemini-3.1-pro-preview-customtools

chat
Google AI1M

Optimized for custom tools and bash

Google

Gemini 3.1 Pro Preview

gemini-3.1-pro-preview

chat
Google AI1M

Latest flagship preview, 1M context, enhanced reasoning

Google

Imagen 3

imagen-3

image
Google AI

High quality images

Groq

Llama 3.1 8B Instant

llama-3.1-8b-instant

chat
Groq128K

Ultra-fast inference

Groq

Llama 3.3 70B Versatile

llama-3.3-70b-versatile

chat
Groq128K

Groq-hosted versatile Llama 3.3 model

Groq

Llama 4 Maverick

llama-4-maverick

chat
Groq256K

Latest multimodal Llama

Groq

Llama 4 Scout

llama-4-scout

chat
Groq256K

Advanced Llama 4 model

Groq

Mixtral 8x7B

mixtral-8x7b-32768

chat
Groq33K

MoE architecture

HuggingFace

Llama 3.3 70B

meta-llama/Llama-3.3-70B-Instruct

chat
Hugging Face128K

Via HF Inference

HuggingFace

Llama 4 Maverick

meta-llama/Llama-4-Maverick

chat
Hugging Face256K

Via HF Inference

HuggingFace

Mistral Large 3

mistralai/Mistral-Large-3

chat
Hugging Face128K

Via HF Inference

HuggingFace

Qwen 2.5 72B

Qwen/Qwen2.5-72B-Instruct

chat
Hugging Face32K

Via HF Inference

Meta

Llama 3.1 405B

llama-3.1-405b

chat
Meta AI128K

Largest open model

Meta

Llama 3.1 70B

llama-3.1-70b

chat
Meta AI128K

Balanced performance

Meta

Llama 3.2 90B Vision

llama-3.2-90b-vision

chat
Meta AI128K

Multimodal understanding

Meta

Llama 3.3 70B

llama-3.3-70b

chat
Meta AI128K

Latest Llama 3 model

Meta

Llama 4 Maverick

llama-4-maverick

chat
Meta AI256K

Latest multimodal flagship

Meta

Llama 4 Scout

llama-4-scout

chat
Meta AI256K

Advanced reasoning

Mistral

Codestral 25.01

codestral-latest

code
Mistral AI256K

2.5x faster code generation

Mistral

Devstral 2

devstral-latest

code
Mistral AI256K

Frontier code agents

Mistral

Magistral Medium

magistral-medium

reasoning
Mistral AI128K

Multimodal reasoning

Mistral

Ministral 3B

ministral-3b

chat
Mistral AI128K

Compact edge model

Mistral

Ministral 8B

ministral-8b

chat
Mistral AI128K

Small efficient model

Mistral

Mistral Large 3

mistral-large-latest

chat
Mistral AI128K

675B params, best open-weight multimodal

Mistral

Mistral Medium 3.1

mistral-medium-latest

chat
Mistral AI128K

Frontier-class multimodal

Mistral

Mistral Small 3

mistral-small-latest

chat
Mistral AI32K

24B params, fast

OpenAI

DALL-E 2

dall-e-2

image
OpenAI

Fast image generation

OpenAI

DALL-E 3

dall-e-3

image
OpenAI

High quality images

OpenAI

GPT Image 1

gpt-image-1

image
OpenAI

ChatGPT image generation model

OpenAI

GPT Image 1.5

gpt-image-1.5

image
OpenAI

Best text rendering

OpenAI

GPT-4 Turbo

gpt-4-turbo

chat
OpenAI128K

Legacy GPT-4 model

OpenAI

GPT-4.1

gpt-4.1

code
OpenAI1.0M

Long-context GPT-4.1

OpenAI

GPT-4.1 Mini

gpt-4.1-mini

chat
OpenAI1.0M

Balanced GPT-4.1 model

OpenAI

GPT-4.1 Nano

gpt-4.1-nano

chat
OpenAI1.0M

Fast GPT-4.1 nano model

OpenAI

GPT-4o

gpt-4o

chat
OpenAI128K

Omni-modal model

OpenAI

GPT-4o Mini

gpt-4o-mini

chat
OpenAI128K

Fast and cost-effective

OpenAI

GPT-5

gpt-5

chat
OpenAI400K

Flagship model

OpenAI

GPT-5 Mini

gpt-5-mini

chat
OpenAI400K

Fast and efficient

OpenAI

GPT-5 Nano

gpt-5-nano

chat
OpenAI400K

Lowest-latency GPT-5 model

OpenAI

GPT-5 Pro

gpt-5-pro

chat
OpenAI400K

High-quality GPT-5 variant

OpenAI

GPT-5.1

gpt-5.1

chat
OpenAI400K

Improved GPT-5 generation

OpenAI

GPT-5.2

gpt-5.2

chat
OpenAI400K

Latest GPT-5.2 flagship

OpenAI

GPT-5.2 Pro

gpt-5.2-pro

chat
OpenAI400K

Most capable GPT-5.2 variant

OpenAI

o1

o1

reasoning
OpenAI200K

Legacy reasoning model

OpenAI

o3

o3

reasoning
OpenAI200K

Advanced reasoning model

OpenAI

o3 Mini

o3-mini

reasoning
OpenAI200K

Fast reasoning model

OpenAI

o3 Pro

o3-pro

reasoning
OpenAI200K

Most advanced reasoning model

OpenAI

o4 Mini

o4-mini

reasoning
OpenAI200K

Successor to o1-mini

OpenRouter

Claude Opus 4.5 (via OpenRouter)

anthropic/claude-opus-4.5

chat
OpenRouter200K

Unified billing

OpenRouter

Gemini 3 Pro (via OpenRouter)

google/gemini-3-pro

chat
OpenRouter2M

Meta-provider

OpenRouter

GPT-5 (via OpenRouter)

openai/gpt-5

chat
OpenRouter256K

Access any model

OpenRouter

Grok 4 (via OpenRouter)

x-ai/grok-4

chat
OpenRouter256K

Access xAI models

Perplexity

Sonar

sonar

search
Perplexity128K

Default web-connected

Perplexity

Sonar Large Online

llama-3.1-sonar-large-128k-online

search
Perplexity128K

Web-connected search

Perplexity

Sonar Pro

sonar-pro

search
Perplexity128K

Enhanced search, richer context

Perplexity

Sonar Reasoning Pro

sonar-reasoning-pro

reasoning
Perplexity128K

Deep inference & research

Qwen

Qwen 2.5 32B

qwen2.5-32b-instruct

chat
Qwen128K

Balanced performance

Qwen

Qwen 2.5 72B

qwen2.5-72b-instruct

chat
Qwen128K

Flagship model

Qwen

Qwen 2.5 Coder 32B

qwen2.5-coder-32b

code
Qwen128K

Code specialized

Qwen

QwQ 32B

qwq-32b-preview

reasoning
Qwen32K

Reasoning model

together.ai

DeepSeek V3.1

deepseek-ai/DeepSeek-V3.1

chat
Together AI128K

Hybrid reasoning

together.ai

Llama 3.3 70B Turbo

meta-llama/Llama-3.3-70B-Instruct-Turbo

chat
Together AI128K

Fast Llama inference

together.ai

Llama 4 Maverick

meta-llama/Llama-4-Maverick

chat
Together AI256K

Latest Llama

together.ai

Qwen 2.5 72B

Qwen/Qwen2.5-72B-Instruct-Turbo

chat
Together AI32K

Alibaba flagship

Grok

Grok 3

grok-3

chat
xAI128K

DeepSearch, Big Brain Mode

Grok

Grok 3 Mini

grok-3-mini

chat
xAI128K

Fast responses

Grok

Grok 4

grok-4

chat
xAI256K

Enhanced reasoning, real-time search

Grok

Grok 4 Heavy

grok-4-heavy

chat
xAI256K

Maximum capability

Grok

Grok 4.1

grok-4.1

chat
xAI256K

Improved multimodal & reasoning

Grok

Grok 4.1 Fast

grok-4.1-fast

chat
xAI2M

Best agentic tool calling

Grok

Grok Code Fast

grok-code-fast-1

code
xAI128K

Fast agentic coding

Usage

To use a model, pass its ID to any of the AI SDK methods:

Codetext
import { cencori } from '@/lib/cencori'
 
const result = await cencori.ai.chat({
  model: 'gpt-4o', // Use the ID from the catalog above
  messages: [
    { role: 'user', content: 'Hello!' }
  ]
})