Supported Models

159 models from 21 model makers. Call any model by its ID in the maker/model format.

Token prices are per 1M tokens; image/video prices are per image/second. All include our service fee.

91 models

OpenAI (34)

Model ID	Capabilities	Context	Input	Output
`openai/gpt-3.5-turbo` OpenAI: GPT-3.5 Turbo GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.	Tools	16K	$0.600	$1.80
`openai/gpt-3.5-turbo-16k` OpenAI: GPT-3.5 Turbo 16k This model offers four times the context length of gpt-3.5-turbo, allowing it to support approximately 20 pages of text in a single request at a higher cost. Training data: up...	Tools	16K	$3.60	$4.80
`openai/gpt-4` OpenAI: GPT-4 OpenAI's flagship model, GPT-4 is a large-scale multimodal language model capable of solving difficult problems with greater accuracy than previous models due to its broader general knowledge and advanced reasoning...	Tools	8K	$36.00	$72.00
`openai/gpt-4-turbo` OpenAI: GPT-4 Turbo The latest GPT-4 Turbo model with vision capabilities. Vision requests can now use JSON mode and function calling. Training data: up to December 2023.	VisionTools	128K	$12.00	$36.00
`openai/gpt-4.1` OpenAI: GPT-4.1 GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and...	VisionTools	1.0M	$2.40	$9.60
`openai/gpt-4.1-mini` OpenAI: GPT-4.1 Mini GPT-4.1 Mini is a mid-sized model delivering performance competitive with GPT-4o at substantially lower latency and cost. It retains a 1 million token context window and scores 45.1% on hard...	VisionTools	1.0M	$0.480	$1.92
`openai/gpt-4.1-nano` OpenAI: GPT-4.1 Nano For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1 million...	VisionTools	1.0M	$0.120	$0.480
`openai/gpt-4o` OpenAI: GPT-4o GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of [GPT-4 Turbo](/models/openai/gpt-4-turbo) while being twice as...	VisionTools	128K	$3.00	$12.00
`openai/gpt-4o-2024-05-13` OpenAI: GPT-4o (2024-05-13) GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of [GPT-4 Turbo](/models/openai/gpt-4-turbo) while being twice as...	VisionTools	128K	$6.00	$18.00
`openai/gpt-4o-2024-08-06` OpenAI: GPT-4o (2024-08-06) The 2024-08-06 version of GPT-4o offers improved performance in structured outputs, with the ability to supply a JSON schema in the respone_format. Read more [here](https://openai.com/index/introducing-structured-outputs-in-the-api/). GPT-4o ("o" for "omni") is...	VisionTools	128K	$3.00	$12.00
`openai/gpt-4o-2024-11-20` OpenAI: GPT-4o (2024-11-20) The 2024-11-20 version of GPT-4o offers a leveled-up creative writing ability with more natural, engaging, and tailored writing to improve relevance & readability. It’s also better at working with uploaded...	VisionTools	128K	$3.00	$12.00
`openai/gpt-4o-mini` OpenAI: GPT-4o-mini GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...	VisionTools	128K	$0.180	$0.720
`openai/gpt-4o-mini-2024-07-18` OpenAI: GPT-4o-mini (2024-07-18) GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...	VisionTools	128K	$0.180	$0.720
`openai/gpt-4o-mini-search-preview` OpenAI: GPT-4o-mini Search Preview GPT-4o mini Search Preview is a specialized model for web search in Chat Completions. It is trained to understand and execute web search queries.	Chat	128K	$0.180	$0.720
`openai/gpt-4o-search-preview` OpenAI: GPT-4o Search Preview GPT-4o Search Previewis a specialized model for web search in Chat Completions. It is trained to understand and execute web search queries.	Chat	128K	$3.00	$12.00
`openai/gpt-5` OpenAI: GPT-5 GPT-5 is OpenAI’s most advanced model, offering major improvements in reasoning, code quality, and user experience. It is optimized for complex tasks that require step-by-step reasoning, instruction following, and accuracy...	VisionTools	400K	$1.50	$12.00
`openai/gpt-5-chat-latest` GPT-5 Chat (Latest) OpenAI GPT-5 chat-tuned, non-reasoning variant — fast vision + chat (no reasoning-mode latency). Points at OpenAI's gpt-5-chat-latest moving alias.	VisionTools	400K	$1.50	$12.00
`openai/gpt-5-mini` OpenAI: GPT-5 Mini GPT-5 Mini is a compact version of GPT-5, designed to handle lighter-weight reasoning tasks. It provides the same instruction-following and safety-tuning benefits as GPT-5, but with reduced latency and cost....	VisionTools	400K	$0.300	$2.40
`openai/gpt-5-nano` OpenAI: GPT-5 Nano GPT-5-Nano is the smallest and fastest variant in the GPT-5 system, optimized for developer tools, rapid interactions, and ultra-low latency environments. While limited in reasoning depth compared to its larger...	VisionTools	400K	$0.060	$0.480
`openai/gpt-5.1` OpenAI: GPT-5.1 GPT-5.1 is the latest frontier-grade model in the GPT-5 series, offering stronger general-purpose reasoning, improved instruction adherence, and a more natural conversational style compared to GPT-5. It uses adaptive reasoning...	VisionTools	400K	$1.50	$12.00
`openai/gpt-5.2` OpenAI: GPT-5.2 GPT-5.2 is the latest frontier-grade model in the GPT-5 series, offering stronger agentic and long context perfomance compared to GPT-5.1. It uses adaptive reasoning to allocate computation dynamically, responding quickly...	VisionTools	400K	$2.10	$16.80
`openai/gpt-5.4` OpenAI: GPT-5.4 GPT-5.4 is OpenAI’s latest frontier model, unifying the Codex and GPT lines into a single system. It features a 1M+ token context window (922K input, 128K output) with support for...	VisionTools	1.1M	$3.00	$18.00
`openai/gpt-5.4-mini` OpenAI: GPT-5.4 Mini GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads. It supports text and image inputs with strong performance across reasoning, coding,...	VisionTools	400K	$0.900	$5.40
`openai/gpt-5.4-nano` OpenAI: GPT-5.4 Nano GPT-5.4 nano is the most lightweight and cost-efficient variant of the GPT-5.4 family, optimized for speed-critical and high-volume tasks. It supports text and image inputs and is designed for low-latency...	VisionTools	400K	$0.240	$1.50
`openai/gpt-5.5` OpenAI: GPT-5.5 GPT-5.5 is OpenAI’s frontier model designed for complex professional workloads, building on GPT-5.4 with stronger reasoning, higher reliability, and improved token efficiency on hard tasks. It features a 1M+ token...	VisionTools	1.1M	$6.00	$36.00
`openai/gpt-5.6-luna` OpenAI: GPT-5.6 Luna GPT-5.6 Luna is OpenAI's cost-optimized GPT-5.6 model for high-volume, latency-sensitive workloads. It keeps strong reasoning, a 1,050,000 token context window, and text and image input at the lowest price in the family, and it corresponds to the nano tier of earlier GPT-5 families. Served direct from OpenAI.	VisionTools	1.1M	$1.20	$7.20
`openai/gpt-5.6-sol` OpenAI: GPT-5.6 Sol GPT-5.6 Sol is the frontier model in OpenAI's GPT-5.6 family, built for complex professional work that demands the highest reasoning. It supports reasoning tokens, a 1,050,000 token context window, and text and image input, and it corresponds to the unsuffixed tier of earlier GPT-5 families. Served direct from OpenAI.	VisionTools	1.1M	$6.00	$36.00
`openai/gpt-5.6-terra` OpenAI: GPT-5.6 Terra GPT-5.6 Terra is OpenAI's GPT-5.6 model that balances intelligence and cost, with higher reasoning for everyday production workloads. It supports reasoning tokens, a 1,050,000 token context window, and text and image input, and it corresponds to the mini tier of earlier GPT-5 families. Served direct from OpenAI.	VisionTools	1.1M	$3.00	$18.00
`openai/gpt-oss-120b` OpenAI: GPT-OSS 120B GPT-OSS 120B is OpenAI's large open-weight model — strong reasoning and tool use at low cost.	Tools	128K	$0.180	$0.720
`openai/gpt-oss-20b` OpenAI: GPT-OSS 20B GPT-OSS 20B is OpenAI's compact open-weight model for fast, inexpensive chat and agents.	Tools	128K	$0.084	$0.360
`openai/o1` OpenAI: o1 The latest and strongest model family from OpenAI, o1 is designed to spend more time thinking before responding. The o1 model series is trained with large-scale reinforcement learning to reason...	VisionTools	200K	$18.00	$72.00
`openai/o3` OpenAI: o3 o3 is a well-rounded and powerful model across domains. It sets a new standard for math, science, coding, and visual reasoning tasks. It also excels at technical writing and instruction-following....	VisionTools	200K	$2.40	$9.60
`openai/o3-mini` OpenAI: o3 Mini OpenAI o3-mini is a cost-efficient language model optimized for STEM reasoning tasks, particularly excelling in science, mathematics, and coding. This model supports the `reasoning_effort` parameter, which can be set to...	VisionTools	200K	$1.32	$5.28
`openai/o4-mini` OpenAI: o4 Mini OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-efficient performance while retaining strong multimodal and agentic capabilities. It supports tool use and demonstrates competitive reasoning...	VisionTools	200K	$1.32	$5.28

Mistral (15)

Model ID	Capabilities	Context	Input	Output
`mistralai/codestral-2508` Mistral: Codestral 2508 Mistral's cutting-edge language model for coding released end of July 2025. Codestral specializes in low-latency, high-frequency tasks such as fill-in-the-middle (FIM), code correction and test generation. [Blog Post](https://mistral.ai/news/codestral-25-08)	VisionTools	256K	$0.360	$1.08
`mistralai/devstral-2512` Mistral: Devstral 2 2512 Devstral 2 is a state-of-the-art open-source model by Mistral AI specializing in agentic coding. It is a 123B-parameter dense transformer model supporting a 256K context window. Devstral 2 supports exploring...	VisionTools	262K	$0.480	$2.40
`mistralai/devstral-2512` Mistral: Devstral 2 2512 Devstral 2 is a state-of-the-art open-source model by Mistral AI specializing in agentic coding. It is a 123B-parameter dense transformer model supporting a 256K context window. Devstral 2 supports exploring...	VisionTools	262K	$0.480	$2.40
`mistralai/ministral-14b-2512` Mistral: Ministral 3 14B 2512 The largest model in the Ministral 3 family, Ministral 3 14B offers frontier capabilities and performance comparable to its larger Mistral Small 3.2 24B counterpart. A powerful and efficient language...	VisionTools	262K	$0.240	$0.240
`mistralai/ministral-14b-2512` Mistral: Ministral 3 14B 2512 The largest model in the Ministral 3 family, Ministral 3 14B offers frontier capabilities and performance comparable to its larger Mistral Small 3.2 24B counterpart. A powerful and efficient language...	VisionTools	262K	$0.240	$0.240
`mistralai/ministral-3b-2512` Mistral: Ministral 3 3B 2512 The smallest model in the Ministral 3 family, Ministral 3 3B is a powerful, efficient tiny language model with vision capabilities.	VisionTools	131K	$0.120	$0.120
`mistralai/ministral-3b-2512` Mistral: Ministral 3 3B 2512 The smallest model in the Ministral 3 family, Ministral 3 3B is a powerful, efficient tiny language model with vision capabilities.	VisionTools	131K	$0.120	$0.120
`mistralai/ministral-8b-2512` Mistral: Ministral 3 8B 2512 A balanced model in the Ministral 3 family, Ministral 3 8B is a powerful, efficient tiny language model with vision capabilities.	VisionTools	262K	$0.180	$0.180
`mistralai/ministral-8b-2512` Mistral: Ministral 3 8B 2512 A balanced model in the Ministral 3 family, Ministral 3 8B is a powerful, efficient tiny language model with vision capabilities.	VisionTools	262K	$0.180	$0.180
`mistralai/mistral-large-2512` Mistral: Mistral Large 3 2512 Mistral Large 3 2512 is Mistral’s most capable model to date, featuring a sparse mixture-of-experts architecture with 41B active parameters (675B total), and released under the Apache 2.0 license.	VisionTools	262K	$0.600	$1.80
`mistralai/mistral-large-2512` Mistral: Mistral Large 3 2512 Mistral Large 3 2512 is Mistral’s most capable model to date, featuring a sparse mixture-of-experts architecture with 41B active parameters (675B total), and released under the Apache 2.0 license.	VisionTools	262K	$0.600	$1.80
`mistralai/mistral-medium-3` Mistral: Mistral Medium 3 Mistral Medium 3 is a high-performance enterprise-grade language model designed to deliver frontier-level capabilities at significantly reduced operational cost. It balances state-of-the-art reasoning and multimodal performance with 8× lower cost...	VisionTools	131K	$0.480	$2.40
`mistralai/mistral-medium-3-5` Mistral: Mistral Medium 3.5 Mistral Medium 3.5 is a dense 128B instruction-following model from Mistral AI. It supports text and image inputs with text output, and is designed for agentic workflows, coding, and complex...	VisionTools	262K	$1.80	$9.00
`mistralai/mistral-small-2603` Mistral: Mistral Small 4 Mistral Small 4 is the next major release in the Mistral Small family, unifying the capabilities of several flagship Mistral models into a single system. It combines strong reasoning from...	VisionTools	262K	$0.180	$0.720
`mistralai/pixtral-large-2411` Mistral: Pixtral Large 2411 Pixtral Large is a 124B parameter, open-weight, multimodal model built on top of [Mistral Large 2](/mistralai/mistral-large-2411). The model is able to understand documents, charts and natural images. The model is...	VisionTools	131K	$2.40	$7.20

Anthropic (13)

Model ID	Capabilities	Context	Input	Output
`anthropic/claude-fable-5` Claude Fable 5 Anthropic's most capable widely released model — most demanding reasoning and long-horizon agentic work. Adaptive thinking always on, 1M-token context. Served direct via the Claude API.	VisionTools	1M	$12.00	$60.00
`anthropic/claude-haiku-4.5` Anthropic: Claude Haiku 4.5	VisionTools	200K	$1.20	$6.00
`anthropic/claude-haiku-4.5` Anthropic: Claude Haiku 4.5 Claude Haiku 4.5 — Anthropic's fast, cost-efficient model. Strong for high-volume tasks and simple agents at a fraction of Sonnet/Opus cost.	VisionTools	200K	$1.20	$6.00
`anthropic/claude-opus-4.5` Anthropic: Claude Opus 4.5 Claude Opus 4.5 — Anthropic's flagship reasoning model for coding, agents, and long-context workflows.	VisionTools	1M	$6.00	$30.00
`anthropic/claude-opus-4.5` Anthropic: Claude Opus 4.5	VisionTools	1M	$6.00	$30.00
`anthropic/claude-opus-4.6` Anthropic: Claude Opus 4.6	VisionTools	1M	$6.00	$30.00
`anthropic/claude-opus-4.6` Anthropic: Claude Opus 4.6 Opus 4.6 is Anthropic’s strongest model for coding and long-running professional tasks. It is built for agents that operate across entire workflows rather than single prompts, making it especially effective...	VisionTools	1M	$6.00	$30.00
`anthropic/claude-opus-4.7` Anthropic: Claude Opus 4.7 Opus 4.7 is the next generation of Anthropic's Opus family, built for long-running, asynchronous agents. Building on the coding and agentic strengths of Opus 4.6, it delivers stronger performance on...	VisionTools	1M	$6.00	$30.00
`anthropic/claude-opus-4.7` Anthropic: Claude Opus 4.7	VisionTools	1M	$6.00	$30.00
`anthropic/claude-opus-4.8` Anthropic: Claude Opus 4.8 Claude Opus 4.8 — Anthropic's most intelligent Opus model for coding and long-running agents. Memory tool (beta), 1M context, enhanced tool orchestration.	VisionTools	1M	$6.00	$30.00
`anthropic/claude-opus-4.8` Anthropic: Claude Opus 4.8	VisionTools	1M	$6.00	$30.00
`anthropic/claude-sonnet-4.6` Anthropic: Claude Sonnet 4.6 Sonnet 4.6 is Anthropic's most capable Sonnet-class model yet, with frontier performance across coding, agents, and professional work. It excels at iterative development, complex codebase navigation, end-to-end project management with...	VisionTools	1M	$3.60	$18.00
`anthropic/claude-sonnet-4.6` Anthropic: Claude Sonnet 4.6	VisionTools	1M	$3.60	$18.00

Google (12)

Model ID	Capabilities	Context	Input	Output
`google/gemini-2.5-flash` Google: Gemini 2.5 Flash Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...	VisionTools	1.0M	$0.360	$3.00
`google/gemini-2.5-flash-image` Google: Nano Banana (Gemini 2.5 Flash Image) Gemini 2.5 Flash Image, a.k.a. "Nano Banana," is now generally available. It is a state of the art image generation model with contextual understanding. It is capable of image generation,...	Vision	33K	$0.360	$3.00
`google/gemini-2.5-flash-lite` Google: Gemini 2.5 Flash Lite Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...	VisionTools	1.0M	$0.120	$0.480
`google/gemini-2.5-pro` Google: Gemini 2.5 Pro Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...	VisionTools	1.0M	$1.50	$12.00
`google/gemini-3-flash-preview` Google: Gemini 3 Flash Preview Gemini 3 Flash Preview is a high speed, high value thinking model designed for agentic workflows, multi turn chat, and coding assistance. It delivers near Pro level reasoning and tool...	VisionTools	1.0M	$0.600	$3.60
`google/gemini-3.1-flash-lite` Google: Gemini 3.1 Flash Lite Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...	VisionTools	1.0M	$0.300	$1.80
`google/gemini-3.1-flash-lite-preview` Google: Gemini 3.1 Flash Lite Preview Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across...	VisionTools	1.0M	$0.300	$1.80
`google/gemini-3.1-pro-preview` Google: Gemini 3.1 Pro Preview Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows. Building on the multimodal foundation...	VisionTools	1.0M	$2.40	$14.40
`google/gemini-3.1-pro-preview-customtools` Google: Gemini 3.1 Pro Preview Custom Tools Gemini 3.1 Pro Preview Custom Tools is a variant of Gemini 3.1 Pro that improves tool selection behavior by preventing overuse of a general bash tool when more efficient third-party...	VisionTools	1.0M	$2.40	$14.40
`google/gemini-3.5-flash` Google: Gemini 3.5 Flash Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level coding and reasoning at Flash-tier cost and speed. It is highly optimized for coding proficiency and parallel agentic execution...	VisionTools	1.0M	$1.80	$10.80
`google/gemma-4-26b-a4b-it` Google: Gemma 4 26B A4B Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B activate per token during inference — delivering near-31B quality at...	VisionTools	262K	$0.072	$0.396
`google/gemma-4-31b-it` Google: Gemma 4 31B Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...	VisionTools	262K	$0.144	$0.444

Qwen (9)

Model ID	Capabilities	Context	Input	Output
`qwen/qwen3-coder-plus` Qwen3 Coder Plus Qwen3 coding specialist (480B-class) with a 1M-token context window. Served direct from Alibaba Cloud Model Studio.	Tools	1M	$1.20	$6.00
`qwen/qwen3-max` Qwen3 Max Alibaba's flagship Qwen3 model. Strongest reasoning and instruction following in the Qwen family. Served direct from Alibaba Cloud Model Studio.	Tools	262K	$0.432	$1.73
`qwen/qwen3-vl-flash` Qwen3 VL Flash Fast, low-cost Qwen3 vision-language model. Served direct from Alibaba Cloud Model Studio.	VisionTools	262K	$0.036	$0.264
`qwen/qwen3-vl-plus` Qwen3 VL Plus Qwen3 vision-language model — image understanding plus text generation. Served direct from Alibaba Cloud Model Studio.	VisionTools	262K	$0.180	$1.73
`qwen/qwen3.5-flash` Qwen3.5 Flash Fast, low-cost Qwen3.5 with a 1M-token context window. Served direct from Alibaba Cloud Model Studio.	Tools	1M	$0.036	$0.348
`qwen/qwen3.5-plus` Qwen3.5 Plus Balanced flagship-tier Qwen3.5 with a 1M-token context window. Served direct from Alibaba Cloud Model Studio.	Tools	1M	$0.144	$0.828
`qwen/qwen3.6-flash` Qwen3.6 Flash Fast, low-cost Qwen3.6 vision-language model — strong agentic coding and reasoning, 1M-token context. A capable step up from 3.5 Flash. Served direct from Alibaba Cloud Model Studio.	VisionTools	1M	$0.300	$1.80
`qwen/qwen3.7-max` Qwen3.7 Max Newest Qwen3.7 flagship — strongest agent-level reasoning, coding, and long-horizon execution. Text-only, 1M-token context. Served direct from Alibaba Cloud Model Studio.	Tools	1M	$3.00	$9.00
`qwen/qwen3.7-plus` Qwen3.7 Plus Cost-effective Qwen3.7 with full vision-language and agent capabilities. Image+text+video in, 1M-token context. Served direct from Alibaba Cloud Model Studio.	VisionTools	1M	$0.480	$1.92

Zai (3)

Model ID Capabilities Context Input Output

zai/glm-4.7

Z.AI: GLM 4.7

GLM 4.7 is a balanced, cost-effective Z.AI model for general chat, coding, and tools.

Tools

203K

$0.720

$2.64

zai/glm-4.7-flash

Z.AI: GLM 4.7 Flash

GLM 4.7 Flash is Z.AI's fast, low-cost tier for high-volume chat and lightweight agents.

Tools

203K

$0.084

$0.480

zai/glm-5

Z.AI: GLM 5

GLM 5 is Z.AI's flagship model with strong reasoning, coding, and agentic tool use.

Tools

200K

$1.20

$3.84

DeepSeek (2)

Model ID Capabilities Context Input Output

deepseek/deepseek-v4-flash

DeepSeek: DeepSeek V4 Flash

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...

Tools

1.0M

$0.120

$0.240

deepseek/deepseek-v4-pro

DeepSeek: DeepSeek V4 Pro

DeepSeek V4 Pro is a large-scale Mixture-of-Experts model from DeepSeek with 1.6T total parameters and 49B activated parameters, supporting a 1M-token context window. It is designed for advanced reasoning, coding,...

Tools

1.0M

$0.528

$1.04

Moonshot AI (2)

Model ID Capabilities Context Input Output

moonshotai/kimi-k2-thinking

Moonshot AI: Kimi K2 Thinking

Kimi K2 Thinking is Moonshot AI's reasoning model that thinks step-by-step before answering.

Tools

256K

$0.720

$3.00

moonshotai/kimi-k2.5

Moonshot AI: Kimi K2.5

Kimi K2.5 is Moonshot AI's strong agentic model with a 256K context window and tool use.

Tools

256K

$0.720

$3.60

MiniMax (1)

Model ID Capabilities Context Input Output

minimax/minimax-m2.5

MiniMax M2.5

MiniMax M2.5 is a capable, cost-efficient general-purpose model with a 196K context window.

Tools

196K

$0.360

$1.44

This catalog updates automatically. Retrieve it programmatically via the List Models API endpoint.