GPT 4.1
openai/gpt-4.1Flagship model with 200K context, best for complex reasoning and coding
openai/gpt-4.1Flagship model with 200K context, best for complex reasoning and coding
openai/gpt-4.1-miniFast and affordable, great balance of speed and intelligence
openai/gpt-4.1-nanoFastest and cheapest, ideal for simple tasks and classification
openai/gpt-4oPrevious flagship with vision, strong all-around performance
openai/gpt-4o-miniCompact model optimized for speed and cost efficiency
openai/gpt-4o-audio-previewMultimodal model supporting audio input and output
openai/o3Latest reasoning model with improved speed and accuracy
openai/o3-miniEfficient reasoning model, best value for thinking tasks
openai/o4-miniNewest compact reasoning model with tool use support
openai/dall-e-3State-of-the-art image generation from text prompts
openai/tts-1Standard text-to-speech, fast and natural sounding
openai/tts-1-hdHigh-definition text-to-speech with premium voice quality
openai/whisper-1Industry-leading speech-to-text transcription
anthropic/claude-sonnet-4Balanced Claude with strong coding and reasoning abilities
anthropic/claude-sonnet-4-thinkingSonnet 4 with extended thinking for deeper reasoning
anthropic/claude-opus-4-6Latest Opus with improved coding and reduced cost
anthropic/claude-sonnet-4-6Latest Sonnet, top choice for Claude Code and Cursor
anthropic/claude-haiku-4-5Fast and capable, great for real-time applications
anthropic/claude-haiku-3.5Previous Haiku generation, compact and efficient
anthropic/claude-sonnet-4.5modelsCatalog.modelDesc.anthropic_claude-sonnet-4-5
anthropic/claude-opus-4.5modelsCatalog.modelDesc.anthropic_claude-opus-4-5
anthropic/claude-opus-4.1modelsCatalog.modelDesc.anthropic_claude-opus-4-1
google/gemini-2.5-proGoogle most capable model with 1M context window
google/gemini-2.5-flashFast Gemini with strong reasoning at low cost
google/gemini-2.0-flashPrevious generation flash model, reliable and fast
google/gemini-2.0-flash-liteUltra-lightweight model for high-throughput tasks
google/gemini-1.5-proProven model with excellent long-context understanding
google/gemini-1.5-flashFast and affordable, great for summarization
deepseek/deepseek-chatOpen-source powerhouse, strong coding and math skills
deepseek/deepseek-reasonerChain-of-thought reasoning model rivaling o1
deepseek/deepseek-v3.2modelsCatalog.modelDesc.deepseek_deepseek-v3-2
deepseek/deepseek-v3.1modelsCatalog.modelDesc.deepseek_deepseek-v3-1
deepseek/deepseek-v3modelsCatalog.modelDesc.deepseek_deepseek-v3
deepseek/deepseek-r1modelsCatalog.modelDesc.deepseek_deepseek-r1
deepseek/deepseek-r1-0528modelsCatalog.modelDesc.deepseek_deepseek-r1-0528
meta/llama-4-maverickLatest Llama with 1M context and multimodal support
meta/llama-4-scoutEfficient Llama 4 variant with 512K context
meta/llama-3.3-70bStrong open-source model for general tasks
meta/llama-3.1-405bLargest open-source model, near-frontier performance
meta/llama-3.1-70bVersatile 70B model with good cost-performance ratio
meta/llama-3.1-8bLightweight and fast, ideal for simple tasks
mistral/mistral-largeMistral flagship, strong multilingual and reasoning
mistral/pixtral-largeMultimodal model with vision capabilities
mistral/mistral-large-3modelsCatalog.modelDesc.mistral_mistral-large-3
mistral/devstral-2modelsCatalog.modelDesc.mistral_devstral-2
mistral/magistral-smallmodelsCatalog.modelDesc.mistral_magistral-small
mistral/ministral-14bmodelsCatalog.modelDesc.mistral_ministral-14b
cohere/command-r-plusEnterprise-grade RAG and tool use specialist
cohere/command-rEfficient model optimized for retrieval tasks
cohere/command-aLatest Command model with improved reasoning
xai/grok-3xAI flagship with deep reasoning and real-time knowledge
xai/grok-3-miniFast and affordable Grok for everyday tasks
xai/grok-2Previous generation Grok model
qwen/qwen-maxQwen flagship via direct API
qwen/qwen-plusBalanced Qwen model via direct API
qwen/qwen-turboFast Qwen model, deprecated in favor of Qwen Flash
qwen/qwen2.5-coder-32bSpecialized coding model with 32B parameters
qwen/qwen-vl-maxQwen vision-language model via direct API
qwen/qwen3-maxmodelsCatalog.modelDesc.qwen_qwen3-max
qwen/qwen3.5-plusmodelsCatalog.modelDesc.qwen_qwen3-5-plus
qwen/qwen3.5-flashmodelsCatalog.modelDesc.qwen_qwen3-5-flash
qwen/qwen-longmodelsCatalog.modelDesc.qwen_qwen-long
qwen/qwq-plusmodelsCatalog.modelDesc.qwen_qwq-plus
qwen/qwen3-coder-plusmodelsCatalog.modelDesc.qwen_qwen3-coder-plus
qwen/qwen3-vl-plusmodelsCatalog.modelDesc.qwen_qwen3-vl-plus
qwen/qwen-flashmodelsCatalog.modelDesc.qwen_qwen-flash
qwen/qwen-max-latestmodelsCatalog.modelDesc.qwen_qwen-max-latest
qwen/qwen-plus-latestmodelsCatalog.modelDesc.qwen_qwen-plus-latest
qwen/qwen3-coder-nextmodelsCatalog.modelDesc.qwen_qwen3-coder-next
qwen/qwen-vl-plusmodelsCatalog.modelDesc.qwen_qwen-vl-plus
qwen/qwen3-next-80bmodelsCatalog.modelDesc.qwen_qwen3-next-80b
qwen/qwen3-vl-235bmodelsCatalog.modelDesc.qwen_qwen3-vl-235b
qwen/qwen3-coder-30bmodelsCatalog.modelDesc.qwen_qwen3-coder-30b
qwen/qwen3-32bmodelsCatalog.modelDesc.qwen_qwen3-32b
zhipu/glm-5modelsCatalog.modelDesc.zhipu_glm-5
zhipu/glm-4.7modelsCatalog.modelDesc.zhipu_glm-4-7
zhipu/glm-4.6modelsCatalog.modelDesc.zhipu_glm-4-6
zhipu/glm-4.5modelsCatalog.modelDesc.zhipu_glm-4-5
zhipu/glm-4.5-airmodelsCatalog.modelDesc.zhipu_glm-4-5-air
zhipu/glm-4.7-flashmodelsCatalog.modelDesc.zhipu_glm-4-7-flash
minimax/minimax-m2.5modelsCatalog.modelDesc.minimax_minimax-m2-5
minimax/minimax-m2.1modelsCatalog.modelDesc.minimax_minimax-m2-1
minimax/minimax-m2modelsCatalog.modelDesc.minimax_minimax-m2
moonshot/kimi-k2.5modelsCatalog.modelDesc.moonshot_kimi-k2-5
moonshot/kimi-k2-thinkingmodelsCatalog.modelDesc.moonshot_kimi-k2-thinking
doubao/doubao-1.5-pro-256kByteDance Doubao with 256K context
doubao/doubao-1.5-pro-32kDoubao Pro with standard 32K context
doubao/doubao-1.5-lite-32kUltra-affordable Doubao for basic tasks
amazon/nova-micromodelsCatalog.modelDesc.amazon_nova-micro
amazon/nova-litemodelsCatalog.modelDesc.amazon_nova-lite
amazon/nova-promodelsCatalog.modelDesc.amazon_nova-pro
amazon/nova-premiermodelsCatalog.modelDesc.amazon_nova-premier
nvidia/nemotron-super-3-120bmodelsCatalog.modelDesc.nvidia_nemotron-super-3-120b
nvidia/nemotron-nano-3-30bmodelsCatalog.modelDesc.nvidia_nemotron-nano-3-30b
google/gemma-3-27bmodelsCatalog.modelDesc.google_gemma-3-27b
google/gemma-3-12bmodelsCatalog.modelDesc.google_gemma-3-12b
google/gemma-3-4bmodelsCatalog.modelDesc.google_gemma-3-4b
ai21/jamba-1.5-largemodelsCatalog.modelDesc.ai21_jamba-1-5-large
ai21/jamba-1.5-minimodelsCatalog.modelDesc.ai21_jamba-1-5-mini
meta/llama-3.2-90bmodelsCatalog.modelDesc.meta_llama-3-2-90b
meta/llama-3.2-11bmodelsCatalog.modelDesc.meta_llama-3-2-11b
meta/llama-3.2-3bmodelsCatalog.modelDesc.meta_llama-3-2-3b
meta/llama-3.2-1bmodelsCatalog.modelDesc.meta_llama-3-2-1b
qwen/flux-mergedmodelsCatalog.modelDesc.qwen_flux-merged
qwen/flux-schnellmodelsCatalog.modelDesc.qwen_flux-schnell
qwen/cosyvoice-v2modelsCatalog.modelDesc.qwen_cosyvoice-v2
qwen/sensevoice-v1modelsCatalog.modelDesc.qwen_sensevoice-v1
qwen/paraformer-v2modelsCatalog.modelDesc.qwen_paraformer-v2
amazon/nova-canvasmodelsCatalog.modelDesc.amazon_nova-canvas