Model group

Multimodal LLM models

Models with vision or multimodal capability flags in the pricing source.

Llama 4 Scout

deepinfra · deepinfra/meta-llama/Llama-4-Scout-17B-16E-Instruct

Input
$0.08
Output
$0.3
Context
327.7K

Llama 4 Maverick

deepinfra · deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8

Input
$0.15
Output
$0.6
Context
1M

Gemini 3.1 Flash-Lite

gemini · gemini/gemini-3.1-flash-lite

Input
$0.25
Output
$1.5
Context
1M

Grok 4.20 Reasoning

xai · grok-4.20-0309-reasoning

Input
$1.25
Output
$2.5
Context
1M

Gemini 3.1 Pro Preview

gemini · gemini/gemini-3.1-pro-preview

Input
$2
Output
$12
Context
1M

GPT-5.5

openai · gpt-5.5

Input
$5
Output
$30
Context
1.1M

Claude Opus 4.7

anthropic · claude-opus-4-7

Input
$5
Output
$25
Context
1M

Claude Opus 4.8

anthropic · claude-opus-4-8

Input
$5
Output
$25
Context
1M

Claude Fable 5

anthropic · claude-fable-5

Input
$10
Output
$50
Context
1M

Claude Mythos 5

anthropic · claude-mythos-5

Input
$10
Output
$50
Context
1M

GPT-5.5 Pro

openai · gpt-5.5-pro

Input
$30
Output
$180
Context
1.1M