Model group

Rag LLM models

Long-context and enterprise retrieval models for documents, knowledge bases, and analysis.

Cohere Command R7B

cohere_chat · command-r7b-12-2024

Input
$0.037
Output
$0.15
Context
128K

Llama 4 Scout

deepinfra · deepinfra/meta-llama/Llama-4-Scout-17B-16E-Instruct

Input
$0.08
Output
$0.3
Context
327.7K

Llama 4 Maverick

deepinfra · deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8

Input
$0.15
Output
$0.6
Context
1M

Gemini 3.1 Flash-Lite

gemini · gemini/gemini-3.1-flash-lite

Input
$0.25
Output
$1.5
Context
1M

Claude Haiku 4.5

anthropic · claude-haiku-4-5

Input
$1
Output
$5
Context
200K

Grok 4.20 Reasoning

xai · grok-4.20-0309-reasoning

Input
$1.25
Output
$2.5
Context
1M

Grok 4.20 Multi-Agent

xai · grok-4.20-multi-agent-0309

Input
$1.25
Output
$2.5
Context
1M

GLM-5.2

zai · glm-5.2

Input
$1.4
Output
$4.4
Context
1M

Gemini 3.1 Pro Preview

gemini · gemini/gemini-3.1-pro-preview

Input
$2
Output
$12
Context
1M

Cohere Command A

cohere_chat · command-a-03-2025

Input
$2.5
Output
$10
Context
256K

Claude Sonnet 4.6

anthropic · claude-sonnet-4-6

Input
$3
Output
$15
Context
1M

GPT-5.5

openai · gpt-5.5

Input
$5
Output
$30
Context
1.1M

Claude Opus 4.7

anthropic · claude-opus-4-7

Input
$5
Output
$25
Context
1M

Claude Opus 4.8

anthropic · claude-opus-4-8

Input
$5
Output
$25
Context
1M

Claude Fable 5

anthropic · claude-fable-5

Input
$10
Output
$50
Context
1M

Claude Mythos 5

anthropic · claude-mythos-5

Input
$10
Output
$50
Context
1M

GPT-5.5 Pro

openai · gpt-5.5-pro

Input
$30
Output
$180
Context
1.1M