Model group
Rag LLM models
Long-context and enterprise retrieval models for documents, knowledge bases, and analysis.
Cohere Command R7B
cohere_chat · command-r7b-12-2024
- Input
- $0.037
- Output
- $0.15
- Context
- 128K
Llama 4 Scout
deepinfra · deepinfra/meta-llama/Llama-4-Scout-17B-16E-Instruct
- Input
- $0.08
- Output
- $0.3
- Context
- 327.7K
Llama 4 Maverick
deepinfra · deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8
- Input
- $0.15
- Output
- $0.6
- Context
- 1M
Gemini 3.1 Flash-Lite
gemini · gemini/gemini-3.1-flash-lite
- Input
- $0.25
- Output
- $1.5
- Context
- 1M
Claude Haiku 4.5
anthropic · claude-haiku-4-5
- Input
- $1
- Output
- $5
- Context
- 200K
Grok 4.20 Reasoning
xai · grok-4.20-0309-reasoning
- Input
- $1.25
- Output
- $2.5
- Context
- 1M
Grok 4.20 Multi-Agent
xai · grok-4.20-multi-agent-0309
- Input
- $1.25
- Output
- $2.5
- Context
- 1M
GLM-5.2
zai · glm-5.2
- Input
- $1.4
- Output
- $4.4
- Context
- 1M
Gemini 3.1 Pro Preview
gemini · gemini/gemini-3.1-pro-preview
- Input
- $2
- Output
- $12
- Context
- 1M
Cohere Command A
cohere_chat · command-a-03-2025
- Input
- $2.5
- Output
- $10
- Context
- 256K
Claude Sonnet 4.6
anthropic · claude-sonnet-4-6
- Input
- $3
- Output
- $15
- Context
- 1M
GPT-5.5
openai · gpt-5.5
- Input
- $5
- Output
- $30
- Context
- 1.1M
Claude Opus 4.7
anthropic · claude-opus-4-7
- Input
- $5
- Output
- $25
- Context
- 1M
Claude Opus 4.8
anthropic · claude-opus-4-8
- Input
- $5
- Output
- $25
- Context
- 1M
Claude Fable 5
anthropic · claude-fable-5
- Input
- $10
- Output
- $50
- Context
- 1M
Claude Mythos 5
anthropic · claude-mythos-5
- Input
- $10
- Output
- $50
- Context
- 1M
GPT-5.5 Pro
openai · gpt-5.5-pro
- Input
- $30
- Output
- $180
- Context
- 1.1M