How accurate are these LLM API prices?
Prices are comparison data transformed from public pricing metadata. Verify provider pages before production routing or purchasing decisions.
apiroute.dev
Cost intelligence for AI infrastructure
Compare prices, cache discounts, prompt costs, and model routes. Before paying for cloud tokens, check whether the model fits on your own GPU. Local AI companion tool Can my GPU run this LLM? Check VRAM fit, model size, quantization, and local-vs-cloud fallback. Open local GPU checker Read update
Quick route
First choose the use case. Then choose whether you want the best model, the cheapest route, best value, or your own model.
1. Use case
2. Optimize for
The router is loading model pricing data.
Why this helps
The route shows the cheapest fitting model and scales it to a workflow budget.
Top picks
Cost estimate
$0.000000
Token Calculator
Token approximation: 1 word equals roughly 1.3 tokens.
Best Route
One clear route first. Open the settings when the job needs different requirements.
Live Route API
Endpoint
Model views
Loading models...
| Model | Provider | Context | Input |
|---|---|---|---|
| Loading models... | |||
Data source
Freshness, source and currency view
Prices are comparison data. Verify provider pages before production routing or purchasing decisions.
Planning score is a rule-based heuristic derived from price tier, context window, capabilities, and model name signals. It is not an external benchmark, Elo rating, or LMArena score. Local open-weight models can serve as a practical cost fallback when frontier model pricing rises or availability changes; use this as a planning signal, not a performance guarantee.
Token Waste Check
Compare the selected model with cheaper SLM, budget, and local-open routes before a multi-step agent spends tokens.
| Route | Cost | Context |
|---|---|---|
| Loading routes... | ||
Model Intelligence
Selected model metadata updates with the calculator.
Context Calculator
Paste text or choose a preset to compare document fit, output limits, and one-time versus cached analysis costs.
| Model | Fit | Context Used | One-time | Cached repeat |
|---|---|---|---|---|
| Enter text to compare all models. | ||||
The calculator uses a simple token approximation: 1 word ≈ 1.3 tokens. It is not an exact tokenizer simulation, but it is useful for fast cost comparisons across LLM APIs and for estimating prompt length before production use.
Cache read costs describe lower prices for prompt segments that have already been cached. With providers such as OpenAI or Anthropic, reused system prompts, long contexts, or repeated prefixes can cost significantly less than entirely new input tokens.
FAQ
Prices are comparison data transformed from public pricing metadata. Verify provider pages before production routing or purchasing decisions.
It is a lower price for repeated prompt segments, reused system prompts, document prefixes, or stable context blocks when a provider supports prompt caching.
It depends on prompt size, output tokens, cache share, and required capabilities. Use the context matrix and Best Route cards for the current job.
No. Provider and model names are used referentially for API pricing, capability, and compatibility comparisons.
For AI Agents
Use these endpoints directly in agents, crawlers, and workflow tools. No HTML table scraping required.
Best default JSON endpoint for current model pricing, capabilities, freshness, and source metadata.
Model list only, useful when an agent already knows the metadata policy.
OpenAPI 3.1 spec for tool registration in agents and workflow systems.
Compact crawler guide for discovering the pricing API and agent-readable docs.
Route recommendation guide
Static machine guide for the Best Route V1 scoring inputs and route labels.
Token Waste Check contract
Agent-readable guide for detecting overpaid repeated prompts and cheaper SLM/budget routes.
{
"tool": "apiroute_prices",
"openapi": "https://apiroute.dev/openapi.yaml",
"default_endpoint": "https://apiroute.dev/api/live-prices",
"routing_guide": "https://apiroute.dev/api/route-recommendation-guide",
"recommend_endpoint": "https://apiroute.dev/api/recommend-route",
"token_waste_contract": "https://apiroute.dev/api/token-waste-check"
}
Business Layer
apiroute.dev can test monetization through alerts, provider sponsorships, and premium data access while keeping the comparison table independent.
A simple waitlist for teams that want alerts when model prices, context windows, or cache discounts change.
Join alert waitlistOne OpenAI-compatible API for many hosted models. Use it as a provider option when you want to test model routing without integrating every vendor separately.
Affiliate link. This does not affect model rankings, calculator results, or route recommendations.
Try AI/ML APIFuture paid access could add higher refresh frequency, pricing history, diff alerts, and machine-readable change logs.
Request API detailsProvider placements can be sold only if clearly labeled. Core price rankings stay sorted by data, not payment.
Discuss sponsorshipCommercial metadata is available for agents and partners as JSON. It describes what is testable now and what is intentionally not sold.
Market Radar