AI & LLMs · Guide · AI & Prompt Tools
AI Pricing Cheat Sheet (2026)
Every consumer plan + API rate for Claude, ChatGPT, Gemini, Perplexity, DeepSeek, Kimi, Grok, Mistral, plus the 5 levers that cut your bill.
By FreeToolArena Staff · Updated June 2026 · 6 min read
Every frontier AI model’s pricing in one page. Consumer plans, API rates, hidden caps, and the 5 levers that change the bill. Updated for 2026 Q1.
Advertisement
Consumer monthly plans (USD)
- Claude Pro / Max: $20 / $100 / $200.
- ChatGPT Plus / Pro: $20 / $200.
- Gemini Advanced / Ultra: $20 / $250.
- Perplexity Pro / Max: $20 / $200.
- Grok (X Premium / Premium+): $8 / $40.
- NotebookLM: Free (Plus included with Gemini Advanced).
- Microsoft Copilot Pro: $20.
API pricing (per 1M tokens)
| Model | Input | Output |
|---|---|---|
| Claude Opus 4.7 | $15 | $75 |
| Claude Sonnet 4.6 | $3 | $15 |
| Claude Haiku 4.5 | $0.80 | $4 |
| GPT-5 | $2.50 | $10 |
| GPT-5 mini | $0.25 | $2 |
| GPT-5 nano | $0.05 | $0.40 |
| Gemini 3 Pro | $2.50 | $10 |
| Gemini 2.5 Pro | $1.25 | $5 |
| Gemini 2.5 Flash | $0.30 | $2.50 |
| DeepSeek V3.2 | $0.27 | $1.10 |
| DeepSeek R1 | $0.55 | $2.19 |
| Kimi K2 | $0.60 | $2.50 |
| Grok 4 | $3 | $15 |
| Mistral Large 3 | $2 | $6 |
The 5 levers that cut your bill
- Prompt caching — 90% off cached input on Anthropic / Google. Always on for stable system prompts.
- Batch API (50% off) — for any async work with 24h SLA.
- Right-sized model — Haiku not Sonnet, Sonnet not Opus, Flash not Pro for routine work.
- Off-peak DeepSeek — 50% off again UTC 16:30-00:30.
- Open weights / self-host — for sustained workloads with consistent load. See the break-even calculator.
Multimodal surcharges
- Image: ~$0.04 per image (Imagen 4, DALL-E 4).
- Video: $0.30-0.50 per 5-10 sec clip (Sora, Veo).
- Audio in: ~$0.006/min (Whisper); native bundled in chat plans.
- Image in (vision): ~1500 tokens per image at flagship rates.
- Video in: ~250 tokens per second @ 1fps (Gemini).
Other 2026 tools worth knowing
- GitHub Copilot Pro / Business: $10 / $19.
- Cursor Pro / Ultra: $20 / $200.
- Windsurf Pro: $15.
- Midjourney: $10-120 across tiers.
- Perplexity Spaces / API: Sonar API ~$5 / 1M tokens.
Run the math against your actual volume: AI cost estimator, monthly budgeter, local vs API break-even.
Use these while you read
Tools that pair with this guide
- AI Monthly Cost BudgeterList every AI subscription and API spend, set a budget, see your over/under at a glance. Free tracker for ChatGPT, Claude, Gemini, Cursor, and more.AI & Prompt Tools
- AI Prompt GeneratorTurn a vague idea into a structured prompt. Pick role, task, context, constraints, and output format. Works with ChatGPT, Claude, and Gemini.AI & Prompt Tools
- AI Prompt LibraryBrowse a curated catalog of prompt templates for writing, coding, marketing, and research. One click to copy.AI & Prompt Tools
- Custom GPT & Claude Project Prompt BuilderBuild a full custom GPT or Claude Project prompt with persona, rules, examples, and output schema. One copy-paste block for ChatGPT, Claude Projects, and assistants.AI & Prompt Tools
Advertisement
Continue reading
- AI & LLMsGitHub Copilot Pricing and ComparisonCompare free vs paid GitHub Copilot tiers and analyze it against ChatGPT, Cursor, and Tabnine. Find the best value plan instantly with this free online guide.
- AI & LLMsGitHub Copilot Features and CapabilitiesTest what Copilot really does — code accuracy, scope limits, debugging, web dev, legacy code, tests, docs, team customization. Free guide, no sign-up.
- AI & LLMsGitHub Copilot Security and Data HandlingAudit where your code goes, who sees it, training-data policy, network needs, and what happens when Copilot suggests broken code. Free, no sign-up.
- AI & LLMsAI Fluency SkillsThe 8 sub-skills of AI fluency: prompt structure, model selection, tool use, quality calibration, iteration, context management, cost awareness, privacy.
- AI & LLMsAnthropic Skills ExplainedSkills as Anthropic's answer to Custom GPTs — markdown-defined, version-controlled in git, work in terminal. Anatomy + Skills vs Custom GPTs.
- AI & LLMsKimi K2 vs DeepSeek V3Two open-weight Chinese flagships. Kimi K2 = 1M context, DeepSeek V3.2 = top-tier reasoning + coding. Pick by use case.