AI & LLMs · Guide · AI & Prompt Tools
DeepSeek Pricing Explained (2026)
DeepSeek V3.2 at $0.27/$1.10, R1 at $0.55/$2.19, off-peak 50% off, free chat, and what to know about privacy + self-host.
DeepSeek pricing in 2026 is the cheapest frontier-class API on the market. V3.2 sits at $0.27/$1.10 per 1M tokens; off-peak hours drop to $0.135/$0.55. R1 reasoning sits at $0.55/$2.19. Plus the consumer chat is free. Here’s the full breakdown.
Advertisement
Consumer chat
- chat.deepseek.com — free. V3.2 + R1 access. No account required for light use.
API pricing (per 1M tokens, USD)
- DeepSeek V3.2 (chat): $0.27 input / $1.10 output. Cache miss; cache hit $0.027 (90% off cached).
- DeepSeek V3.2 off-peak (UTC 16:30-00:30): $0.135 / $0.55. Half off.
- DeepSeek R1 (reasoning): $0.55 / $2.19. Off-peak: $0.275 / $1.10.
- R1 reasoner output: includes thinking tokens at output rate — budget for ~5x the visible answer length.
What you get
- OpenAI-compatible SDK — drop-in replacement (
base_url="https://api.deepseek.com"). - Tool use, JSON mode, structured outputs.
- 128k context window.
- Open weights for self-host.
The cost story vs competitors
- vs Claude Sonnet 4.6 ($3 / $15): ~10x cheaper.
- vs GPT-5 ($2.50 / $10): ~9x cheaper.
- vs Gemini 2.5 Pro ($1.25 / $5): ~5x cheaper.
- R1 vs o-pro reasoning: ~30x cheaper.
Privacy realism
The DeepSeek cloud API routes through Chinese infrastructure. For most non-sensitive workloads this is fine; for regulated data (HIPAA, SOC 2 customers) most teams self-host the open weights instead. V3.2 is large (671B MoE) so a Hyperspace pod or rented cloud GPU is needed; smaller distilled versions run on commodity hardware.
When DeepSeek wins
- High-volume API workloads where total cost matters.
- Agentic loops at scale.
- Embedding pre-processing pipelines.
- Reasoning chains where R1’s thinking-token economics make it cheap to over-think.
- Anyone willing to self-host the open weights for privacy.
When DeepSeek isn’t the right pick
- The hardest 5% of SWE-bench tasks — Claude Opus opens a real lead.
- 30+ step agents where reliability dominates — Claude / GPT-5 still meaningfully ahead.
- Customer-facing English work where the marginal quality and tone calibration matter.
Compare: Claude vs DeepSeek, DeepSeek R1 vs Claude. Cost math: cost calculator.
Advertisement