# llmcalc.app full pricing snapshot

> All prices USD per 1M tokens. Verified against official provider pages; dataset last updated 2026-06-12. Machine-readable version: https://llmcalc.app/api/pricing.json

## anthropic

- Claude Fable 5 (claude-fable-5): input $10, output $50, cache read $1, cache write 5m $12.5. Context 1000k. Verified 2026-06-12.
- Claude Opus 4.8 (claude-opus-4-8): input $5, output $25, cache read $0.5, cache write 5m $6.25. Context 1000k. Verified 2026-06-12.
- Claude Opus 4.7 (claude-opus-4-7): input $5, output $25, cache read $0.5, cache write 5m $6.25. Context 1000k. Verified 2026-06-12.
- Claude Sonnet 4.6 (claude-sonnet-4-6): input $3, output $15, cache read $0.3, cache write 5m $3.75. Context 1000k. Verified 2026-06-12.
- Claude Haiku 4.5 (claude-haiku-4-5): input $1, output $5, cache read $0.1, cache write 5m $1.25. Context 200k. Verified 2026-06-12.

## openai

- GPT-5.5 (gpt-5-5): input $5, output $30, cache read $0.5. Context 1000k. Verified 2026-06-12.
- GPT-5.4 (gpt-5-4): input $2.5, output $15, cache read $0.25. Context 1000k. Verified 2026-06-12.
- GPT-5.4 mini (gpt-5-4-mini): input $0.75, output $4.5, cache read $0.075. Context 400k. Verified 2026-06-12.
- GPT-5.4 nano (gpt-5-4-nano): input $0.2, output $1.25, cache read $0.02. Context 400k. Verified 2026-06-12.
- GPT-4o (gpt-4o): input $2.5, output $10, cache read $1.25. Context 128k. Verified 2026-05-06. [DELISTED from provider's public price list; last verified rates kept for reference]
- GPT-4o mini (gpt-4o-mini): input $0.15, output $0.6, cache read $0.075. Context 128k. Verified 2026-05-06. [DELISTED from provider's public price list; last verified rates kept for reference]
- OpenAI o3 (o3): input $2, output $8, cache read $0.5. Context 200k. Verified 2026-05-06. [DELISTED from provider's public price list; last verified rates kept for reference]
- OpenAI o3-mini (o3-mini): input $1.1, output $4.4, cache read $0.55. Context 200k. Verified 2026-05-06. [DELISTED from provider's public price list; last verified rates kept for reference]

## google

- Gemini 3.1 Pro (Preview) (gemini-3-1-pro): input $2, output $12, cache read $0.2, above 200k: input $4, output $18. Context 1049k. Verified 2026-06-12.
- Gemini 3.5 Flash (gemini-3-5-flash): input $1.5, output $9, cache read $0.15. Context 1000k. Verified 2026-06-12.
- Gemini 3.1 Flash-Lite (gemini-3-1-flash-lite): input $0.25, output $1.5, cache read $0.025. Context 1000k. Verified 2026-06-12.
- Gemini 2.5 Pro (gemini-2-5-pro): input $1.25, output $10, cache read $0.125, above 200k: input $2.5, output $15. Context 2000k. Verified 2026-06-12.
- Gemini 2.5 Flash (gemini-2-5-flash): input $0.3, output $2.5, cache read $0.03. Context 1000k. Verified 2026-06-12.

## groq

- Llama 4 Scout (Groq) (llama-4-scout): input $0.11, output $0.34. Context 128k. Verified 2026-06-12.
- Llama 3.3 70B Versatile (Groq) (llama-3-3-70b): input $0.59, output $0.79. Context 128k. Verified 2026-06-12.

## Claude consumer plans (for Claude Code)

- Claude Pro: $20/mo. Casual builders, weekend projects, light daily Claude Code use.
- Claude Max 5x: $100/mo. Daily developers who use Claude Code heavily.
- Claude Max 20x: $200/mo. Very heavy users, long sessions, frequent limit frustration.
- Anthropic API: usage-based. Automation, CI, background agents, predictable usage-based workflows.

Plan prices verified 2026-06-12 against https://claude.com/pricing. Anthropic Pro is also $17/mo billed annually.

To compute whether a Claude Code plan beats the API for a specific user, use https://llmcalc.app/claude-code-cost with their ccusage output.