Complete pricing for Gemini 2.5 Pro, Flash, and 1.5 models, with free tier details, real cost scenarios, and a comparison to OpenAI and Claude.
Google offers a generous free tier for the Gemini API through Google AI Studio. No credit card required to start.
| Model | Free Requests/Day | Free RPM Limit | Free TPM Limit |
|---|---|---|---|
| Gemini 2.5 Flash (Popular) | 1,500/day | 10 RPM | 1M TPM |
| Gemini 2.5 Pro (Most Capable) | 50/day | 2 RPM | 32K TPM |
| Gemini 1.5 Flash | 1,500/day | 15 RPM | 1M TPM |
| Gemini 1.5 Pro | 50/day | 2 RPM | 32K TPM |
| Gemini 1.0 Pro | Unlimited | 15 RPM | 32K TPM |
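To check whether a planned workload stays inside these limits, the table can be encoded as data. The following is a minimal sketch, not an official client: the dictionary keys are illustrative labels rather than confirmed API model IDs, `fits_free_tier` is a hypothetical helper, and "32K" is assumed to mean 32,000 tokens.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class FreeTierLimit:
    requests_per_day: Optional[int]  # None stands for "Unlimited" in the table
    rpm: int                         # requests per minute
    tpm: int                         # tokens per minute


# Values copied from the free tier table; keys are illustrative labels only.
FREE_TIER = {
    "gemini-2.5-flash": FreeTierLimit(1_500, 10, 1_000_000),
    "gemini-2.5-pro": FreeTierLimit(50, 2, 32_000),
    "gemini-1.5-flash": FreeTierLimit(1_500, 15, 1_000_000),
    "gemini-1.5-pro": FreeTierLimit(50, 2, 32_000),
    "gemini-1.0-pro": FreeTierLimit(None, 15, 32_000),
}


def fits_free_tier(model: str, requests_per_day: int, peak_rpm: int, peak_tpm: int) -> bool:
    """Return True if the workload stays within every free tier limit for the model."""
    limit = FREE_TIER[model]
    within_daily = (
        limit.requests_per_day is None or requests_per_day <= limit.requests_per_day
    )
    return within_daily and peak_rpm <= limit.rpm and peak_tpm <= limit.tpm
```

For example, `fits_free_tier("gemini-2.5-pro", 51, 1, 1_000)` is `False` because the daily request cap of 50 is exceeded, even though the per-minute limits are respected.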
Once you exceed free tier limits or need higher rate limits, billing is per million tokens. Enable billing in Google AI Studio or use Vertex AI.
| Model | Input (per MTok) | Output (per MTok) | Context Window |
|---|---|---|---|
| Gemini 2.5 Flash (Popular; text/image/audio/video) | $0.075 | $0.30 | 1M tokens |
| Gemini 2.5 Flash (Thinking; complex reasoning mode) | $0.075 | $3.50 (thinking tokens) | 1M tokens |
| Gemini 2.5 Pro (Most Capable; ≤200K context) | $1.25 | $10.00 | 1M tokens |
| Gemini 2.5 Pro (>200K context) | $2.50 | $15.00 | 1M tokens |
| Gemini 1.5 Flash (≤128K context) | $0.075 | $0.30 | 1M tokens |
| Gemini 1.5 Pro (≤128K context) | $1.25 | $5.00 | 2M tokens |
| Gemini Embedding 004 | $0.00 | N/A | 2K tokens/request |
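Per-million-token billing translates into a per-request cost as in the sketch below. It covers only the Gemini 2.5 rows of the table above and assumes that the 2.5 Pro tier is chosen by the size of the prompt (input tokens) in a single request; the table does not spell that out. The model labels and the `request_cost` helper are illustrative names, not part of any SDK.

```python
# (input, output) prices in USD per million tokens, copied from the table above.
PRICES = {
    "gemini-2.5-flash": (0.075, 0.30),
    "gemini-2.5-pro-short": (1.25, 10.00),  # prompts of 200K tokens or fewer
    "gemini-2.5-pro-long": (2.50, 15.00),   # prompts above 200K tokens
}

# Assumption: the 200K boundary applies to the input tokens of one request.
PRO_TIER_THRESHOLD = 200_000


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request at the listed prices."""
    if model == "gemini-2.5-pro":
        tier = "long" if input_tokens > PRO_TIER_THRESHOLD else "short"
        model = f"gemini-2.5-pro-{tier}"
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000
```

Under these assumptions, a 2.5 Pro request with 100,000 input tokens and 2,000 output tokens costs about $0.145, while the same request with 300,000 input tokens costs about $0.78 because both rates move to the higher tier.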
| Provider / Model | Input (per MTok) | Output (per MTok) | Context |
|---|---|---|---|
| Gemini 2.5 Flash (Google) | $0.075 | $0.30 | 1M tokens |
| GPT-4o mini (OpenAI) | $0.15 | $0.60 | 128K |
| Claude 3.5 Haiku (Anthropic) | $0.80 | $4.00 | 200K |
| Gemini 2.5 Pro (Google) | $1.25 | $10.00 | 1M tokens |
| GPT-4o (OpenAI) | $2.50 | $10.00 | 128K |
| Claude 3.5 Sonnet (Anthropic) | $3.00 | $15.00 | 200K |
| o3 (OpenAI) | $10.00 | $40.00 | 200K |
| Claude 3 Opus (Anthropic) | $15.00 | $75.00 | 200K |
Gemini 2.5 Flash is the cheapest option for most tasks: at list prices it costs half as much as GPT-4o mini and roughly a tenth as much as Claude 3.5 Haiku. For a full feature and benchmark comparison, see OpenAI vs Claude API pricing.
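Because input and output tokens are priced differently, the ranking for a specific workload is easier to see when computed. The sketch below uses only the list prices from the comparison table; `workload_cost` and `rank_by_cost` are hypothetical helpers, and the calculation ignores the higher Gemini 2.5 Pro rate for prompts above 200K tokens as well as any caching or batch discounts.

```python
# (input, output) list prices in USD per million tokens, from the comparison table.
COMPARISON = {
    "Gemini 2.5 Flash": (0.075, 0.30),
    "GPT-4o mini": (0.15, 0.60),
    "Claude 3.5 Haiku": (0.80, 4.00),
    "Gemini 2.5 Pro": (1.25, 10.00),
    "GPT-4o": (2.50, 10.00),
    "Claude 3.5 Sonnet": (3.00, 15.00),
    "o3": (10.00, 40.00),
    "Claude 3 Opus": (15.00, 75.00),
}


def workload_cost(model: str, input_mtok: float, output_mtok: float) -> float:
    """USD cost of a workload measured in millions of input and output tokens."""
    input_price, output_price = COMPARISON[model]
    return input_mtok * input_price + output_mtok * output_price


def rank_by_cost(input_mtok: float, output_mtok: float) -> list:
    """Return (model, cost) pairs sorted from cheapest to most expensive."""
    costs = {m: workload_cost(m, input_mtok, output_mtok) for m in COMPARISON}
    return sorted(costs.items(), key=lambda item: item[1])


if __name__ == "__main__":
    # Example workload: 50M input tokens and 10M output tokens per month.
    for model, cost in rank_by_cost(50, 10):
        print(f"{model:<20} ${cost:,.2f}")
```

For the example workload, Gemini 2.5 Flash comes to about $6.75 against about $13.50 for GPT-4o mini and about $80 for Claude 3.5 Haiku.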
| Use Case | Recommendation | Reason |
|---|---|---|
| Side project / prototype | Gemini 2.5 Flash (Free) | 1,500 req/day free, so shipping v1 costs nothing |
| Cost-sensitive production app | Gemini 2.5 Flash (Paid) | $0.075/MTok input is the lowest price among the providers compared above |
| Long document processing | Gemini 2.5 Pro | 1M token context window handles entire books/codebases |
| Multimodal (image + video + text) | Gemini 2.5 Flash | Native multimodal at same price as text-only tasks |
| Google Cloud / GCP integration | Vertex AI (Gemini) | Single billing, VPC, enterprise SLA, IAM integration |
| Enterprise compliance (EU data residency) | Vertex AI + region selection | Vertex AI supports regional data residency; AI Studio doesn't |
| Date | Change | Impact |
|---|---|---|
| Dec 2023 | Gemini Pro launches (API) | Free in preview; first public Gemini API access |
| May 2024 | Gemini 1.5 Flash launches | $0.35/MTok input, 10x cheaper than Gemini 1.5 Pro |
| Jul 2024 | Gemini 1.5 Flash price cut | Cut from $0.35 to $0.075/MTok input, a 79% reduction |
| Sep 2024 | Free tier expanded | 1,500 req/day free (was 60); 1M TPM free added |
| Feb 2025 | Gemini 2.0 Flash launches | Same price as 1.5 Flash, significantly better quality |
| Mar 2025 | Gemini 2.5 Pro launches | $1.25/MTok input; top benchmark performance, competitive price |
| Apr 2025 | Gemini 2.5 Flash launches | Replaces 2.0 Flash; same price, 2.5 quality with thinking mode |
| Ongoing | Context caching added | Cached tokens 4x cheaper; major savings for repeated prompts |
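The two numeric claims in this timeline can be checked with a little arithmetic. The sketch below assumes that "4x cheaper" means cached input tokens are billed at one quarter of the normal input price, and it ignores any cache storage fee, which the table does not mention. Both function names are illustrative.

```python
def percent_reduction(old_price: float, new_price: float) -> float:
    """Percentage drop from old_price to new_price."""
    return (old_price - new_price) / old_price * 100


def input_cost_with_cache(
    input_tokens: int,
    cached_fraction: float,
    price_per_mtok: float,
    cache_discount: float = 4.0,  # assumption: cached tokens cost price / 4
) -> float:
    """USD input cost when a fraction of the prompt tokens is served from cache."""
    if not 0.0 <= cached_fraction <= 1.0:
        raise ValueError("cached_fraction must be between 0 and 1")
    cached = input_tokens * cached_fraction
    fresh = input_tokens - cached
    total = fresh * price_per_mtok + cached * price_per_mtok / cache_discount
    return total / 1_000_000
```

With these assumptions, `percent_reduction(0.35, 0.075)` is about 78.6, which rounds to the 79% quoted for the July 2024 cut. One million input tokens at $0.075/MTok with 80% of them cached cost about $0.03 instead of $0.075, a 60% saving on input.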