Gemini 3.5 Flash vs Llama 3.3 70B Instruct API Price Comparison
Prices updated: ·30 quotes
As of : At official rates, Gemini 3.5 Flash is $1.5000/1M input tokens and Llama 3.3 70B Instruct is $0.5900 — Llama 3.3 70B Instruct is ~61% cheaper than Gemini 3.5 Flash. Via relay providers, the lowest Gemini 3.5 Flash rate is $0.0621 (TreeRouter) and Llama 3.3 70B Instruct is $0.0993 (ProAI API). Context windows: Gemini 3.5 Flash 1000K, Llama 3.3 70B Instruct 128K. Gemini 3.5 Flash supports image input. 30 provider quotes tracked (Gemini 3.5 Flash: 14, Llama 3.3 70B Instruct: 16), updated daily.
Gemini 3.5 FlashMid-tier
Official
$1.5000
/ 1M tokens
Best Price
$0.0621
/ 1M tokens
14 providers · 1000K Context · 👁 Vision
Llama 3.3 70B InstructMid-tier
Official
$0.5900
/ 1M tokens
Best Price
$0.0993
/ 1M tokens
16 providers · 128K Context
At official prices, Llama 3.3 70B Instruct is ~61% cheaper. At the best relay price, Gemini 3.5 Flash is ~37% cheaper.
Price Comparison
| Gemini 3.5 Flash | Llama 3.3 70B Instruct | |
|---|---|---|
| Official input / 1M tokens | $1.5000 | $0.5900 ✓ |
| Official output / 1M tokens | $9.0000 | $0.9900 ✓ |
| Cheapest input / 1M tokens | $0.0621 ✓ | $0.0993 |
| Cheapest output / 1M tokens | $0.3724 | $0.0993 ✓ |
| providers | 14 | 16 |
| Context | 1000K | 128K |
| Vision | ✓ | — |
Gemini 3.5 Flash — All Providers
Detail page →Llama 3.3 70B Instruct — All Providers
Detail page →Frequently Asked Questions
Which is cheaper, Gemini 3.5 Flash or Llama 3.3 70B Instruct?
At official price, Llama 3.3 70B Instruct is cheaper by ~61%. At the best relay price, Gemini 3.5 Flash with a ~37% difference.
What context window do Gemini 3.5 Flash and Llama 3.3 70B Instruct support?
Gemini 3.5 Flash has a 1000K token context window; Llama 3.3 70B Instruct has 128K tokens.
Do Gemini 3.5 Flash and Llama 3.3 70B Instruct support image input?
Gemini 3.5 Flash supports image input; Llama 3.3 70B Instruct does not support image input.
What is the cheapest way to access Gemini 3.5 Flash or Llama 3.3 70B Instruct?
Gemini 3.5 Flash's lowest input price is $0.0621/1M tokens (TreeRouter). Llama 3.3 70B Instruct's is $0.0993/1M tokens (ProAI API). Relay providers typically offer lower prices with different SLA terms.
Related Comparisons
Related