Gemini 3.5 Flash vs Llama 3.3 70B Instruct API Price Comparison

Prices updated: 2026-06-19 · 30 quotes

As of 2026-06-19, the official input price is $1.5000 per 1M tokens for Gemini 3.5 Flash and $0.5900 per 1M tokens for Llama 3.3 70B Instruct, which makes Llama 3.3 70B Instruct ~61% cheaper on input. Via relay providers, the lowest input rate is $0.0621 per 1M tokens for Gemini 3.5 Flash (TreeRouter) and $0.0993 per 1M tokens for Llama 3.3 70B Instruct (ProAI API). Context windows: Gemini 3.5 Flash 1000K, Llama 3.3 70B Instruct 128K. Gemini 3.5 Flash supports image input. 30 provider quotes are tracked (Gemini 3.5 Flash: 14, Llama 3.3 70B Instruct: 16) and updated daily.

Gemini 3.5 Flash (Mid-tier)

Official: $1.5000 / 1M tokens

Best Price: $0.0621 / 1M tokens

14 providers · 1000K Context · 👁 Vision

Llama 3.3 70B Instruct (Mid-tier)

Official: $0.5900 / 1M tokens

Best Price: $0.0993 / 1M tokens

16 providers · 128K Context

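At official input prices, Llama 3.3 70B Instruct is ~61% cheaper. At the best relay input price, Gemini 3.5 Flash is ~37% cheaper. On output tokens, Llama 3.3 70B Instruct is cheaper at both official and best relay rates.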
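The percentages above follow from a relative-difference calculation on the input prices. A minimal sketch in Python, where the function name `pct_cheaper` is an illustrative choice and not part of any provider API:

```python
def pct_cheaper(expensive: float, cheap: float) -> float:
    """Return how much cheaper `cheap` is than `expensive`, as a percentage."""
    if expensive <= 0:
        raise ValueError("expensive price must be positive")
    return (expensive - cheap) / expensive * 100


# Input prices in USD per 1M tokens, copied from the comparison above.
official_gap = pct_cheaper(1.5000, 0.5900)  # Llama vs Gemini at official rates
relay_gap = pct_cheaper(0.0993, 0.0621)     # Gemini vs Llama at best relay rates

print(f"Official: Llama 3.3 70B Instruct is ~{official_gap:.0f}% cheaper")
print(f"Relay: Gemini 3.5 Flash is ~{relay_gap:.0f}% cheaper")
```

The same function can be applied to the output prices to compare the two models on output tokens.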

Price Comparison

	Gemini 3.5 Flash	Llama 3.3 70B Instruct
Official input / 1M tokens	$1.5000	$0.5900 ✓
Official output / 1M tokens	$9.0000	$0.9900 ✓
Cheapest input / 1M tokens	$0.0621 ✓	$0.0993
Cheapest output / 1M tokens	$0.3724	$0.0993 ✓
Providers	14	16
Context	1000K	128K
Vision	✓	—
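Because input and output tokens are priced differently, which model is cheaper depends on the workload mix. The sketch below estimates total cost from the table's prices. The names `PRICES` and `workload_cost` and the 10M-input / 2M-output token mix are assumptions chosen for illustration. The cheapest input and cheapest output rates may come from different providers, so the "cheapest" figure is best read as a lower bound.

```python
# Prices in USD per 1M tokens as (input, output), copied from the table above.
PRICES = {
    "Gemini 3.5 Flash": {
        "official": (1.5000, 9.0000),
        "cheapest": (0.0621, 0.3724),
    },
    "Llama 3.3 70B Instruct": {
        "official": (0.5900, 0.9900),
        "cheapest": (0.0993, 0.0993),
    },
}


def workload_cost(model: str, tier: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of a workload at the given price tier."""
    in_price, out_price = PRICES[model][tier]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


# Hypothetical workload: 10M input tokens and 2M output tokens.
INPUT_TOKENS = 10_000_000
OUTPUT_TOKENS = 2_000_000

for model in PRICES:
    for tier in ("official", "cheapest"):
        cost = workload_cost(model, tier, INPUT_TOKENS, OUTPUT_TOKENS)
        print(f"{model:<24} {tier:<9} ${cost:.4f}")
```

With this assumed mix, Llama 3.3 70B Instruct comes out cheaper at both tiers, even though Gemini 3.5 Flash has the lower best-case input rate, because the output price dominates the total. An input-heavy workload could shift the result at relay prices.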

Gemini 3.5 Flash — All Providers


Provider	Input / 1M	Output / 1M	vs
Google AI (Official · 🌐 · Visa · MC · GCP · 👁 · Live)	$1.5000	$9.0000	Base


Frequently Asked Questions

Which is cheaper, Gemini 3.5 Flash or Llama 3.3 70B Instruct?

What context window do Gemini 3.5 Flash and Llama 3.3 70B Instruct support?

Do Gemini 3.5 Flash and Llama 3.3 70B Instruct support image input?

What is the cheapest way to access Gemini 3.5 Flash or Llama 3.3 70B Instruct?