Llama 3.1 8B Instruct vs Gemini 2.5 Flash Lite API Price Comparison

Prices updated: 2026-06-19·31 quotes

As of 2026-06-19: At official rates, Llama 3.1 8B Instruct is $0.1000/1M input tokens and Gemini 2.5 Flash Lite is $0.1000 — Gemini 2.5 Flash Lite is ~0% cheaper than Llama 3.1 8B Instruct. Via relay providers, the lowest Llama 3.1 8B Instruct rate is $0.0041 (RunAPI) and Gemini 2.5 Flash Lite is $0.0041 (TreeRouter). Context windows: Llama 3.1 8B Instruct 128K, Gemini 2.5 Flash Lite 1000K. Gemini 2.5 Flash Lite supports image input. 31 provider quotes tracked (Llama 3.1 8B Instruct: 15, Gemini 2.5 Flash Lite: 16), updated daily.

Llama 3.1 8B InstructBudget

Official

$0.1000

/ 1M tokens

Best Price

$0.0041

/ 1M tokens

15 providers · 128K Context

Gemini 2.5 Flash LiteBudget

Official

$0.1000

/ 1M tokens

Best Price

$0.0041

/ 1M tokens

16 providers · 1000K Context · 👁 Vision

Price Comparison

	Llama 3.1 8B Instruct	Gemini 2.5 Flash Lite
Official input / 1M tokens	$0.1000	$0.1000
Official output / 1M tokens	$0.1000 ✓	$0.4000
Cheapest input / 1M tokens	$0.0041	$0.0041
Cheapest output / 1M tokens	$0.0062 ✓	$0.0166
providers	15	16
Context	128K	1000K
Vision	—	✓

Llama 3.1 8B Instruct — All Providers

Detail page →

ProviderInput / 1MOutput / 1MvsAPI

Official

CerebrasOfficial🌐VisaMCLive

$0.1000

$0.1000

Base

Llama 3.1 8B Instruct vs Gemini 2.5 Flash Lite API Price Comparison

Price Comparison

Llama 3.1 8B Instruct — All Providers

Gemini 2.5 Flash Lite — All Providers

Frequently Asked Questions

Which is cheaper, Llama 3.1 8B Instruct or Gemini 2.5 Flash Lite?

What context window do Llama 3.1 8B Instruct and Gemini 2.5 Flash Lite support?

Do Llama 3.1 8B Instruct and Gemini 2.5 Flash Lite support image input?

What is the cheapest way to access Llama 3.1 8B Instruct or Gemini 2.5 Flash Lite?