Llama 3.1 8B Instruct vs Gemini 2.5 Flash Lite API Price Comparison

Prices updated: ·31 quotes

As of : At official rates, Llama 3.1 8B Instruct is $0.1000/1M input tokens and Gemini 2.5 Flash Lite is $0.1000 — Gemini 2.5 Flash Lite is ~0% cheaper than Llama 3.1 8B Instruct. Via relay providers, the lowest Llama 3.1 8B Instruct rate is $0.0041 (RunAPI) and Gemini 2.5 Flash Lite is $0.0041 (TreeRouter). Context windows: Llama 3.1 8B Instruct 128K, Gemini 2.5 Flash Lite 1000K. Gemini 2.5 Flash Lite supports image input. 31 provider quotes tracked (Llama 3.1 8B Instruct: 15, Gemini 2.5 Flash Lite: 16), updated daily.
Llama 3.1 8B InstructBudget
Official
$0.1000
/ 1M tokens
Best Price
$0.0041
/ 1M tokens
15 providers · 128K Context
Gemini 2.5 Flash LiteBudget
Official
$0.1000
/ 1M tokens
Best Price
$0.0041
/ 1M tokens
16 providers · 1000K Context · 👁 Vision

Price Comparison

Llama 3.1 8B InstructGemini 2.5 Flash Lite
Official input / 1M tokens$0.1000$0.1000
Official output / 1M tokens$0.1000$0.4000
Cheapest input / 1M tokens$0.0041$0.0041
Cheapest output / 1M tokens$0.0062$0.0166
providers1516
Context128K1000K
Vision

Llama 3.1 8B Instruct — All Providers

Detail page →
ProviderInput / 1MAPI
Official
CerebrasOfficial🌐VisaMCLive
$0.1000
Try →
Relay / Aggregator
RunAPIRelayLowest🇨🇳支付宝微信⚠️ Unusually lowLive
$0.0041
Try →
OpenRouterRelay🌐VisaMCCryptoLive
$0.0200
Try →
Novita AIRelay🌐VisaMCLive
$0.0200
Try →
Nebius AIRelay🌐Manual
$0.0200
Try →
Cloudflare Workers AIRelay🌐VisaMCLive
$0.0450
Try →
Neets.aiRelay🌐VisaMCManual
$0.0500
Try →

Gemini 2.5 Flash Lite — All Providers

Detail page →
ProviderInput / 1MAPI
Official
Google Vertex AIOfficial🌐VisaMCGCPManual
$0.1000
Try →
Google AIOfficial🌐VisaMCGCP👁Live
$0.1000
Try →
Relay / Aggregator
TreeRouterRelayLowest🇨🇳支付宝微信⚠️ Unusually lowLive
$0.0041
Try →
RunAPIRelay🇨🇳支付宝微信Live
$0.0059
Try →
LaoZhang APIRelay🇨🇳支付宝微信Live
$0.0138
Try →
UiUiAPIRelay🇨🇳支付宝微信VisaMCLive
$0.0138
Try →
PoloAPIRelay🇨🇳支付宝微信Live
$0.0138
Try →
EasyRouterRelay🇨🇳支付宝微信Live
$0.0138
Try →

Frequently Asked Questions

Which is cheaper, Llama 3.1 8B Instruct or Gemini 2.5 Flash Lite?

Official pricing for these two models cannot be directly compared. See the live provider tables below.

What context window do Llama 3.1 8B Instruct and Gemini 2.5 Flash Lite support?

Llama 3.1 8B Instruct has a 128K token context window; Gemini 2.5 Flash Lite has 1000K tokens.

Do Llama 3.1 8B Instruct and Gemini 2.5 Flash Lite support image input?

Llama 3.1 8B Instruct does not support image input; Gemini 2.5 Flash Lite supports image input.

What is the cheapest way to access Llama 3.1 8B Instruct or Gemini 2.5 Flash Lite?

Llama 3.1 8B Instruct's lowest input price is $0.0041/1M tokens (RunAPI). Gemini 2.5 Flash Lite's is $0.0041/1M tokens (TreeRouter). Relay providers typically offer lower prices with different SLA terms.

Related Comparisons

llama-3-1-8b-instruct vs llama-4-scoutllama-3-1-8b-instruct vs gpt-4o-minillama-3-1-8b-instruct vs deepseek-v4-flashgpt-5-4-mini vs gemini-2-5-flash-liteclaude-haiku-4-5 vs gemini-2-5-flash-litegemini-2-5-flash vs gemini-2-5-flash-litegemini-2-5-flash-lite vs deepseek-v4-flashgemini-2-5-flash-lite vs gemini-2-0-flash-lite

Related

Llama 3.1 8B Instruct API PricesGemini 2.5 Flash Lite API PricesAll Language Model APIs
← Back to AI API Pricing