Llama 3.1 8B Instruct vs GPT-4o mini API Price Comparison

Prices updated: ·50 quotes

As of : At official rates, Llama 3.1 8B Instruct is $0.1000/1M input tokens and GPT-4o mini is $0.1500 — Llama 3.1 8B Instruct is ~33% cheaper than GPT-4o mini. Via relay providers, the lowest Llama 3.1 8B Instruct rate is $0.0041 (RunAPI) and GPT-4o mini is $0.0062 (TreeRouter). Context windows: Llama 3.1 8B Instruct 128K, GPT-4o mini 131K. GPT-4o mini supports image input. 50 provider quotes tracked (Llama 3.1 8B Instruct: 15, GPT-4o mini: 35), updated daily.
Llama 3.1 8B InstructBudget
Official
$0.1000
/ 1M tokens
Best Price
$0.0041
/ 1M tokens
15 providers · 128K Context
GPT-4o miniBudget
Official
$0.1500
/ 1M tokens
Best Price
$0.0062
/ 1M tokens
35 providers · 131K Context · 👁 Vision
At official prices, Llama 3.1 8B Instruct is ~33% cheaper. At the best relay price, Llama 3.1 8B Instruct is ~33% cheaper.

Price Comparison

Llama 3.1 8B InstructGPT-4o mini
Official input / 1M tokens$0.1000$0.1500
Official output / 1M tokens$0.1000$0.6000
Cheapest input / 1M tokens$0.0041$0.0062
Cheapest output / 1M tokens$0.0062$0.0031
providers1535
Context128K131K
Vision

Llama 3.1 8B Instruct — All Providers

Detail page →
ProviderInput / 1MAPI
Official
CerebrasOfficial🌐VisaMCLive
$0.1000
Try →
Relay / Aggregator
RunAPIRelayLowest🇨🇳支付宝微信⚠️ Unusually lowLive
$0.0041
Try →
OpenRouterRelay🌐VisaMCCryptoLive
$0.0200
Try →
Novita AIRelay🌐VisaMCLive
$0.0200
Try →
Nebius AIRelay🌐Manual
$0.0200
Try →
Cloudflare Workers AIRelay🌐VisaMCLive
$0.0450
Try →
Neets.aiRelay🌐VisaMCManual
$0.0500
Try →

GPT-4o mini — All Providers

Detail page →
ProviderInput / 1MAPI
Official
Azure OpenAIOfficial🌐Manual
$0.1500
Try →
Relay / Aggregator
TreeRouterRelayLowest🇨🇳支付宝微信⚠️ Unusually lowLive
$0.0062
Try →
RunAPIRelay🇨🇳支付宝微信Live
$0.0083
Try →
ProAI APIRelay🇨🇳支付宝微信Live
$0.0108
Try →
UiUiAPIRelay🇨🇳支付宝微信VisaMCLive
$0.0207
Try →
LaoZhang APIRelay🇨🇳支付宝微信Live
$0.0207
Try →
PoloAPIRelay🇨🇳支付宝微信Live
$0.0207
Try →

Frequently Asked Questions

Which is cheaper, Llama 3.1 8B Instruct or GPT-4o mini?

At official price, Llama 3.1 8B Instruct is cheaper by ~33%. At the best relay price, Llama 3.1 8B Instruct with a ~33% difference.

What context window do Llama 3.1 8B Instruct and GPT-4o mini support?

Llama 3.1 8B Instruct has a 128K token context window; GPT-4o mini has 131K tokens.

Do Llama 3.1 8B Instruct and GPT-4o mini support image input?

Llama 3.1 8B Instruct does not support image input; GPT-4o mini supports image input.

What is the cheapest way to access Llama 3.1 8B Instruct or GPT-4o mini?

Llama 3.1 8B Instruct's lowest input price is $0.0041/1M tokens (RunAPI). GPT-4o mini's is $0.0062/1M tokens (TreeRouter). Relay providers typically offer lower prices with different SLA terms.

Related Comparisons

llama-3-1-8b-instruct vs llama-4-scoutllama-3-1-8b-instruct vs gemini-2-5-flash-litellama-3-1-8b-instruct vs deepseek-v4-flashgpt-4o-mini vs claude-haiku-4-5gpt-4o-mini vs gemini-2-5-flashgpt-4o-mini vs gemini-2-5-flash-litegpt-4o-mini vs deepseek-v4-flashgpt-4o-mini vs gpt-5-4-mini

Related

Llama 3.1 8B Instruct API PricesGPT-4o mini API PricesAll Language Model APIs
← Back to AI API Pricing