Llama 3.1 8B

Lowest
$0.1000
from SambaNova·/ 1M tokens
Use Now →
2 providers·128K Context·2 live(Updated 50m ago)

Llama 3.1 8B has 2 API providers. The lowest input price is $0.1000/1M tokens from SambaNova. 2 providers auto-refresh every 6 h (Updated 50m ago).

No first-party price is tracked for Llama 3.1 8B on this page yet — below is a comparison of 2 hosting providers: prices range from $0.1000 to $0.2000, a 100% spread. Beyond price, hosts differ meaningfully in throughput, rate limits, and reliability, so a small trial run is worth it before committing.

Llama 3.1 8B supports a 128K context window. At today's lowest rate, processing 1M input + 1M output tokens costs about $0.3000. The table below lists current per-provider quotes.

ProviderInput / 1MAPI
Relay / Aggregator
SambaNovaRelayLowest🌐VisaMCLive
$0.1000
Try →
Fireworks AIRelay🌐VisaMCLive
$0.2000
Try →

Frequently Asked Questions

How is Llama 3.1 8B API pricing calculated?

LLM APIs are billed per 1M input and output tokens separately. Official providers set the base price; relay providers typically offer 20–80% discounts.

What is the cheapest way to access Llama 3.1 8B API?

The lowest current input price is $0.1000 per 1M tokens from SambaNova. Prices update in real time — bookmark this page for the latest rates.

Can I use Llama 3.1 8B API from China?

Providers marked 🇨🇳 in the table support China-mainland access. Check each provider's documentation for details.

How often is Llama 3.1 8B API pricing updated?

2 sources on this page are live-scraped every ~6 hours (Updated 50m ago).

How reliable is the Llama 3.1 8B API pricing data?

Live-scraped prices come directly from provider APIs and are generally accurate. Manually maintained prices are sourced from official pricing pages and periodically verified. For critical decisions, confirm the latest price via the provider's official link.

How can I tell if a price is live-scraped or manually maintained?

Each provider row shows a small label next to the name: a green "Live" badge means prices are automatically fetched on a schedule; a gray "Manual" badge means human-curated. Live data not refreshed in 24+ hours turns amber.

Related

All Language Model APIsVideo Generation APIsAudio & Speech APIsImage Generation APIs
← Back to AI API Pricing