Llama 3.1 405B API Pricing

Lowest
$0.6897
from RunAPI·/ 1M tokens
Use Now →
7 providers·128K Context·7 live(Updated 2h ago)

Llama 3.1 405B has 7 API providers. The lowest input price is $0.6897/1M tokens from RunAPI. 7 providers auto-refresh every 6 h (Updated 2h ago).

No first-party price is tracked for Llama 3.1 405B on this page yet — below is a comparison of 7 hosting providers: prices range from $0.6897 to $6.0000, a 770% spread. Beyond price, hosts differ meaningfully in throughput, rate limits, and reliability, so a small trial run is worth it before committing.

Llama 3.1 405B supports a 128K context window. At today's lowest rate, processing 1M input + 1M output tokens costs about $1.3793. The table below lists current per-provider quotes.

Compare Llama 3.1 405B API Prices Across Providers

ProviderInput / 1MAPI
Relay / Aggregator
RunAPIRelayLowest🇨🇳支付宝微信Live
$0.6897
Try →
ProAI APIRelay🇨🇳支付宝微信Live
$0.8276
Try →
Fireworks AIRelay🌐VisaMCLive
$0.9000
Try →
YunWu AIRelay🇨🇳支付宝微信MediumLive
$2.0483
Try →
CrazyRouterRelay🌐Live
$6.0000
Try →

Frequently Asked Questions

How is Llama 3.1 405B API pricing calculated?

LLM APIs are billed per 1M input and output tokens separately. Official providers set the base price; relay providers typically offer 20–80% discounts.

What is the cheapest way to access Llama 3.1 405B API?

The lowest current input price is $0.6897 per 1M tokens from RunAPI. Prices update in real time — bookmark this page for the latest rates.

Can I use Llama 3.1 405B API from China?

Providers marked 🇨🇳 in the table support China-mainland access. Check each provider's documentation for details.

How often is Llama 3.1 405B API pricing updated?

7 sources on this page are live-scraped every ~6 hours (Updated 2h ago).

How reliable is the Llama 3.1 405B API pricing data?

Live-scraped prices come directly from provider APIs and are generally accurate. Manually maintained prices are sourced from official pricing pages and periodically verified. For critical decisions, confirm the latest price via the provider's official link.

How can I tell if a price is live-scraped or manually maintained?

Each provider row shows a small label next to the name: a green "Live" badge means prices are automatically fetched on a schedule; a gray "Manual" badge means human-curated. Live data not refreshed in 24+ hours turns amber.

Related

All Language Model APIsVideo Generation APIsAudio & Speech APIsImage Generation APIs
← Back to AI API Pricing