Fireworks AI API Pricing
🌍 GlobalPayment Methods
Fireworks AI is an AI API relay service. ComputeUnion tracks 60 price records for this platform (56 auto-scraped, Updated 4m ago; 4 manually maintained). Browse the categories below to compare Fireworks AI pricing across providers.
Model Pricing on Fireworks AI
Prices per 1M tokens
LLM53 models
| Model | Input | Output | Context | Updated |
|---|---|---|---|---|
| OpenAI GPT OSS 20B accounts/fireworks/models/gpt-oss-20b | $0.0700 | $0.3000 | 125K | Updated 4m ago |
| Phi-3 Mini 128K accounts/fireworks/models/phi-3-mini-128k-instruct | $0.1000 | $0.1000 | 125K | Updated 4m ago |
| Qwen3 0.6B accounts/fireworks/models/qwen3-0p6b | $0.1000 | $0.1000 | 32K | Updated 4m ago |
| Llama 3.2 3B accounts/fireworks/models/llama-v3p2-3b-instruct | $0.1000 | $0.1000 | 125K | Updated 4m ago |
| Llama 3.2 1B accounts/fireworks/models/llama-v3p2-1b-instruct | $0.1000 | $0.1000 | 125K | Updated 4m ago |
| Qwen3 1.7B accounts/fireworks/models/qwen3-1p7b | $0.1000 | $0.1000 | 32K | Updated 4m ago |
| Gemma 3 1B accounts/fireworks/models/gemma-3-1b-it | $0.1000 | $0.1000 | 32K | Updated 4m ago |
| DeepSeek V4 Flash accounts/deepseek-ai/models/deepseek-v4-flash | $0.1400 | $0.2800 | 125K | Updated 4m ago |
| OpenAI GPT OSS 120B accounts/fireworks/models/gpt-oss-120b | $0.1500 | $0.6000 | 125K | Updated 4m ago |
| Qwen2.5 14B accounts/fireworks/models/qwen2p5-14b-instruct | $0.2000 | $0.2000 | 125K | Updated 4m ago |
| DeepSeek R1 0528 Distill Qwen3 8B accounts/fireworks/models/deepseek-r1-0528-distill-qwen3-8b | $0.2000 | $0.2000 | 125K | Updated 4m ago |
| Gemma 3 4B accounts/fireworks/models/gemma-3-4b-it | $0.2000 | $0.2000 | 125K | Updated 4m ago |
| Gemma 3 12B accounts/fireworks/models/gemma-3-12b-it | $0.2000 | $0.2000 | 125K | Updated 4m ago |
| Qwen3 8B accounts/fireworks/models/qwen3-8b | $0.2000 | $0.2000 | 125K | Updated 4m ago |
| Qwen3 14B accounts/fireworks/models/qwen3-14b | $0.2000 | $0.2000 | 125K | Updated 4m ago |
| Qwen2.5 7B accounts/fireworks/models/qwen2p5-7b-instruct | $0.2000 | $0.2000 | 125K | Updated 4m ago |
| Qwen2 7B accounts/fireworks/models/qwen2-7b-instruct | $0.2000 | $0.2000 | 125K | Updated 4m ago |
| Mistral 7B v3 accounts/fireworks/models/mistral-7b-instruct-v3 | $0.2000 | $0.2000 | 32K | Updated 4m ago |
| DeepSeek R1 Distill Llama 8B accounts/fireworks/models/deepseek-r1-distill-llama-8b | $0.2000 | $0.2000 | 125K | Updated 4m ago |
| InternVL3 8B accounts/fireworks/models/internvl3-8b | $0.2000 | $0.2000 | 8K | Updated 4m ago |
| DeepSeek R1 Distill Qwen 14B accounts/fireworks/models/deepseek-r1-distill-qwen-14b | $0.2000 | $0.2000 | 64K | Updated 4m ago |
| Gemma 2 9B accounts/fireworks/models/gemma2-9b-it | $0.2000 | $0.2000 | 8K | Updated 4m ago |
| Llama 3.1 8B accounts/fireworks/models/llama-v3p1-8b-instruct | $0.2000 | $0.2000 | 125K | Updated 4m ago |
| Llama 3.2 11B Vision accounts/fireworks/models/llama-v3p2-11b-vision-instruct | $0.2000 | $0.2000 | 125K | Updated 4m ago |
| MiniMax 2.5 accounts/fireworks/models/minimax-m2p5 | $0.3000 | $1.2000 | 40K | Updated 4m ago |
| MiniMax 2.7 accounts/fireworks/models/minimax-m2p7 | $0.3000 | $1.2000 | 80K | Updated 4m ago |
| Llama 4 Scout accounts/fireworks/models/llama4-scout-instruct-basic | $0.5000 | $0.5000 | 125K | Updated 4m ago |
| Qwen 3.6 Plus accounts/fireworks/models/qwen3p6-plus | $0.5000 | $3.0000 | 125K | Updated 4m ago |
| Mixtral 8x7B accounts/fireworks/models/mixtral-8x7b-instruct | $0.5000 | $0.5000 | 32K | Updated 4m ago |
| Kimi K2.5 accounts/fireworks/models/kimi-k2p5 | $0.6000 | $3.0000 | 250K | Updated 4m ago |
| Qwen2.5 72B accounts/fireworks/models/qwen2p5-72b-instruct | $0.9000 | $0.9000 | 125K | Updated 4m ago |
| Qwen3 32B accounts/fireworks/models/qwen3-32b | $0.9000 | $0.9000 | 125K | Updated 4m ago |
| Qwen2.5 VL 72B accounts/fireworks/models/qwen2p5-vl-72b-instruct | $0.9000 | $0.9000 | 125K | Updated 4m ago |
| InternVL3 38B accounts/fireworks/models/internvl3-38b | $0.9000 | $0.9000 | 8K | Updated 4m ago |
| InternVL3 78B accounts/fireworks/models/internvl3-78b | $0.9000 | $0.9000 | 8K | Updated 4m ago |
| Gemma 3 27B accounts/fireworks/models/gemma-3-27b-it | $0.9000 | $0.9000 | 125K | Updated 4m ago |
| DeepSeek R1 Distill Qwen 32B accounts/fireworks/models/deepseek-r1-distill-qwen-32b | $0.9000 | $0.9000 | 125K | Updated 4m ago |
| Llama 3.1 70B accounts/fireworks/models/llama-v3p1-70b-instruct | $0.9000 | $0.9000 | 125K | Updated 4m ago |
| Llama 3.3 70B accounts/fireworks/models/llama-v3p3-70b-instruct | $0.9000 | $0.9000 | 125K | Updated 4m ago |
| DeepSeek R1 Distill Llama 70B accounts/fireworks/models/deepseek-r1-distill-llama-70b | $0.9000 | $0.9000 | 125K | Updated 4m ago |
Showing 40 of 53 models — see the API pricing pages for the full list
Coding1 models
| Model | Input | Output | Context | Updated |
|---|---|---|---|---|
| Qwen2.5 Coder 32B accounts/fireworks/models/qwen2p5-coder-32b-instruct | $0.9000 | $0.9000 | 125K | Updated 4m ago |
Embedding6 models
| Model | Input | Output | Context | Updated |
|---|---|---|---|---|
| Qwen3 Embedding 0.6B accounts/fireworks/models/qwen3-embedding-0p6b | $0.0160 | $0.0160 | 4K | Updated 4m ago |
| Qwen3 Reranker 0.6B accounts/fireworks/models/qwen3-reranker-0p6b | $0.0160 | $0.0160 | 4K | Updated 4m ago |
| Qwen3 Embedding 8B accounts/fireworks/models/qwen3-embedding-8b | $0.1000 | $0.1000 | 8K | Updated 4m ago |
| Qwen3 Reranker 8B accounts/fireworks/models/qwen3-reranker-8b | $0.1000 | $0.1000 | 8K | Updated 4m ago |
| Qwen3 Embedding 4B accounts/fireworks/models/qwen3-embedding-4b | $0.1000 | $0.1000 | 4K | Updated 4m ago |
| Qwen3 Reranker 4B accounts/fireworks/models/qwen3-reranker-4b | $0.1000 | $0.1000 | 4K | Updated 4m ago |
FAQ
What is Fireworks AI?
Fireworks AI is an AI API relay service aggregating multi-provider access through a unified endpoint. ComputeUnion currently tracks 60 price records for this platform — 56 auto-scraped (Updated 4m ago), 4 manually maintained.
How does Fireworks AI pricing compare to official providers?
Relay services like Fireworks AI typically offer discounted rates versus official provider pricing, though latency and SLA terms may differ. Use the category links below to compare Fireworks AI prices against other providers on ComputeUnion.
Is Fireworks AI accessible from China?
Fireworks AI is an international service — direct access from China mainland may be restricted.
Related pages