SiliconFlow API Pricing
🇨🇳 ChinaPayment Methods
SiliconFlow is an AI API relay service. ComputeUnion tracks 58 price records for this platform (57 auto-scraped, Updated 4m ago; 1 manually maintained). Browse the categories below to compare SiliconFlow pricing across providers.
Model Pricing on SiliconFlow
Prices per 1M tokens
LLM48 models
| Model | Input | Output | Context | Updated |
|---|---|---|---|---|
| Qwen2.5-7B-Instruct (Pro) Pro/Qwen/Qwen2.5-7B-Instruct | $0.0516 | $0.0516 | 32K | Updated 4m ago |
| Ling-mini-2.0 inclusionAI/Ling-mini-2.0 | $0.0738 | $0.2951 | 128K | Updated 4m ago |
| Qwen3-VL-8B-Thinking Qwen/Qwen3-VL-8B-Thinking | $0.0738 | $0.7378 | 256K | Updated 4m ago |
| Qwen3-14B Qwen/Qwen3-14B | $0.0738 | $0.2951 | 128K | Updated 4m ago |
| Qwen3-VL-8B-Instruct Qwen/Qwen3-VL-8B-Instruct | $0.0738 | $0.2951 | 256K | Updated 4m ago |
| Qwen3-VL-30B-A3B-Thinking Qwen/Qwen3-VL-30B-A3B-Thinking | $0.1033 | $0.4132 | 256K | Updated 4m ago |
| Qwen2.5-14B-Instruct Qwen/Qwen2.5-14B-Instruct | $0.1033 | $0.1033 | 32K | Updated 4m ago |
| Qwen3-Coder-30B-A3B-Instruct Qwen/Qwen3-Coder-30B-A3B-Instruct | $0.1033 | $0.4132 | 256K | Updated 4m ago |
| Qwen3-Omni-30B-A3B-Thinking Qwen/Qwen3-Omni-30B-A3B-Thinking | $0.1033 | $0.4132 | 64K | Updated 4m ago |
| Qwen3-Omni-30B-A3B-Instruct Qwen/Qwen3-Omni-30B-A3B-Instruct | $0.1033 | $0.4132 | 64K | Updated 4m ago |
| Qwen3-Omni-30B-A3B-Captioner Qwen/Qwen3-Omni-30B-A3B-Captioner | $0.1033 | $0.4132 | 64K | Updated 4m ago |
| Step-3.5-Flash stepfun-ai/Step-3.5-Flash | $0.1033 | $0.3099 | 256K | Updated 4m ago |
| Qwen3-VL-30B-A3B-Instruct Qwen/Qwen3-VL-30B-A3B-Instruct | $0.1033 | $0.4132 | 256K | Updated 4m ago |
| Qwen3-30B-A3B-Instruct-2507 Qwen/Qwen3-30B-A3B-Instruct-2507 | $0.1033 | $0.4132 | 256K | Updated 4m ago |
| Ling-flash-2.0 inclusionAI/Ling-flash-2.0 | $0.1476 | $0.5903 | 128K | Updated 4m ago |
| GLM-4.5-Air zai-org/GLM-4.5-Air | $0.1476 | $0.8854 | 128K | Updated 4m ago |
| DeepSeek-V4-Flash deepseek-ai/DeepSeek-V4-Flash | $0.1476 | $0.2951 | 1M | Updated 4m ago |
| Qwen3-VL-32B-Thinking Qwen/Qwen3-VL-32B-Thinking | $0.1476 | $1.4757 | 256K | Updated 4m ago |
| Qwen3-VL-32B-Instruct Qwen/Qwen3-VL-32B-Instruct | $0.1476 | $0.5903 | 256K | Updated 4m ago |
| Hunyuan-A13B-Instruct tencent/Hunyuan-A13B-Instruct | $0.1476 | $0.5903 | 128K | Updated 4m ago |
| Qwen3-32B Qwen/Qwen3-32B | $0.1476 | $0.5903 | 128K | Updated 4m ago |
| GLM-4.5V zai-org/GLM-4.5V | $0.1476 | $0.8854 | 64K | Updated 4m ago |
| Qwen3.5-397B-A17B Qwen/Qwen3.5-397B-A17B | $0.1771 | $0.1771 | 256K | Updated 4m ago |
| Qwen2.5-32B-Instruct Qwen/Qwen2.5-32B-Instruct | $0.1859 | $0.1859 | 32K | Updated 4m ago |
| Seed-OSS-36B-Instruct ByteDance-Seed/Seed-OSS-36B-Instruct | $0.2214 | $0.5903 | 256K | Updated 4m ago |
| Qwen3.5-9B Qwen/Qwen3.5-9B | $0.2214 | $1.7708 | 256K | Updated 4m ago |
| Qwen3.6-35B-A3B Qwen/Qwen3.6-35B-A3B | $0.2361 | $1.8889 | 256K | Updated 4m ago |
| Qwen3.5-35B-A3B Qwen/Qwen3.5-35B-A3B | $0.2361 | $1.8889 | 256K | Updated 4m ago |
| Qwen3.6-27B Qwen/Qwen3.6-27B | $0.2656 | $2.1250 | 256K | Updated 4m ago |
| Qwen3.5-27B Qwen/Qwen3.5-27B | $0.2656 | $2.1250 | 256K | Updated 4m ago |
| GLM-4-32B-0414 THUDM/GLM-4-32B-0414 | $0.2789 | $0.2789 | 32K | Updated 4m ago |
| DeepSeek-V3 (Pro) Pro/deepseek-ai/DeepSeek-V3 | $0.2951 | $1.1805 | 160K | Updated 4m ago |
| DeepSeek-V3.2 deepseek-ai/DeepSeek-V3.2 | $0.2951 | $0.4427 | 160K | Updated 4m ago |
| DeepSeek-V3.2 (Pro) Pro/deepseek-ai/DeepSeek-V3.2 | $0.2951 | $0.4427 | 160K | Updated 4m ago |
| Qwen3.5-122B-A10B Qwen/Qwen3.5-122B-A10B | $0.2951 | $2.3611 | 256K | Updated 4m ago |
| DeepSeek-V3 deepseek-ai/DeepSeek-V3 | $0.2951 | $1.1805 | 160K | Updated 4m ago |
| MiniMax-M2.5 (Pro) Pro/MiniMaxAI/MiniMax-M2.5 | $0.3099 | $1.2396 | 192K | Updated 4m ago |
| MiniMax-M2.5 MiniMaxAI/MiniMax-M2.5 | $0.3099 | $1.2396 | 192K | Updated 4m ago |
| DeepSeek-R1 deepseek-ai/DeepSeek-R1 | $0.5903 | $2.3611 | 160K | Updated 4m ago |
| DeepSeek-R1 (Pro) Pro/deepseek-ai/DeepSeek-R1 | $0.5903 | $2.3611 | 160K | Updated 4m ago |
Showing 40 of 48 models — see the API pricing pages for the full list
Embedding10 models
| Model | Input | Output | Context | Updated |
|---|---|---|---|---|
| bge-m3 (Pro) Pro/BAAI/bge-m3 | $0.0103 | $0.0103 | 8K | Updated 4m ago |
| Qwen3-Reranker-0.6B Qwen/Qwen3-Reranker-0.6B | $0.0103 | $0.0103 | 32K | Updated 4m ago |
| Qwen3-Embedding-0.6B Qwen/Qwen3-Embedding-0.6B | $0.0103 | $0.0103 | 32K | Updated 4m ago |
| bge-reranker-v2-m3 (Pro) Pro/BAAI/bge-reranker-v2-m3 | $0.0103 | $0.0103 | 8K | Updated 4m ago |
| Qwen3-Reranker-4B Qwen/Qwen3-Reranker-4B | $0.0207 | $0.0207 | 32K | Updated 4m ago |
| Qwen3-Embedding-4B Qwen/Qwen3-Embedding-4B | $0.0207 | $0.0207 | 32K | Updated 4m ago |
| Qwen3-Embedding-8B Qwen/Qwen3-Embedding-8B | $0.0413 | $0.0413 | 32K | Updated 4m ago |
| Qwen3-Reranker-8B Qwen/Qwen3-Reranker-8B | $0.0413 | $0.0413 | 32K | Updated 4m ago |
| Qwen3-VL-Embedding-8B Qwen/Qwen3-VL-Embedding-8B | $0.1033 | $0.1033 | 32K | Updated 4m ago |
| Qwen3-VL-Reranker-8B Qwen/Qwen3-VL-Reranker-8B | $0.1033 | $0.1033 | 32K | Updated 4m ago |
FAQ
What is SiliconFlow?
SiliconFlow is an AI API relay service aggregating multi-provider access through a unified endpoint. ComputeUnion currently tracks 58 price records for this platform — 57 auto-scraped (Updated 4m ago), 1 manually maintained.
How does SiliconFlow pricing compare to official providers?
Relay services like SiliconFlow typically offer discounted rates versus official provider pricing, though latency and SLA terms may differ. Use the category links below to compare SiliconFlow prices against other providers on ComputeUnion.
Is SiliconFlow accessible from China?
SiliconFlow is accessible from China mainland.
Related pages