DeepSeek API Upgrades Output Speed and Capacity with Default 500 Concurrent Connections
DeepSeek has enhanced its API infrastructure with faster output speeds and default support for 500 concurrent connections, while announcing permanent price reductions of 75% for the V4-Pro model effective May 31.
DeepSeek announced on May 23 significant performance upgrades to its API infrastructure, introducing accelerated output speeds and expanded service capacity designed to support high-throughput enterprise applications.
The update sets 500 concurrent connections as the default limit for API users, with the company stating that the enhancements deliver "faster output speeds and more stable service." Enterprise customers that need higher concurrency can apply online to raise their limit above the default.
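For readers wondering what a concurrency limit means on the client side, the following is a minimal sketch of capping in-flight requests with a semaphore. It is not DeepSeek's SDK or a documented client pattern: `fake_request` is a hypothetical stand-in for a real API call, and `MAX_CONCURRENCY` simply mirrors the default of 500 reported above.

```python
import asyncio

# Default concurrency reported in the announcement; raise it only if a
# higher limit has been granted for the account.
MAX_CONCURRENCY = 500


async def fake_request(i: int) -> int:
    """Hypothetical stand-in for a real API call (an assumption, not an SDK function)."""
    await asyncio.sleep(0.001)  # simulate network latency
    return i


async def run_bounded(n_requests: int, limit: int = MAX_CONCURRENCY):
    """Run n_requests calls while keeping at most `limit` of them in flight."""
    sem = asyncio.Semaphore(limit)
    in_flight = 0
    peak = 0  # highest number of simultaneous calls observed

    async def worker(i: int) -> int:
        nonlocal in_flight, peak
        async with sem:  # waits here once `limit` calls are already running
            in_flight += 1
            peak = max(peak, in_flight)
            try:
                return await fake_request(i)
            finally:
                in_flight -= 1

    # gather returns results in the same order the workers were passed in
    results = await asyncio.gather(*(worker(i) for i in range(n_requests)))
    return results, peak


if __name__ == "__main__":
    results, peak = asyncio.run(run_bounded(2000))
    print(f"completed={len(results)} peak_in_flight={peak}")
```

Keeping the cap on the client avoids sending requests that would exceed the account's limit; how the service responds to excess requests is not described in the announcement.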
The infrastructure improvements coincide with pricing adjustments for the DeepSeek-V4-Pro model. The model is currently offered at a promotional rate of 25% of the original price (a 75% discount). When that promotion ends on May 31, 2026, API pricing will be permanently set to one-quarter of the original rates.
Updated DeepSeek-V4-Pro Pricing (Post-May 31, 2026):
| Token Type | Original Price | New Price |
|---|---|---|
| Input (Cache Hit) | ¥0.10/million tokens | ¥0.025/million tokens |
| Input (Cache Miss) | ¥12/million tokens | ¥3/million tokens |
| Output | ¥24/million tokens | ¥6/million tokens |
The new prices are a 75% reduction from the previous standard rates in all three billing categories. Together with the higher default concurrency limit, the lower cost is aimed at developers and enterprises running large-scale AI applications in production.
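To make the arithmetic concrete, here is a small sketch that estimates a bill from the per-million-token prices in the table above. It assumes cost is simply linear in each token category, which the announcement does not spell out; the function name and the example token counts are illustrative only.

```python
from decimal import Decimal

# Prices in CNY per million tokens, copied from the table above.
ORIGINAL = {
    "cache_hit": Decimal("0.10"),
    "cache_miss": Decimal("12"),
    "output": Decimal("24"),
}
NEW = {
    "cache_hit": Decimal("0.025"),
    "cache_miss": Decimal("3"),
    "output": Decimal("6"),
}


def estimate_cost(prices, cache_hit_tokens, cache_miss_tokens, output_tokens):
    """Estimated cost in CNY, assuming purely linear per-token billing."""
    total = (
        prices["cache_hit"] * cache_hit_tokens
        + prices["cache_miss"] * cache_miss_tokens
        + prices["output"] * output_tokens
    )
    return total / Decimal(1_000_000)


if __name__ == "__main__":
    # Illustrative workload: 2M cached input, 1M uncached input, 0.5M output tokens.
    usage = (2_000_000, 1_000_000, 500_000)
    before = estimate_cost(ORIGINAL, *usage)
    after = estimate_cost(NEW, *usage)
    print(f"original: ¥{before}  new: ¥{after}  ratio: {after / before}")
```

Because every category drops to one-quarter of its original price, the ratio of new to original cost is 0.25 regardless of how the workload is split between input and output tokens.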