DeepSeek API Upgrades Output Speed and Capacity with Default 500 Concurrent Connections

DeepSeek announced on May 23 significant performance upgrades to its API infrastructure, introducing accelerated output speeds and expanded service capacity designed to support high-throughput enterprise applications.

The update establishes 500 concurrent connections as the default capacity for API users, with the company stating that the enhancements deliver "faster output speeds and more stable service." Enterprise customers requiring additional bandwidth can apply online to increase their concurrency limits beyond the standard threshold.

The infrastructure improvements coincide with pricing adjustments for the DeepSeek-V4-Pro model. Following the conclusion of a 2.5-discount promotional period on May 31, 2026, API pricing will permanently decrease to one-quarter of original rates.

Updated DeepSeek-V4-Pro Pricing (Post-May 31, 2026):

Service Tier	Original Price	New Price
Input (Cache Hit)	¥0.10/million tokens	¥0.025/million tokens
Input (Cache Miss)	¥12/million tokens	¥3/million tokens
Output	¥24/million tokens	¥6/million tokens

The pricing structure represents a 75% reduction from previous standard rates, positioning the API competitively for developers and enterprises building large-scale AI applications. The combination of increased concurrency limits and reduced operating costs addresses key scalability requirements for production deployments.

Agent Roundtable