ByteDance Unleashes Doubao 2.1 Pro: Crosses Production-Grade Threshold as Daily Token Usage Hits 180 Trillion

Beijing, June 23, 2026 — ByteDance's cloud computing arm Volcano Engine has officially launched Doubao Large Model 2.1 Pro, positioning the Chinese tech giant at the forefront of the global AI race as the industry crosses what executives call the "production-grade qualitative change point."

The announcement, made at the 2026 Summer FORCE Conference, comes as ByteDance reveals staggering usage metrics: daily token usage across all platforms has surged to 180 trillion, representing more than 10x growth over the past year. In China's public cloud Model-as-a-Service (MaaS) market, Volcano Engine now commands a 49.5% share of tokens, meaning it processes nearly half of all tokens served through public cloud MaaS in the country.

Crossing the Threshold

"Only when model capabilities cross the 'qualitative change point' can they truly meet production demands for both enterprises and individuals," stated Tan Dai, President of Volcano Engine, emphasizing the shift from AI as an experimental tool to core infrastructure.

The 2.1 Pro release targets two critical domains: Coding and Agent capabilities. In programming benchmarks, Doubao 2.1 Pro approaches or exceeds Claude Opus 4.7 and GPT-5.5 across multiple evaluations. On Terminal Bench 2.1—a benchmark simulating real-world software development scenarios—the model ranks in the global top tier. It scored 59.8 on SciCode scientific computing tests (surpassing Claude Opus 4.7) and achieved 47.0 on NL2Repo-Bench for repository-level code generation, significantly outperforming GPT-5.5.

Perhaps most tellingly, the model completed an 18-hour continuous chip design task, generating 1,303 lines of RTL code for a 16×16 PE Tiny NPU Tile across nine iterative rounds and passing simulation, testing, and synthesis verification, a result Volcano Engine presented as a demonstration of production-grade engineering delivery.

Agent Revolution

In the Agent domain, Doubao 2.1 Pro demonstrates advanced dynamic path planning and autonomous error correction. On OpenAI's GDPval benchmark—which evaluates real-world economic value creation across 44 professions—the model ranks first domestically. In the MCP-Atlas test featuring 36 real MCP servers and 1,000 tasks, it outperformed both Claude Opus 4.7 and GPT-5.5.

A live demonstration showcased the model orchestrating over 500 simultaneous AI agents to construct a 3D virtual city, executing thousands of tool calls to generate more than 100 architecturally distinct buildings with autonomous iterative refinement.

Multimodal and Economic Impact

Beyond core LLM capabilities, Volcano Engine introduced Seedance 2.5, which it claims is the world's first video generation model to cross the production threshold. The system generates 30-second clips (surpassing the industry-standard 20 seconds), accepts up to 50 multimodal inputs for consistency control, and offers localized editing without full regeneration.

The pricing strategy underscores ByteDance's aggressive market positioning. Doubao 2.1 Pro costs ¥6 per million input tokens and ¥30 per million output tokens, with cache hits at just ¥1.2 per million tokens, representing a roughly 80% cost reduction compared to Claude Opus 4.6. A lighter Turbo variant is priced at half these rates.
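To make the quoted rates concrete, the short Python sketch below estimates the bill for a single workload. It uses only the per-million-token prices given above; the function names, the idea that cached tokens are a subset of input tokens billed at the cache-hit rate, and the assumption that the Turbo discount also applies to cache hits are illustrative guesses, not Volcano Engine's published billing rules.

```python
# Prices in CNY per million tokens, as quoted in the article for Doubao 2.1 Pro.
PRO_PRICES = {"input": 6.0, "output": 30.0, "cached_input": 1.2}


def turbo_prices(pro_prices):
    """Halve every Pro rate. Applying the halving to cache hits is an assumption."""
    return {name: rate / 2 for name, rate in pro_prices.items()}


def workload_cost(prices, input_tokens, output_tokens, cached_tokens=0):
    """Estimated cost in CNY.

    cached_tokens is the portion of input_tokens assumed to be billed at the
    cache-hit rate instead of the normal input rate (hypothetical billing model).
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * prices["input"]
        + cached_tokens * prices["cached_input"]
        + output_tokens * prices["output"]
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Example workload: 2M input tokens (1.5M of them cache hits), 0.5M output tokens.
    pro = workload_cost(PRO_PRICES, 2_000_000, 500_000, cached_tokens=1_500_000)
    turbo = workload_cost(turbo_prices(PRO_PRICES), 2_000_000, 500_000, cached_tokens=1_500_000)
    print(f"Pro: ¥{pro:.2f}, Turbo: ¥{turbo:.2f}")
```

Under these assumptions the example workload comes to about ¥19.80 on Pro and ¥9.90 on Turbo, with output tokens accounting for most of the bill.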

Enterprise Adoption Accelerates

Major enterprises have already integrated the model into core workflows. WPS leverages it for automated PPT generation and document processing; Unity China reports strong performance in 3D game scripting; ARM China deploys it for cross-system data retrieval and CAD automation; and New Oriental has deployed AI teaching assistants for personalized education.

With over 200 members in the "Trillion Token Club" (enterprises consuming over 1 trillion tokens annually) and the HiAgent platform ranked first in China's intelligent agent development market, ByteDance is betting that the combination of production-grade capability and aggressive pricing will cement AI as the foundational layer of the digital economy.

As Tan Dai noted, "While per-token prices may fluctuate, the value created per token is rising faster—overall cost-effectiveness is in a clear upward trajectory."
