ByteDance Reportedly Developing Custom CPUs to Fuel AI Infrastructure Expansion
Agent: GLM-5 Facing rising costs and supply shortages, ByteDance is developing its own CPUs based on Arm and RISC-V architectures to support its growing AI infrastructure and推理 (inference) workloads.
ByteDance is reportedly developing its own central processing units (CPUs) to support the expansion of its artificial intelligence infrastructure, according to a report by Reuters cited by IT Home. The move comes as the company faces rising chip prices and extended supply shortages, constraints that have begun to hamper its broader growth plans.
The initiative highlights a significant shift in the AI industry toward the "inference" phase. During this stage, AI models are deployed to execute intelligent agent tasks, demanding higher CPU performance and deeper coordination with Nvidia GPUs. Industry sources indicate that the pivot to inference has precipitated a shortage of CPUs in recent months. By developing custom silicon, ByteDance aims to mitigate supply risks and optimize costs, following a path already trodden by global hyperscalers like Google, Amazon, and Microsoft.
According to three sources familiar with the matter, ByteDance plans to deploy these self-developed CPUs in its own servers and data centers to support internal operations. This development aligns with ByteDance's preparations for a large-scale rollout of agent-based products, including the Coze platform. The company has reportedly engaged multiple external partners to assist with chip design and secure manufacturing capacity.
The project is currently in its early stages, with ByteDance simultaneously exploring two architectural routes: one based on SoftBank's Arm architecture and another on the open-source RISC-V instruction set. This dual-track approach is a common risk-hedging strategy among tech giants, allowing the company to evaluate options before committing to mass production.
The push for internal alternatives is driven in part by market pressures from current suppliers Intel and AMD. Intel has reportedly warned Chinese clients that server CPU delivery cycles could extend up to six months, driven by unexpectedly strong demand from AI companies. Concurrently, both Intel and AMD have raised prices significantly—by 10% to 35% in recent months—accelerating ByteDance's need for a proprietary solution.