Technology

Stack battles: the US-China artificial-intelligence rivalry is moving beyond chips alone

China is challenging US leadership in both AI hardware and software, with Europe unlikely to catch up.

Despite Chinese progress, the United States remains for now ahead in the race for dominance over the so-called artificial intelligence hardware stack – the resources and equipment, especially semiconductors, needed to run AI models. US multinational Nvidia is the global market leader with the best-performing AI chips. It accounts for roughly half the world’s installed AI chip stock but as much as two thirds of installed computing capacity.

China’s Huawei is the most credible foreign challenger, but its chips remain technologically inferior. Nevertheless, demand for Huawei’s chips within China is strong because of the combination of incentives and restrictions – not least US export controls on advanced chips – that create a captive market. In this context, Nvidia has largely conceded the Chinese AI chip market to Huawei.

Beijing’s demand-side support for Huawei and other domestic chipmakers – through subsidies, cheap energy and procurement preferences – is part of a strategy to close the gap with the US. Another part of this strategy is diplomatic: seeking to tackle US export controls (García-Herrero and Martens, 2026).

But the competition over hardware tells only part of the story. Nvidia’s durable advantage also rests on CUDA (Compute Unified Device Architecture), its proprietary software platform that makes its best-in-class chips run efficiently and that has become the de-facto standard for AI development. CUDA generates powerful network effects between chip/infrastructure suppliers, AI model developers, application deployers and end users. More of one type of user attracts more of other types of users. Nvidia makes CUDA freely available to developers – a classic zero-price strategy to maximise adoption and lock in demand for its hardware.

Most AI developers write in PyTorch, the leading open-source machine-learning framework. Because Nvidia dominates the installed chip base, developers optimise for CUDA-compatible hardware. This creates a self-reinforcing cycle: the larger the developer community and the richer the library of pre-optimised code, the harder it becomes for alternatives to gain traction. The result is exceptional pricing power (Nvidia’s margins on chips are 70 percent to 80 percent).

Competing ecosystems face a classic chicken-and-egg problem. Several US players (Amazon, Google, Meta) have developed their own chips but still largely rely on the PyTorch/CUDA ecosystem for broad adoption. The network effects and switching costs (code rewrites, performance uncertainty, loss of optimised libraries) are simply too high for most developers and companies.

Huawei’s strategy to bypass Nvidia’s software moat

A rival AI stack to Nvidia will require an alternative software ecosystem. Huawei has CUDA equivalents in CANN (Compute Architecture for Neural Networks) and MindSpore, an open-source machine learning framework optimised for its Ascend AI processors. CANN is still maturing and according to developers is less user-friendly than CUDA. Nevertheless, Huawei has moved aggressively to close the usability gap. In August 2025, it announced the open-sourcing of the CANN toolkit – a direct strike at CUDA’s proprietary model – and has been recruiting Chinese AI labs, universities and research institutions to contribute to an open Ascend developer community.

Just as important is torch_npu, a backend plugin that lets standard PyTorch code run on Ascend processors. Because most AI developers write in PyTorch, rather than directly in CUDA, this lowers the single biggest switching cost: developers no longer need to abandon their familiar framework to leave Nvidia hardware. The Huawei strategy mirrors, layer by layer, the playbook China has used on the hardware side: open up what the incumbent keeps closed, subsidise adoption and nudge a captive domestic generation of developers towards the alternative stack (García-Herrero and Martens, 2026).

Notably, Huawei collaborates with DeepSeek, one of China’s leading AI labs. DeepSeek’s models (open-weight and highly competitive on price/performance) have been engineered for compatibility with both Nvidia and Huawei processors. This is strategically clever: it lowers the barriers for developers already familiar with Nvidia ecosystems to try Ascend hardware, while widening the addressable market for Huawei chips. DeepSeek is rapidly becoming one of the most popular AI models globally. Unlike some earlier closed efforts by US labs, leveraging popular open-weight models as a bridge appears to be accelerating Huawei’s software ecosystem momentum. Whether this can generate self-reinforcing network effects comparable to CUDA’s two-decade head start remains the core open question.

The early evidence suggests that the software gap, like the hardware gap, is no longer fixed or unbridgeable. The signs of Chinese catch-up are real: an open-sourced toolkit with a state-backed contributor pipeline, falling switching costs through PyTorch compatibility, flagship open-weight models running on Ascend and a protected domestic market large enough to sustain the ecosystem through its immature phase. None of these existed in meaningful form two years ago.

Nevertheless, obstacles remain formidable. Chinese developers continue to report serious usability problems with CANN, and CUDA’s accumulated stock of optimised libraries is clearly large. Whether China can close the gap is genuinely uncertain.

The strategic implication, however, is clear. If Chinese companies succeed in crossing the CUDA moat – even only within their domestic market – the durable US advantage will narrow. More generally, the dual US hardware and software strength could become less mutually reinforcing.

European absence

Though it has AI assets (such as chip lithography equipment and AI research; see García-Herrero and Martens, 2026), Europe has no AI chip designer of global significance and no software layer analogous to CUDA or CANN that translates high-level AI algorithms into efficient machine code across a broad developer community.

As CUDA and CANN show, the hardware and software layers are closely intertwined through co-design. Without domestic chip-design capability at scale, it is much harder to build and optimise a competitive software platform, because there is less tight feedback between hardware architects and compiler/runtime developers. The economic value in the AI stack accrues overwhelmingly to those who control chip design and the software platform, not merely to equipment suppliers. With assets such as lithography systems, Europe is, in effect, an indispensable input provider to a race in which it is not itself a full contestant.

However, the absence of a full domestic stack does not mean Europe has no options. Targeted industrial policy can help close the compute gap (García-Herrero and Martens, 2026). Europe could also pursue a more deliberate open-source and vertical-specialisation strategy, with a focus on where it has comparative advantage (examples include automotive AI, climate modelling and trustworthy AI). Whether such a partial strategy can deliver genuine sovereignty or merely reduce dependence is the central strategic question for European policymakers. They should realise that the contest is shifting from who designs the best chips to who controls the full stack, including the software layer.

Source : Bruegel

GLOBAL BUSINESS AND FINANCE MAGAZINE

Recent Posts

Europe’s ungoverned space: Military AI and the autonomy that cannot be bought

European defence ministries and intelligence services run on infrastructure they do not control and cannot…

54 minutes ago

Independent central banks take more risk, not less

The fiscal dominance view holds that politically captured central banks are more likely to be…

58 minutes ago

Writing code versus shipping code: Productivity effects across generations of AI coding tools

Generative AI now writes a substantial share of the world's code, but aggregate software output…

1 hour ago

Electing women does not reduce corruption: Evidence from Brazil

As women gain political ground around the world, there is hope that the election of…

1 hour ago

From bilateralism to a system: Europe’s early trade treaties and lessons for EU trade policy in a contested world

As tensions rise between major powers and the global trading system becomes more contested, policymakers…

1 hour ago

Private capital markets and inequality

Private capital markets have expanded rapidly, but access remains concentrated among wealthy investors. This column…

1 hour ago