Home » Technology » A breakthrough system AI from Huawei already in the largest Chinese companies

A breakthrough system AI from Huawei already in the largest Chinese companies

Huawei Unveils AI Supercomputer to Challenge NVIDIA Dominance

CloudMatrix 384 Boasts Unprecedented Power and Connectivity

Chinese tech giant Huawei has launched its most ambitious artificial intelligence computation system, the CloudMatrix 384, aiming to rival the cutting-edge offerings of industry leader NVIDIA.

Unveiling the Atlas 900 A3 SuperPoD

The new system, officially named Atlas 900 A3 SuperPoD, was showcased at the World Artificial Intelligence Conference (WAiC) in Shanghai. This development represents a significant stride for Huawei in the high-performance computing sector.

Industry experts suggest the CloudMatrix 384 directly competes with NVIDIA’s advanced NVL72 GB200 system. Huawei first announced the project in April, generating considerable interest globally.

Performance Metrics and Design Philosophy

The CloudMatrix 384 reportedly delivers up to 300 PFLOPS in BF16 computation, nearly double the 180 PFLOPS offered by NVIDIA’s system. It also surpasses NVIDIA in memory capacity by 3.6 times and bandwidth by 2.1 times.

Built upon Huawei’s Ascend AI framework, the system emphasizes three key advantages: extensive bandwidth, minimal latency, and exceptional performance density. These features are designed to enhance AI model training efficiency and ensure reliability for large-scale operations.

While a single Ascend 910C processor offers about a third of the performance of NVIDIA’s Blackwell architecture, Huawei compensates by integrating a substantially larger number of processors—384—into its system.

A notable design choice is the system’s exclusive reliance on fiber optics for all internal and inter-cabinet connections, a departure from traditional copper cabling, which facilitates higher communication speeds.

Efficiency Challenges and Market Impact

Despite its performance gains, the CloudMatrix 384 faces efficiency hurdles. It is reportedly 2.3 times less energy-efficient per FLOP, 1.8 times less efficient for terabyte-per-second memory bandwidth, and 1.1 times less efficient for terabyte HBM memory when compared to NVIDIA’s solutions.

However, within the context of China’s abundant energy resources and the nation’s efforts to bolster its advanced silicon capabilities, Huawei’s approach may prove strategically advantageous.

Adoption and Pricing

Priced at $8 million per unit, the CloudMatrix 384 is positioned for enterprise-level clients. Reports indicate that ten major Chinese companies have already integrated the system into their data center infrastructures.

This move by Huawei highlights the escalating competition in the AI hardware market, with companies investing heavily in systems that can power the next generation of artificial intelligence.

Polsat News

You may also like

Leave a Comment

This site uses Akismet to reduce spam. Learn how your comment data is processed.