AI News 05月28日 23:42
Huawei Supernode 384 disrupts Nvidia’s AI market hold
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

华为的Supernode 384架构是AI能力的一项突破,正值中美技术紧张局势加剧之际,标志着全球处理器竞争中的一个重要时刻。华为在深圳举行的鲲鹏昇腾开发者大会上展示了这一创新成果,表明其计算框架直接挑战了英伟达长期以来的市场主导地位。Supernode 384放弃了冯·诺依曼计算原则,转而采用专为现代AI工作负载设计的对等架构,通过用高速总线连接取代传统的以太网互连,将通信带宽提高了15倍,并将单跳延迟从2微秒降低到200纳秒,提高了性能。

🚀华为Supernode 384架构是AI能力的一项突破,采用对等架构,专为现代AI工作负载设计,挑战了传统服务器架构中并行处理规模扩大时,跨机器带宽成为瓶颈的问题。

💡华为CloudMatrix 384的实现展示了令人印象深刻的技术规格:384个昇腾AI处理器,跨越12个计算柜和4个总线柜,产生300 petaflops的原始计算能力,并配有48 terabytes的高带宽内存。

📊实际基准测试显示,在Meta的LLaMA 3等密集AI模型上,Supernode 384每卡每秒达到132个token,性能是传统集群架构的2.5倍。在阿里巴巴Qwen和DeepSeek系列模型上,达到了每卡每秒600到750个token。

🌐华为已在安徽省、内蒙古和贵州省的多个中国数据中心投入使用CloudMatrix 384系统,验证了该架构的可行性,并为更广泛的市场采用建立了基础设施框架。

Huawei’s AI capabilities have made a breakthrough in the form of the company’s Supernode 384 architecture, marking an important moment in the global processor wars amid US-China tech tensions.

The Chinese tech giant’s latest innovation emerged from last Friday’s Kunpeng Ascend Developer Conference in Shenzhen, where company executives demonstrated how the computing framework challenges Nvidia’s long-standing market dominance directly, as the company continues to operate under severe US-led trade restrictions.

Architectural innovation born from necessity

Zhang Dixuan, president of Huawei’s Ascend computing business, articulated the fundamental problem driving the innovation during his conference keynote: “As the scale of parallel processing grows, cross-machine bandwidth in traditional server architectures has become a critical bottleneck for training.”

The Supernode 384 abandons Von Neumann computing principles in favour of a peer-to-peer architecture engineered specifically for modern AI workloads. The change proves especially powerful for Mixture-of-Experts models (machine-learning systems using multiple specialised sub-networks to solve complex computational challenges.)

Huawei’s CloudMatrix 384 implementation showcases impressive technical specifications: 384 Ascend AI processors spanning 12 computing cabinets and four bus cabinets, generating 300 petaflops of raw computational power paired with 48 terabytes of high-bandwidth memory, representing a leap in integrated AI computing infrastructure.

Performance metrics challenge industry leaders

Real-world benchmark testing reveals the system’s competitive positioning in comparison to established solutions. Dense AI models like Meta’s LLaMA 3 achieved 132 tokens per second per card on the Supernode 384 – delivering 2.5 times superior performance compared to traditional cluster architectures.

Communications-intensive applications demonstrate even more dramatic improvements. Models from Alibaba’s Qwen and DeepSeek families reached 600 to 750 tokens per second per card, revealing the architecture’s optimisation for next-generation AI workloads.

The performance gains stem from fundamental infrastructure redesigns. Huawei replaced conventional Ethernet interconnects with high-speed bus connections, improving communications bandwidth by 15 times while reducing single-hop latency from 2 microseconds to 200 nanoseconds – a tenfold improvement.

Geopolitical strategy drives technical innovation

The Supernode 384’s development cannot be divorced from broader US-China technological competition. American sanctions have systematically restricted Huawei’s access to cutting-edge semiconductor technologies, forcing the company to maximise performance within existing constraints.

Industry analysis from SemiAnalysis suggests the CloudMatrix 384 uses Huawei’s latest Ascend 910C AI processor, which acknowledges inherent performance limitations but highlights architectural advantages: “Huawei is a generation behind in chips, but its scale-up solution is arguably a generation ahead of Nvidia and AMD’s current products in the market.”

The assessment reveals how Huawei AI computing strategies have evolved beyond traditional hardware specifications toward system-level optimisation and architectural innovation.

Market implications and deployment reality

Beyond laboratory demonstrations, Huawei has operationalised CloudMatrix 384 systems in multiple Chinese data centres in Anhui Province, Inner Mongolia, and Guizhou Province. Such practical deployments validate the architecture’s viability and establishes an infrastructure framework for broader market adoption.

The system’s scalability potential – supporting tens of thousands of linked processors – positions it as a compelling platform for training increasingly sophisticated AI models. The capability addresses growing industry demands for massive-scale AI implementation in diverse sectors.

Industry disruption and future considerations

Huawei’s architectural breakthrough introduces both opportunities and complications for the global AI ecosystem. While providing viable alternatives to Nvidia’s market-leading solutions, it simultaneously accelerates the fragmentation of international technology infrastructure along geopolitical lines.

The success of Huawei AI computing initiatives will depend on developer ecosystem adoption and sustained performance validation. The company’s aggressive developer conference outreach indicated a recognition that technical innovation alone cannot guarantee market acceptance.

For organisations evaluating AI infrastructure investments, the Supernode 384 represents a new option that combines competitive performance with independence from US-controlled supply chains. However, long-term viability remains contingent on continued innovation cycles and improved geopolitical stability.

(Image from Pixabay)

See also: Oracle plans $40B Nvidia chip deal for AI facility in Texas

Want to learn more about AI and big data from industry leaders? Check out AI & Big Data Expo taking place in Amsterdam, California, and London. The comprehensive event is co-located with other leading events including Intelligent Automation Conference, BlockX, Digital Transformation Week, and Cyber Security & Cloud Expo.

Explore other upcoming enterprise technology events and webinars powered by TechForge here.

The post Huawei Supernode 384 disrupts Nvidia’s AI market hold appeared first on AI News.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

华为 Supernode 384 AI架构 英伟达
相关文章