NVIDIA Blog, June 6, 04:25
NVIDIA Blackwell Delivers Breakthrough Performance in Latest MLPerf Training Results

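NVIDIA is working with companies worldwide to build AI factories that speed up the training and deployment of next-generation AI applications using the latest training and inference techniques. In the latest MLPerf Training round, the NVIDIA AI platform delivered the highest performance at scale on every benchmark and powered every result submitted on the toughest large language model (LLM) test, Llama 3.1 405B pretraining. On that benchmark, Blackwell delivered 2.2x the performance of the previous-generation architecture at the same scale. The gains stem from Blackwell innovations, including high-density liquid-cooled racks and high-speed interconnects, along with advances in the NVIDIA NeMo Framework software stack.

🚀 The NVIDIA Blackwell architecture is built to meet the performance demands of next-generation AI applications, and it posted strong results in the MLPerf Training v5.0 benchmarks, particularly on the LLM-related tests.

💻 On the Llama 3.1 405B pretraining benchmark, Blackwell delivered 2.2x the performance of the previous generation at the same scale. On the Llama 2 70B LoRA fine-tuning benchmark, DGX B200 systems delivered 2.5x more performance than a prior-round submission using the same number of GPUs.

💡 Blackwell's innovations include high-density liquid-cooled racks, high-speed interconnect technologies and advances in the NVIDIA NeMo Framework software stack, all of which raise performance and speed up AI application development.

🤝 NVIDIA's partner ecosystem, including CoreWeave and IBM, took an active part in this MLPerf round, helping advance AI technology and accelerate the build-out of AI factories.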

NVIDIA is working with companies worldwide to build out AI factories — speeding the training and deployment of next-generation AI applications that use the latest advancements in training and inference.

The NVIDIA Blackwell architecture is built to meet the heightened performance requirements of these new applications. In the latest round of MLPerf Training — the 12th since the benchmark’s introduction in 2018 — the NVIDIA AI platform delivered the highest performance at scale on every benchmark and powered every result submitted on the benchmark’s toughest large language model (LLM)-focused test: Llama 3.1 405B pretraining.

The NVIDIA platform was the only one that submitted results on every MLPerf Training v5.0 benchmark — underscoring its exceptional performance and versatility across a wide array of AI workloads, spanning LLMs, recommendation systems, multimodal LLMs, object detection and graph neural networks.

The at-scale submissions used two AI supercomputers powered by the NVIDIA Blackwell platform: Tyche, built using NVIDIA GB200 NVL72 rack-scale systems, and Nyx, based on NVIDIA DGX B200 systems. In addition, NVIDIA collaborated with CoreWeave and IBM to submit GB200 NVL72 results using a total of 2,496 Blackwell GPUs and 1,248 NVIDIA Grace CPUs.
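As a quick sanity check on those totals, the sketch below works out the GPU-to-CPU ratio of the joint CoreWeave and IBM submission. The GPU and CPU counts come from the paragraph above. The pairing of one Grace CPU with two Blackwell GPUs per GB200 superchip is an assumption about the hardware that the article does not state, and the variable names are illustrative only.

```python
# Totals quoted in the article for the joint CoreWeave/IBM submission.
blackwell_gpus = 2496
grace_cpus = 1248

# Ratio of GPUs to CPUs across the whole submission.
gpus_per_cpu = blackwell_gpus / grace_cpus

# Assumption (not stated in the article): each GB200 superchip pairs
# one Grace CPU with two Blackwell GPUs, so the superchip count equals
# the Grace CPU count when the ratio is exactly 2.
assumed_gpus_per_superchip = 2
superchips = blackwell_gpus // assumed_gpus_per_superchip

print(f"GPUs per Grace CPU: {gpus_per_cpu:.0f}")
print(f"Implied GB200 superchips: {superchips}")
```

Under that assumption, the quoted totals are consistent with each other: 2,496 GPUs and 1,248 CPUs give exactly two GPUs per CPU.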

On the new Llama 3.1 405B pretraining benchmark, Blackwell delivered 2.2x greater performance compared with previous-generation architecture at the same scale.

On the Llama 2 70B LoRA fine-tuning benchmark, NVIDIA DGX B200 systems, powered by eight Blackwell GPUs, delivered 2.5x more performance compared with a submission using the same number of GPUs in the prior round.
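To make these multiples concrete: MLPerf Training scores a submission by the time it takes to train a model to a target quality, so a generational speedup can be read as a ratio of training times. The following is a minimal sketch under that reading. The function names and the training times are hypothetical, chosen only so that the ratio matches the article's 2.2x figure, and are not actual MLPerf results.

```python
def speedup(baseline_minutes: float, new_minutes: float) -> float:
    """Return how many times faster the new run is than the baseline."""
    if baseline_minutes <= 0 or new_minutes <= 0:
        raise ValueError("training times must be positive")
    return baseline_minutes / new_minutes


def time_saved_fraction(factor: float) -> float:
    """Fraction of the baseline training time removed by a given speedup."""
    return 1.0 - 1.0 / factor


# Hypothetical times, picked only so the ratio equals the quoted 2.2x.
hypothetical_baseline = 220.0
hypothetical_new = 100.0

factor = speedup(hypothetical_baseline, hypothetical_new)
print(f"speedup: {factor:.1f}x")
print(f"time saved: {time_saved_fraction(factor):.0%}")
```

Read this way, a 2.2x speedup cuts training time by roughly 55 percent, and a 2.5x speedup cuts it by 60 percent.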

These performance leaps highlight advancements in the Blackwell architecture, including high-density liquid-cooled racks, 13.4TB of coherent memory per rack, fifth-generation NVIDIA NVLink and NVIDIA NVLink Switch interconnect technologies for scale-up and NVIDIA Quantum-2 InfiniBand networking for scale-out. Plus, innovations in the NVIDIA NeMo Framework software stack raise the bar for next-generation multimodal LLM training, critical for bringing agentic AI applications to market.

These agentic AI-powered applications will one day run in AI factories — the engines of the agentic AI economy. These new applications will produce tokens and valuable intelligence that can be applied to almost every industry and academic domain.

The NVIDIA data center platform includes GPUs, CPUs, high-speed fabrics and networking, as well as a vast array of software like NVIDIA CUDA-X libraries, the NeMo Framework, NVIDIA TensorRT-LLM and NVIDIA Dynamo. This highly tuned ensemble of hardware and software technologies empowers organizations to train and deploy models more quickly, dramatically accelerating time to value.

The NVIDIA partner ecosystem participated extensively in this MLPerf round. Beyond the submission with CoreWeave and IBM, other compelling submissions were from ASUS, Cisco, Dell Technologies, Giga Computing, Google Cloud, Hewlett Packard Enterprise, Lambda, Lenovo, Nebius, Oracle Cloud Infrastructure, Quanta Cloud Technology and Supermicro.

Learn more about MLPerf benchmarks.
