EnterpriseAI 2024年08月28日
IBM’s Telum II and Spyre Accelerator Bring Advanced AI Capabilities to Modern Mainframes
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

IBM在Hot Chips 2024活动上公布新Telum II处理器和Spyre AI加速器规格,旨在推动AI工作负载,满足高效、安全和可扩展需求

🎯IBM的新Telum II处理器在前代基础上有多项关键改进,具有全新数据处理单元,能加速复杂IO协议,提高关键组件性能,有效处理更大更复杂数据集

💪Spyre加速器为Telum II处理器提供额外AI计算能力,支持‘集成AI建模方法’,增强AI预测的稳健性,每个芯片含32个计算核心,降低延迟提高吞吐量

🚀Telum II处理器和Spyre加速器的强大规格使AI应用更高效安全,前者能有效管理大规模AI工作负载,后者适合处理复杂AI模型和生成式AI用例,二者集成增强了主机可靠性和安全性

📅这两种技术都将由三星Foundry使用5nm工艺制造,IBM预计Telum II处理器和Spyre加速器将于2025年提供给LinuxOnE和IBM Z客户

IBM, a leader in global hybrid cloud and AI solutions, unveiled the specifications of its new Telum II processors and Spyre AI accelerators, which are set to drive AI workloads on the latest IBM Z mainframe systems.

The announcement was made at the Hot Chips 2024 event happening this week at Stanford University. 

As GenAI initiatives transition from proof of concept to production, there is an increasing demand for power-efficient, secure, and scalable solutions. According to research from Morgan Stanley, GenAI power demands will rise 70% annually for the next few years. The research indicates that by 2027, GenAI can consume as much energy as Spain needed to power itself in 2022. 

The new Telum II process and Spyre AI accelerators are engineered to address these escalating demands effectively. 

The Telum II processor builds on its predecessor, the first-generation Telum chip, with several key improvements. The new processor features a completely new data processing unit (DPU) designed to accelerate complex IO protocols for networking and storage on the mainframe and to improve key component performance.

IBM claims the new DPU offers increased frequency, memory capacity, and an integrated AI accelerator core. This allows it to handle larger and more complex datasets efficiently. 

The Spyre accelerator complements the Telum II processor by offering additional AI compute capabilities and supports what IBM calls “ensemble methods of AI modeling” - which combines multiple models to potentially boost prediction accuracy. 

Ensemble methods enhance the robustness of AI predictions, making them more reliable and less sensitive to errors or variations compared to individual models. The IBM Spyre accelerator chip will be delivered as an add-on option. Each Spyre chip contains 32 compute cores for AI applications, which reduces latency and enhances throughput across various AI tasks.

"Our robust, multi-generation roadmap positions us to remain ahead of the curve on technology trends, including escalating demands of AI," said Tina Tarquinio, VP, Product Management, IBM Z, and LinuxONE. 

"The Telum II processor and Spyre accelerator are designed to deliver high-performance, secured, and more power-efficient enterprise computing solutions. After years in development, these innovations will be introduced in our next-generation IBM Z platform so clients can leverage LLMs and generative AI at scale."

The new Telum II processor is a major upgrade to the original Telum processor that debuted in 2021. With 8 high-performance cores running at 5.5 gigahertz and with 36 megabytes of memory per core, Telum II processors offer an increase of 40% in on-chip cache capacity for a total of 360MB. The integrated AI accelerator enables low-latency, high-throughput AI inferencing during transactions. 

The powerful specifications of the two new technologies translate to better efficiency and security for AI-powered applications. While the  Telum II processor is designed to efficiently manage large-scale AI workloads and data-intensive business needs, the Spyre accelerator is geared toward handling complex AI models and generative AI use cases. 

The integration of Telum II and Spyre accelerators eliminates the need to transfer data to external GPU-equipped servers, thereby enhancing the mainframe's reliability and security.

Both technologies will be manufactured by IBM's long-standing fabrication partner, Samsung Foundry, using a 5 nm process. IBM expects the Telum II processor to be available to LinuxOnE and IBM Z clients in 2025. The Spyre accelerator is also expected to be available in 2025. 

Related Items 

Groq’s $640 Million Funding Poised to Disrupt AI Chip Market 

Is the GenAI Bubble Finally Popping? 

AWS and Fujitsu Expand Partnership to Modernize Legacy Cloud Applications 

 

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

IBM Telum II 处理器 Spyre 加速器 AI 应用
相关文章