MarkTechPost@AI 2024年10月22日
IBM Releases Granite 3.0 2B and 8B AI Models for AI Enterprises
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

IBM 正式发布 Granite 3.0 AI 模型,为企业提供高性能、安全且可信的 AI 解决方案。该模型支持多种企业应用场景,在技术上有诸多优势,且注重开放性、可扩展性和透明度,有助于企业克服 AI 应用的传统障碍。

IBM 的 Granite 3.0 AI 模型基于大型语言模型构建,专为企业 AI 应用设计。包括 8B 和 2B 参数密集的解码器模型,在多种基准测试中表现出色,训练数据丰富,涵盖多种语言和编程语言,为自然语言处理任务提供强大支持,并确保隐私和安全。

这些模型具有开放性和可扩展性,开发者可根据企业需求进行调整。采用 Apache 2.0 许可,训练数据和方法公开,可在 IBM Watsonx 平台及合作伙伴处获取。且模型训练使用 100%可再生能源,体现了 IBM 对可持续性的承诺。

Granite 3.0 在行业特定任务中准确性提高,增强了企业用户的决策效率。在学术和企业任务基准测试中表现优异,在对抗性提示基准测试中显示出可靠性。还包括推理高效的产品,如 MoE 模型和推测解码器模型,满足企业对高性能、高效且成本效益的部署需求。

Artificial intelligence is advancing rapidly, but enterprises face many obstacles when trying to leverage AI effectively. Organizations require models that are adaptable, secure, and capable of understanding domain-specific contexts while also maintaining compliance and privacy standards. Traditional AI models often struggle with delivering such tailored performance, requiring businesses to make a trade-off between customization and general applicability. Additionally, many AI models lack transparency, hindering trust among enterprise users.

IBM has officially released Granite 3.0 AI Models, a new line of foundation models designed to bring advanced AI capabilities to enterprises. These models represent a crucial step forward in IBM’s ongoing efforts to provide businesses with AI solutions that are not only high-performing but also secure and trustworthy. Granite 3.0 models are built to support diverse use cases in enterprise environments, ranging from natural language understanding to facilitating enhanced decision-making processes. Built on IBM’s watsonx AI and data platform, Granite 3.0 aims to allow companies to easily integrate AI in their workflows, thus improving efficiency while adhering to the specific security and privacy needs that enterprises often require.

Technically speaking, IBM’s Granite 3.0 AI models are built upon large language models (LLMs), designed specifically for enterprise AI applications. These include 8B and 2B parameter-dense decoder-only models, which outperformed similarly sized Llama-3.1 8B in Hugging Face’s OpenLLM Leaderboard (v2). The models are trained on over 12 trillion tokens across 12 languages and 116 programming languages, providing a versatile base for natural language processing (NLP) tasks and ensuring privacy and security. With capabilities that span across understanding unstructured data, generating content, summarizing information, and even facilitating complex decision-making, Granite 3.0 delivers powerful NLP features in a secure and transparent manner.

Moreover, these models are open and extensible, giving developers the freedom to adapt them as per their enterprise requirements. The models are licensed under Apache 2.0, with disclosed training data and methods and are available on the IBM Watsonx platform as well as through partners. Notably, the models were trained using 100% renewable energy, underscoring IBM’s commitment to sustainability.

One of the critical reasons why Granite 3.0 is a significant development is its focus on openness, extensibility, and transparency, which addresses one of the key barriers to AI adoption in enterprise environments—trust. Granite 3.0 provides transparency into how the models are built, with full documentation available, making it easier for enterprises to understand how the model makes decisions. Additionally, Granite 3.0’s integration with the Watsonx platform means that it benefits from Watsonx’s suite of tools, which include capabilities for data governance, model monitoring, and prompt-tuning.

According to IBM’s benchmarks, Granite 3.0 has shown improved accuracy in industry-specific tasks compared to previous models, leading to enhanced decision-making efficiency for enterprise users. The models rival Meta and Mistral AI models on academic benchmarks, lead on RAGBench for enterprise tasks, excel on cybersecurity benchmarks, and outperform peers on function calling benchmarks. The industry-leading robustness on the adversarial prompt benchmark AttaQ further demonstrates Granite 3.0’s reliability. The use of open-source elements also allows organizations to audit and refine the models to suit their specific needs, reducing the time and effort required for AI customization and deployment.

The Granite 3.0 release also includes inference-efficient offerings, such as Mixture of Experts (MoE) models—3B-A800M and 1B-A400M—designed for high efficiency in on-device, CPU servers and low-latency use cases. Additionally, a speculative decoder model accelerates inference by 220%, thanks to innovations in token conditioning and two-phase training. These advancements make Granite 3.0 particularly appealing for enterprises that require not only high performance but also efficient and cost-effective deployment options.

IBM Granite 3.0 AI Models mark an important leap in enterprise AI, focusing on the specific requirements of security, adaptability, and transparency. By providing open and extensible models that integrate with IBM’s Watsonx AI platform, Granite 3.0 helps enterprises overcome some of the traditional barriers to AI adoption, such as concerns about privacy, lack of customization, and trust in AI systems. The versatility of Granite 3.0 for natural language tasks, combined with its transparency and easy integration capabilities, positions it as a valuable tool for enterprises looking to leverage AI effectively and responsibly. As organizations continue to navigate the complexities of AI implementation, IBM’s Granite 3.0 serves as an ideal foundation for driving innovation, operational efficiency, and enhanced decision-making across industries.


Check out the Details, Paper, and Model on Hugging Face. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter and join our Telegram Channel and LinkedIn Group. If you like our work, you will love our newsletter.. Don’t Forget to join our 50k+ ML SubReddit.

[Upcoming Live Webinar- Oct 29, 2024] The Best Platform for Serving Fine-Tuned Models: Predibase Inference Engine (Promoted)

The post IBM Releases Granite 3.0 2B and 8B AI Models for AI Enterprises appeared first on MarkTechPost.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

IBM Granite 3.0 企业 AI 自然语言处理 AI 解决方案 可持续性
相关文章