share_log

IBM Introduces Granite 3.0: High Performing AI Models Built for Business

IBM Introduces Granite 3.0: High Performing AI Models Built for Business

IBM推出Granite 3.0:专为业务打造的高性能人工智能模型
PR Newswire ·  10/21 00:01
  • New Granite 3.0 8B & 2B models, released under the permissive Apache 2.0 license, show strong performance across many academic and enterprise benchmarks, able to outperform or match similar-sized models
  • New Granite Guardian 3.0 models deliver IBM's most comprehensive guardrail capabilities to advance safe and trustworthy AI
  • New Granite 3.0 Mixture-of-Experts models enable extremely efficient inference and low latency, suitable for CPU-based deployments and edge computing
  • New Granite Time Series model achieved state-of-the-art performance in zero/few-shot forecasting, outperforming models 10 times larger
  • IBM unveils next generation of Granite-powered watsonx Code Assistant for general purpose coding; Debuts new tools in watsonx.ai for building and deploying AI applications and agents
  • Announces Granite will become the default model of Consulting Advantage, an AI-powered delivery platform used by IBM's 160,000 consultants to bring new solutions to clients faster
  • 新的花岗岩3.0 80亿 和 20亿 款型,发布在宽松的阿帕奇石油2.0许可下,展示了在许多学术和企业基准中表现出色的性能,能够胜过或与类似规模的模型相匹配
  • 新的花岗守护者3.0 模型提供IBM最全面的道路护栏功能,以推动安全和可靠的人工智能
  • 新的花岗岩3.0 专家混合模型实现了极高效的推理和低延迟,适用于基于CPU的部署和边缘计算
  • 新的花岗时间序列模型在零/少拍摄预测中取得了最先进的性能,胜过体积大10倍的模型
  • IBM推出了基于Granite的下一代watsonx 代码助手,用于通用编码;在watsonx.ai中推出了用于构建和部署人工智能应用程序和代理的新工具
  • 宣布Granite将成为Consulting Advantage的默认模型,这是IBM 16万名顾问使用的AI驱动型交付平台,以更快地为客户带来新解决方案

ARMONK, N.Y., Oct. 21, 2024 /PRNewswire/ -- Today, at IBM's (NYSE: IBM) annual TechXchange event the company announced the release of its most advanced family of AI models to date, Granite 3.0. IBM's third-generation Granite flagship language models can outperform or match similarly sized models from leading model providers on many academic and industry benchmarks, showcasing strong performance, transparency and safety.

2024年10月21日,纽约ARMONk /美通社/ -- 今天,在IBM (纽交所: IBM) 年度TechXchange活动上,该公司宣布发布迄今为止最先进的AI模型系列,Granite 3.0。IBM第三代Granite旗舰语言模型在许多学术和行业基准上能够超越或与领先的模型提供商的类似规模模型相匹配,展示出色的性能,透明度和安全性

IBM Granite
IBM Granite

Consistent with the company's commitment to open-source AI, the Granite models are released under the permissive Apache 2.0 license, making them unique in the combination of performance, flexibility and autonomy they provide to enterprise clients and the community at large.

符合公司对开源人工智能的承诺,Granite模型以宽松的Apache 2.0许可协议发布,使其在性能、灵活性和自主性的结合方面为企业客户和整个社区提供了独特的优势。

IBM's Granite 3.0 family includes:

IBM的Granite 3.0系列包括:

  • General Purpose/Language: Granite 3.0 8B Instruct, Granite 3.0 2B Instruct, Granite 3.0 8B Base, Granite 3.0 2B Base
  • Guardrails & Safety: Granite Guardian 3.0 8B, Granite Guardian 3.0 2B
  • Mixture-of-Experts: Granite 3.0 3B-A800M Instruct, Granite 3.0 1B-A400M Instruct, Granite 3.0 3B-A800M Base, Granite 3.0 1B-A400M Base
  • 通用语言:Granite 3.0 80亿 Instruct、Granite 3.0 20亿 Instruct、Granite 3.0 80亿 Base、Granite 3.0 20亿 Base
  • 防护栏与安全:Granite Guardian 3.0 80亿、Granite Guardian 3.0 2B
  • 专家组合:Granite 3.0 30亿.A80000万 Instruct、Granite 3.0 10亿.A40000万 Instruct、Granite 3.0 30亿.A80000万 Base、Granite 3.0 10亿.A40000万 Base

The new Granite 3.0 8B and 2B language models are designed as 'workhorse' models for enterprise AI, delivering strong performance for tasks such as Retrieval Augmented Geneneration (RAG), classification, summarization, entity extraction, and tool use. These compact, versatile models are designed to be fine-tuned with enterprise data and seamlessly integrated across diverse business environments or workflows.

全新的Granite 3.0 80亿和20亿语言模型被设计为企业人工智能的“工作马”型号,为检索增强生成(RAG)、分类、摘要、实体提取和工具使用等任务提供强大性能。这些紧凑、多功能的模型旨在通过企业数据进行微调,并在不同的业务环境或工作流中实现无缝集成。

While many large language models (LLMs) are trained on publicly available data, a vast majority of enterprise data remains untapped. By combining a small Granite model with enterprise data, especially using the revolutionary alignment technique InstructLab – introduced by IBM and RedHat in May – IBM believes businesses can achieve task-specific performance that rivals larger models at a fraction of the cost (based on an observed range of 3x-23x less cost than large frontier models in several early proofs-of-concept1).

虽然许多大型语言模型(LLMs)是在公开可用数据上进行训练的,但绝大多数企业数据仍未被利用。通过将小型Granite模型与企业数据结合,尤其是使用5月份由IBm和RedHat介绍的革命性对齐技术InstructLab,IBm相信企业可以以低于大型先驱模型多达3倍至23倍的成本(基于早期证明概念1的观察范围)实现特定任务的性能。

The Granite 3.0 release reaffirms IBM's commitment to building transparency, safety, and trust in AI products. The Granite 3.0 technical report and responsible use guide provide a description of the datasets used to train these models, details of the filtering, cleansing, and curation steps applied, along with comprehensive results of model performance across major academic and enterprise benchmarks.

Granite 3.0发布再次确认了IBM在构建人工智能产品的透明度、安全性和信任度方面的承诺。Granite 3.0技术报告和负责任使用指南提供了训练这些模型所使用数据集的描述,所应用的过滤、清洁和筛选步骤的详细信息,以及对主要学术和企业基准测试中模型性能的全面结果。

Critically, IBM provides an IP indemnity for all Granite models on watsonx.ai so enterprise clients can be more confident in merging their data with the models.

重要的是,IBM为watsonx.ai上的所有Granite模型提供知识产权保护,因此企业客户可以更加放心地将其数据与这些模型合并。

Raising the bar: Granite 3.0 benchmarks

提升标准:Granite 3.0基准测试

The Granite 3.0 language models also demonstrate promising results on raw performance.

Granite 3.0语言模型在原始性能方面也展现出令人期待的结果。

On standard academic benchmarks defined by Hugging Face's OpenLLM Leaderboard, the Granite 3.0 8B Instruct model's overall performance leads on average against state-of-the-art-performance of similar-sized open source models from Meta and Mistral. On IBM's state-of-the-art AttaQ safety benchmark, the Granite 3.0 8B Instruct model leads across all measured safety dimensions compared to models from Meta and Mistral.2

根据Hugging Face的OpenLLM排行榜所定义的标准学术基准测试,Granite 3.0 80亿 Instruct模型的整体性能平均领先于Meta和Mistral等规模相似的开源模型的最新性能。在IBM的最新AttaQ安全基准测试中,Granite 3.0 80亿 Instruct模型在所有测量的安全维度上均领先于Meta和Mistral的模型。

Across the core enterprise tasks of RAG, tool use, and tasks in the Cybersecurity domain, the Granite 3.0 8B Instruct model shows leading performance on average compared to similar-sized open source models from Mistral and Meta.3

在RAG、工具使用和网络安全领域的核心企业任务中,Granite 3.0 80亿 Instruct模型的整体表现平均优于来自Mistral和Meta等规模相似的开源模型。

The Granite 3.0 models were trained on over 12 trillion tokens on data taken from 12 different natural languages and 116 different programming languages, using a novel two-stage training method, leveraging results from several thousand experiments designed to optimize data quality, data selection, and training parameters. By the end of the year, the 3.0 8B and 2B language models are expected to include support for an extended 128K context window and multi-modal document understanding capabilities.

Granite 3.0模型在来自12种不同自然语言和116种不同编程语言的超过12万亿标记的数据上进行了训练,采用了一种新颖的两阶段训练方法,利用了数千次实验的结果,旨在优化数据质量、数据选择和训练参数。到年底,预计3.0 80亿和20亿语言模型将支持扩展的12.8万上下文窗口和多模态文档理解能力。

Demonstrating an excellent balance of performance and inference cost, IBM offers its Granite Mixture of Experts (MoE) Architecture models, Granite 3.0 1B-A400M and Granite 3.0 3B-A800M, as smaller, lightweight models that could be deployed for low latency applications as well as CPU-based deployments.

展示了出色的性能与推断成本之间的平衡,IBM提供其Granite专家混合(MoE)架构模型,Granite 3.0 10亿.A40000万和Granite 3.0 30亿.A80000万,作为较小、轻量化的模型,可用于低延迟应用以及基于CPU的部署。

IBM is also announcing an updated release of its pre-trained Granite Time Series models, the first versions of which were released earlier this year. These new models are trained on 3 times more data and deliver strong performance on all three major time series benchmarks, outperforming 10 times larger models from Google, Alibaba, and others. The updated models also provide greater modeling flexibility with support for external variables and rolling forecasts.4

IBM还宣布更新了其预训练的Granite时间序列模型,其中第一个版本是在今年早些时候发布的。这些新模型经过了3倍数据训练,在所有三个主要时间序列基准测试上表现出色,超过了来自Google、阿里巴巴等机构10倍更大的模型。更新后的模型还提供更大的建模灵活性,支持外部变量和滚动预测。

Introducing Granite Guardian 3.0: ushering the next era of responsible AI

推出Granite Guardian 3.0:迈入下一个负责任人工智能时代

As part of this release, IBM is also introducing a new family of Granite Guardian models that permit application developers to implement safety guardrails by checking user prompts and LLM responses for a variety of risks. The Granite Guardian 3.0 8B and 2B models provide the most comprehensive set of risk and harm detection capabilities available in the market today.

作为此版本的一部分,IBm还推出了一系列新的Granite Guardian模型,允许应用程序开发人员通过检查用户提示和LLm响应来实施安全防护措施,以防范各种风险。Granite Guardian 3.0 80亿和20亿模型在市场上提供了最全面的风险和危害检测能力。

In addition to harm dimensions such as social bias, hate, toxicity, profanity, violence, jailbreaking and more, these models also provide a range of unique RAG-specific checks such as groundedness, context relevance, and answer relevance. In extensive testing across 19 safety and RAG benchmarks, the Granite Guardian 3.0 8B model has higher overall accuracy on harm detection on average than all three generations of Llama Guard models from Meta. It also showed on par overall performance in hallucination detection on average with specialized hallucination detection models WeCheck and MiniCheck.5

除了社会偏见、仇恨、毒性、亵渎、暴力、越狱等危害维度,这些模型还提供一系列独特的RAG特定检查,如扎根性、上下文相关性和答案相关性。在对19个安全和RAG基准的广泛测试中,Granite Guardian 3.0 80亿模型的平均危害检测准确率高于Meta的三代Llama Guard模型。它在幻觉检测的整体表现方面展现出与专门的幻觉检测模型WeCheck和MiniCheck相当的性能水平。

While the Granite Guardian models are derived from the corresponding Granite language models, they can be used to implement guardrails alongside any open or proprietary AI models.

虽然Granite Guardian模型源自相应的Granite语言模型,但它们可以用来在任何开源或专有人工智能模型旁边实施防护栏。

Availability of Granite 3.0 models
The entire suite of Granite 3.0 models and the updated time series models are available for download on HuggingFace under the permissive Apache 2.0 license. The instruct variants of the new Granite 3.0 8B and 2B language models and the Granite Guardian 3.0 8B and 2Bmodels are available today for commercial use on IBM's watsonx platform. A selection of the Granite 3.0 models will also be available as NVIDIA NIM microservices and through Google Cloud's Vertex AI Model Garden integrations with HuggingFace.

Granite 3.0模型的可用性
整套Granite 3.0模型和更新后的时间序列模型可在HuggingFace上按照Apache 2.0许可证进行下载。新的Granite 3.0 80亿和20亿语言模型以及Granite Guardian 3.0 80亿和20亿模型的指示变体今天已在IBM的watsonx平台上商用。选定的Granite 3.0模型还将作为NVIDIA NIm微服务和通过Google Cloud的Vertex AI Model Garden与HuggingFace集成提供。

To help provide developer choice and ease of use and support local, edge deployments, a curated set of the Granite 3.0 models are also available on Ollama and Replicate.

为了帮助提供开发者选择和易用性,并支持本地、边缘部署,Granite 3.0模型的精选套件也可在Ollama和Replicate上使用。

The latest generation of Granite models expand IBM's robust open-source catalog of powerful LLMs. IBM has collaborated with ecosystem partners like AWS, Docker, Domo, Qualcomm Technologies, Inc. via its Qualcomm AI Hub, Salesforce, SAP, and others to integrate a variety of Granite models into these partners' offerings or make Granite models available on their platforms, offering greater choice to enterprises across the world.

Granite最新一代模型扩展了IBM强大的开源LLM目录。IBm已与AWS、Docker、Domo、高通技术公司(通过其高通AI中心)、Salesforce、SAP等生态系统合作伙伴合作,将各种Granite模型整合到这些合作伙伴的产品中,或在其平台上提供Granite模型,为全球企业提供更多选择。

Assistants to Agents: realizing the future for enterprise AI

助理到代理商:实现企业人工智能的未来

IBM is advancing enterprise AI through a spectrum of technologies – from models and assistants, to the tools needed to tune and deploy AI specifically for companies' unique data and use-cases. IBM is also paving the way for future AI agents that can self-direct, reflect, and perform complex tasks in dynamic business environments.

英伟达正在通过各种技术推动企业人工智能发展,从模型和助手到为公司的独特数据和用例调整和部署人工智能所需的工具。英伟达还为未来能够自主、反思并在动态商业环境中执行复杂任务的人工智能代理铺平了道路。

IBM continues to evolve its portfolio of AI assistant technologies – from watsonx Orchestrate to help companies build their own assistants via low-code tooling and automation, to a wide set of pre-built assistants for specific tasks and domains such as customer service, human resources, sales, and marketing. Organizations around the world have used watsonx Assistant to help them build AI assistants for tasks like answering routine questions from customers or employees, modernizing their mainframes and legacy IT applications, helping students explore potential career paths, or providing digital mortgage support for home buyers.

英伟达继续发展其AI助手技术组合,从watsonx Orchestrate开始,帮助公司通过低代码工具和自动化构建他们自己的助手,到为特定任务和领域提供广泛的预构建助手,如客户服务、人力资源、销售和营销。世界各地的组织已经使用watsonx助手帮助他们构建AI助手,例如回答来自客户或员工的常见问题,使其主机和传统IT应用程序现代化,帮助学生探索潜在职业道路,或为购房者提供数字抵押贷款支持。

Today IBM also unveiled the upcoming release of the next generation of watsonx Code Assistant, powered by Granite code models, to offer general-purpose coding assistance across languages like C, C++, Go, Java, and Python, with advanced application modernization capabilities for Enterprise Java Applications.6 Granite's code capabilities are also now accessible through a Visual Studio Code extension, IBM Granite.Code.

今天英伟达还揭示了即将推出的下一代watsonx Code Assistant,由Granite代码模型驱动,提供跨C、C++、Go、Java和Python等语言的通用编码辅助功能,具有用于企业Java应用程序的先进应用现代化能力。Granite的代码功能现在还通过Visual Studio Code扩展可访问,英伟达Granite.Code。

IBM also plans to release new tools to help developers build, customize and deploy AI more efficiently via watsonx.ai – including agentic frameworks, integrations with existing environments and low-code automations for common use-cases like RAG and agents.7

英伟达还计划通过watsonx.ai发布新工具,帮助开发人员更有效地构建、定制和部署人工智能,包括代理框架、与现有环境的集成以及适用于常见用例如RAG和代理的低代码自动化。

IBM is focused on developing AI agent technologies which are capable of greater autonomy, sophisticated reasoning and multi-step problem solving. The initial release of the Granite 3.0 8B model features support for key agentic capabilities, such as advanced reasoning and a highly-structured chat template and prompting style for implementing tool use workflows. IBM also plans to introduce a new AI agent chat feature to IBM watsonx Orchestrate, which uses agentic capabilities to orchestrate AI Assistants, skills, and automations that help users increase productivity across their teams.8 IBM plans to continue building agent capabilities across its portfolio in 2025, including pre-built agents for specific domains and use-cases.

英伟达专注于开发能够拥有更大的自主性、复杂推理和多步问题解决能力的AI代理技术。Granite 3.0 80亿型的首次发布具有对关键代理能力的支持,如高级推理和用于实现工具使用工作流的高度结构化的聊天模板和提示风格。英伟达还计划向英伟达watsonx Orchestrate引入新的AI代理聊天功能,利用代理能力协调AI助手、技能和自动化,帮助用户在团队中提高生产率。英伟达计划在2025年继续跨其组合构建代理能力,包括特定领域和用例的预构建代理。

Expanded AI-powered delivery platform to supercharge IBM consultants with AI

扩展的人工智能交付平台将为IBm咨询顾问提供AI支持,以加速他们的工作效率

IBM is also announcing a major expansion of its AI-powered delivery platform, IBM Consulting Advantage. The multi-model platform contains AI agents, applications, and methods like repeatable frameworks that can empower 160,000 IBM consultants to deliver better and faster client value at a lower cost.

IBm还宣布了其基于人工智能的交付平台IBm Consulting Advantage的重大扩展。这个多模型平台包含AI代理、应用程序和可重复使用的框架等方法,可以让16万名IBm顾问以更低的成本提供更好更快的客户价值。

As part of the expansion, Granite 3.0 language models will become the default model in Consulting Advantage. Leveraging Granite's performance and efficiency, IBM Consulting will be able to help maximize the return-on-investment for the generative AI projects of IBM clients.

作为扩展的一部分,Granite 3.0语言模型将成为Consulting Advantage的默认模型。借助Granite的性能和效率,IBm Consulting将能够帮助最大程度地提高IBm客户生成式人工智能项目的投资回报。

Another key part of the expansion is the introduction of IBM Consulting Advantage for Cloud Transformation and Management and IBM Consulting Advantage for Business Operations. Each includes domain-specific AI agents, applications, and methods infused with IBM's best practices so IBM consultants can help accelerate client cloud and AI transformations in tasks, like code modernization and quality engineering, or transform and execute operations across domains, like finance, HR and procurement.

扩展的另一个关键部分是推出IBm Consulting Advantage for Cloud Transformation and Management以及IBm Consulting Advantage for Business Operations。每个都包括领域特定的AI代理、应用程序和融入了IBM最佳实践的方法,使IBm顾问能够帮助加速客户在云和人工智能转型方面的工作,比如代码现代化和质量工程,或在财务、人力资源和采购等领域转型和执行操作。

To learn more about Granite and IBM's AI for Business strategy, visit .

要了解更多关于Granite和IBM的企业AI战略,请访问。

1 Cost calculations are based on API cost per million tokens pricing of IBM watsonx for open models and openAI for GPT4 models (assuming blend of 80% inout, 20% output) for customer proofs-of-concept.
2 IBM Research technical paper: Granite 3.0 Language Models
3 IBM Research technical paper: Granite 3.0 Language Models
4 The Tiny Time Mixer: Fast Pre-Trained Models for Enhanced Zero/Few Shot Forecasting on Multivariate Time Series
5 Evaluation results published in Granite Guardian GitHub Repo
6 Planned availability for Q4 2024
7 Planned availability for Q4 2024
8 Planned availability for Q1 2025

成本计算基于IBm watsonx的API每百万标记定价以及openAI的GPT4模型(假设80%输入,20%输出的混合)用于客户概念验证。
IBm研究技术论文:Granite 3.0语言模型
IBm研究技术论文:Granite 3.0语言模型
4 小型时间混合器:针对多变量时间序列的增强零/少样本预测的快速预训练模型
5 评估结果已发布在Granite Guardian GitHub Repo
6 计划于2024年第四季度推出
7 计划于2024年第四季度推出
8 计划于2025年第一季度推出

Media Contact:
Amy Angelini
[email protected]

媒体联系人:
Amy Angelini
[email protected]

SOURCE IBM

资料来源IBM

WANT YOUR COMPANY'S NEWS FEATURED ON PRNEWSWIRE.COM?

想要您公司的新闻在PRNEWSWIRE.COM上特色呈现吗?

440k+
440k+

Newsrooms &
新闻发布室&

Influencers
影响力
9k+
9k+

Digital Media
数字媒体

Outlets
卖场
270k+
270k+

Journalists
新闻记者

Opted In
已选择加入
GET STARTED
开始使用
声明:本内容仅用作提供资讯及教育之目的,不构成对任何特定投资或投资策略的推荐或认可。 更多信息
    抢沙发