IBM Introduces Granite 3.0: High Performing AI Models Built for Business
IBM Introduces Granite 3.0: High Performing AI Models Built for Business
- New Granite 3.0 8B & 2B models, released under the permissive Apache 2.0 license, show strong performance across many academic and enterprise benchmarks, able to outperform or match similar-sized models
- New Granite Guardian 3.0 models deliver IBM's most comprehensive guardrail capabilities to advance safe and trustworthy AI
- New Granite 3.0 Mixture-of-Experts models enable extremely efficient inference and low latency, suitable for CPU-based deployments and edge computing
- New Granite Time Series model achieved state-of-the-art performance in zero/few-shot forecasting, outperforming models 10 times larger
- IBM unveils next generation of Granite-powered watsonx Code Assistant for general purpose coding; Debuts new tools in watsonx.ai for building and deploying AI applications and agents
- Announces Granite will become the default model of Consulting Advantage, an AI-powered delivery platform used by IBM's 160,000 consultants to bring new solutions to clients faster
- 新的花崗岩3.0 80億 和 20億 款型,發佈在寬鬆的阿帕奇石油2.0許可下,展示了在許多學術和企業基準中表現出色的性能,能夠勝過或與類似規模的模型相匹配
- 新的花崗守護者3.0 模型提供IBM最全面的道路護欄功能,以推動安全和可靠的人工智能
- 新的花崗岩3.0 專家混合模型實現了極高效的推理和低延遲,適用於基於CPU的部署和邊緣計算
- 新的花崗時間序列模型在零/少拍攝預測中取得了最先進的性能,勝過體積大10倍的模型
- IBM推出了基於Granite的下一代watsonx 代碼助手,用於通用編碼;在watsonx.ai中推出了用於構建和部署人工智能應用程序和代理的新工具
- 宣佈Granite將成爲Consulting Advantage的默認模型,這是IBM 16萬名顧問使用的AI驅動型交付平台,以更快地爲客戶帶來新解決方案
ARMONK, N.Y., Oct. 21, 2024 /PRNewswire/ -- Today, at IBM's (NYSE: IBM) annual TechXchange event the company announced the release of its most advanced family of AI models to date, Granite 3.0. IBM's third-generation Granite flagship language models can outperform or match similarly sized models from leading model providers on many academic and industry benchmarks, showcasing strong performance, transparency and safety.
2024年10月21日,紐約ARMONk /美通社/ -- 今天,在IBM (紐交所: IBM) 年度TechXchange活動上,該公司宣佈發佈迄今爲止最先進的AI模型系列,Granite 3.0。IBM第三代Granite旗艦語言模型在許多學術和行業基準上能夠超越或與領先的模型提供商的類似規模模型相匹配,展示出色的性能,透明度和安全性
Consistent with the company's commitment to open-source AI, the Granite models are released under the permissive Apache 2.0 license, making them unique in the combination of performance, flexibility and autonomy they provide to enterprise clients and the community at large.
符合公司對開源人工智能的承諾,Granite模型以寬鬆的Apache 2.0許可協議發佈,使其在性能、靈活性和自主性的結合方面爲企業客戶和整個社區提供了獨特的優勢。
IBM's Granite 3.0 family includes:
IBM的Granite 3.0系列包括:
- General Purpose/Language: Granite 3.0 8B Instruct, Granite 3.0 2B Instruct, Granite 3.0 8B Base, Granite 3.0 2B Base
- Guardrails & Safety: Granite Guardian 3.0 8B, Granite Guardian 3.0 2B
- Mixture-of-Experts: Granite 3.0 3B-A800M Instruct, Granite 3.0 1B-A400M Instruct, Granite 3.0 3B-A800M Base, Granite 3.0 1B-A400M Base
- 通用語言:Granite 3.0 80億 Instruct、Granite 3.0 20億 Instruct、Granite 3.0 80億 Base、Granite 3.0 20億 Base
- 防護欄與安全:Granite Guardian 3.0 80億、Granite Guardian 3.0 2B
- 專家組合:Granite 3.0 30億.A80000萬 Instruct、Granite 3.0 10億.A40000萬 Instruct、Granite 3.0 30億.A80000萬 Base、Granite 3.0 10億.A40000萬 Base
The new Granite 3.0 8B and 2B language models are designed as 'workhorse' models for enterprise AI, delivering strong performance for tasks such as Retrieval Augmented Geneneration (RAG), classification, summarization, entity extraction, and tool use. These compact, versatile models are designed to be fine-tuned with enterprise data and seamlessly integrated across diverse business environments or workflows.
全新的Granite 3.0 80億和20億語言模型被設計爲企業人工智能的「工作馬」型號,爲檢索增強生成(RAG)、分類、摘要、實體提取和工具使用等任務提供強大性能。這些緊湊、多功能的模型旨在通過企業數據進行微調,並在不同的業務環境或工作流中實現無縫集成。
While many large language models (LLMs) are trained on publicly available data, a vast majority of enterprise data remains untapped. By combining a small Granite model with enterprise data, especially using the revolutionary alignment technique InstructLab – introduced by IBM and RedHat in May – IBM believes businesses can achieve task-specific performance that rivals larger models at a fraction of the cost (based on an observed range of 3x-23x less cost than large frontier models in several early proofs-of-concept1).
雖然許多大型語言模型(LLMs)是在公開可用數據上進行訓練的,但絕大多數企業數據仍未被利用。通過將小型Granite模型與企業數據結合,尤其是使用5月份由IBm和RedHat介紹的革命性對齊技術InstructLab,IBm相信企業可以以低於大型先驅模型多達3倍至23倍的成本(基於早期證明概念1的觀察範圍)實現特定任務的性能。
The Granite 3.0 release reaffirms IBM's commitment to building transparency, safety, and trust in AI products. The Granite 3.0 technical report and responsible use guide provide a description of the datasets used to train these models, details of the filtering, cleansing, and curation steps applied, along with comprehensive results of model performance across major academic and enterprise benchmarks.
Granite 3.0發佈再次確認了IBM在構建人工智能產品的透明度、安全性和信任度方面的承諾。Granite 3.0技術報告和負責任使用指南提供了訓練這些模型所使用數據集的描述,所應用的過濾、清潔和篩選步驟的詳細信息,以及對主要學術和企業基準測試中模型性能的全面結果。
Critically, IBM provides an IP indemnity for all Granite models on watsonx.ai so enterprise clients can be more confident in merging their data with the models.
重要的是,IBM爲watsonx.ai上的所有Granite模型提供知識產權保護,因此企業客戶可以更加放心地將其數據與這些模型合併。
Raising the bar: Granite 3.0 benchmarks
提升標準:Granite 3.0基準測試
The Granite 3.0 language models also demonstrate promising results on raw performance.
Granite 3.0語言模型在原始性能方面也展現出令人期待的結果。
On standard academic benchmarks defined by Hugging Face's OpenLLM Leaderboard, the Granite 3.0 8B Instruct model's overall performance leads on average against state-of-the-art-performance of similar-sized open source models from Meta and Mistral. On IBM's state-of-the-art AttaQ safety benchmark, the Granite 3.0 8B Instruct model leads across all measured safety dimensions compared to models from Meta and Mistral.2
根據Hugging Face的OpenLLM排行榜所定義的標準學術基準測試,Granite 3.0 80億 Instruct模型的整體性能平均領先於Meta和Mistral等規模相似的開源模型的最新性能。在IBM的最新AttaQ安全基準測試中,Granite 3.0 80億 Instruct模型在所有測量的安全維度上均領先於Meta和Mistral的模型。
Across the core enterprise tasks of RAG, tool use, and tasks in the Cybersecurity domain, the Granite 3.0 8B Instruct model shows leading performance on average compared to similar-sized open source models from Mistral and Meta.3
在RAG、工具使用和網絡安全領域的核心企業任務中,Granite 3.0 80億 Instruct模型的整體表現平均優於來自Mistral和Meta等規模相似的開源模型。
The Granite 3.0 models were trained on over 12 trillion tokens on data taken from 12 different natural languages and 116 different programming languages, using a novel two-stage training method, leveraging results from several thousand experiments designed to optimize data quality, data selection, and training parameters. By the end of the year, the 3.0 8B and 2B language models are expected to include support for an extended 128K context window and multi-modal document understanding capabilities.
Granite 3.0模型在來自12種不同自然語言和116種不同編程語言的超過12萬億標記的數據上進行了訓練,採用了一種新穎的兩階段訓練方法,利用了數千次實驗的結果,旨在優化數據質量、數據選擇和訓練參數。到年底,預計3.0 80億和20億語言模型將支持擴展的12.8萬上下文窗口和多模態文檔理解能力。
Demonstrating an excellent balance of performance and inference cost, IBM offers its Granite Mixture of Experts (MoE) Architecture models, Granite 3.0 1B-A400M and Granite 3.0 3B-A800M, as smaller, lightweight models that could be deployed for low latency applications as well as CPU-based deployments.
展示了出色的性能與推斷成本之間的平衡,IBM提供其Granite專家混合(MoE)架構模型,Granite 3.0 10億.A40000萬和Granite 3.0 30億.A80000萬,作爲較小、輕量化的模型,可用於低延遲應用以及基於CPU的部署。
IBM is also announcing an updated release of its pre-trained Granite Time Series models, the first versions of which were released earlier this year. These new models are trained on 3 times more data and deliver strong performance on all three major time series benchmarks, outperforming 10 times larger models from Google, Alibaba, and others. The updated models also provide greater modeling flexibility with support for external variables and rolling forecasts.4
IBM還宣佈更新了其預訓練的Granite時間序列模型,其中第一個版本是在今年早些時候發佈的。這些新模型經過了3倍數據訓練,在所有三個主要時間序列基準測試上表現出色,超過了來自Google、阿里巴巴等機構10倍更大的模型。更新後的模型還提供更大的建模靈活性,支持外部變量和滾動預測。
Introducing Granite Guardian 3.0: ushering the next era of responsible AI
推出Granite Guardian 3.0:邁入下一個負責任人工智能時代
As part of this release, IBM is also introducing a new family of Granite Guardian models that permit application developers to implement safety guardrails by checking user prompts and LLM responses for a variety of risks. The Granite Guardian 3.0 8B and 2B models provide the most comprehensive set of risk and harm detection capabilities available in the market today.
作爲此版本的一部分,IBm還推出了一系列新的Granite Guardian模型,允許應用程序開發人員通過檢查用戶提示和LLm響應來實施安全防護措施,以防範各種風險。Granite Guardian 3.0 80億和20億模型在市場上提供了最全面的風險和危害檢測能力。
In addition to harm dimensions such as social bias, hate, toxicity, profanity, violence, jailbreaking and more, these models also provide a range of unique RAG-specific checks such as groundedness, context relevance, and answer relevance. In extensive testing across 19 safety and RAG benchmarks, the Granite Guardian 3.0 8B model has higher overall accuracy on harm detection on average than all three generations of Llama Guard models from Meta. It also showed on par overall performance in hallucination detection on average with specialized hallucination detection models WeCheck and MiniCheck.5
除了社會偏見、仇恨、毒性、褻瀆、暴力、越獄等危害維度,這些模型還提供一系列獨特的RAG特定檢查,如扎根性、上下文相關性和答案相關性。在對19個安全和RAG基準的廣泛測試中,Granite Guardian 3.0 80億模型的平均危害檢測準確率高於Meta的三代Llama Guard模型。它在幻覺檢測的整體表現方面展現出與專門的幻覺檢測模型WeCheck和MiniCheck相當的性能水平。
While the Granite Guardian models are derived from the corresponding Granite language models, they can be used to implement guardrails alongside any open or proprietary AI models.
雖然Granite Guardian模型源自相應的Granite語言模型,但它們可以用來在任何開源或專有人工智能模型旁邊實施防護欄。
Availability of Granite 3.0 models
The entire suite of Granite 3.0 models and the updated time series models are available for download on HuggingFace under the permissive Apache 2.0 license. The instruct variants of the new Granite 3.0 8B and 2B language models and the Granite Guardian 3.0 8B and 2Bmodels are available today for commercial use on IBM's watsonx platform. A selection of the Granite 3.0 models will also be available as NVIDIA NIM microservices and through Google Cloud's Vertex AI Model Garden integrations with HuggingFace.
Granite 3.0模型的可用性
整套Granite 3.0模型和更新後的時間序列模型可在HuggingFace上按照Apache 2.0許可證進行下載。新的Granite 3.0 80億和20億語言模型以及Granite Guardian 3.0 80億和20億模型的指示變體今天已在IBM的watsonx平台上商用。選定的Granite 3.0模型還將作爲NVIDIA NIm微服務和通過Google Cloud的Vertex AI Model Garden與HuggingFace集成提供。
To help provide developer choice and ease of use and support local, edge deployments, a curated set of the Granite 3.0 models are also available on Ollama and Replicate.
爲了幫助提供開發者選擇和易用性,並支持本地、邊緣部署,Granite 3.0模型的精選套件也可在Ollama和Replicate上使用。
The latest generation of Granite models expand IBM's robust open-source catalog of powerful LLMs. IBM has collaborated with ecosystem partners like AWS, Docker, Domo, Qualcomm Technologies, Inc. via its Qualcomm AI Hub, Salesforce, SAP, and others to integrate a variety of Granite models into these partners' offerings or make Granite models available on their platforms, offering greater choice to enterprises across the world.
Granite最新一代模型擴展了IBM強大的開源LLM目錄。IBm已與AWS、Docker、Domo、高通技術公司(通過其高通AI中心)、Salesforce、SAP等生態系統合作伙伴合作,將各種Granite模型整合到這些合作伙伴的產品中,或在其平台上提供Granite模型,爲全球企業提供更多選擇。
Assistants to Agents: realizing the future for enterprise AI
助理到代理商:實現企業人工智能的未來
IBM is advancing enterprise AI through a spectrum of technologies – from models and assistants, to the tools needed to tune and deploy AI specifically for companies' unique data and use-cases. IBM is also paving the way for future AI agents that can self-direct, reflect, and perform complex tasks in dynamic business environments.
英偉達正在通過各種技術推動企業人工智能發展,從模型和助手到爲公司的獨特數據和用例調整和部署人工智能所需的工具。英偉達還爲未來能夠自主、反思並在動態商業環境中執行復雜任務的人工智能代理鋪平了道路。
IBM continues to evolve its portfolio of AI assistant technologies – from watsonx Orchestrate to help companies build their own assistants via low-code tooling and automation, to a wide set of pre-built assistants for specific tasks and domains such as customer service, human resources, sales, and marketing. Organizations around the world have used watsonx Assistant to help them build AI assistants for tasks like answering routine questions from customers or employees, modernizing their mainframes and legacy IT applications, helping students explore potential career paths, or providing digital mortgage support for home buyers.
英偉達繼續發展其AI助手技術組合,從watsonx Orchestrate開始,幫助公司通過低代碼工具和自動化構建他們自己的助手,到爲特定任務和領域提供廣泛的預構建助手,如客戶服務、人力資源、銷售和營銷。世界各地的組織已經使用watsonx助手幫助他們構建AI助手,例如回答來自客戶或員工的常見問題,使其主機和傳統IT應用程序現代化,幫助學生探索潛在職業道路,或爲購房者提供數字抵押貸款支持。
Today IBM also unveiled the upcoming release of the next generation of watsonx Code Assistant, powered by Granite code models, to offer general-purpose coding assistance across languages like C, C++, Go, Java, and Python, with advanced application modernization capabilities for Enterprise Java Applications.6 Granite's code capabilities are also now accessible through a Visual Studio Code extension, IBM Granite.Code.
今天英偉達還揭示了即將推出的下一代watsonx Code Assistant,由Granite代碼模型驅動,提供跨C、C++、Go、Java和Python等語言的通用編碼輔助功能,具有用於企業Java應用程序的先進應用現代化能力。Granite的代碼功能現在還通過Visual Studio Code擴展可訪問,英偉達Granite.Code。
IBM also plans to release new tools to help developers build, customize and deploy AI more efficiently via watsonx.ai – including agentic frameworks, integrations with existing environments and low-code automations for common use-cases like RAG and agents.7
英偉達還計劃通過watsonx.ai發佈新工具,幫助開發人員更有效地構建、定製和部署人工智能,包括代理框架、與現有環境的集成以及適用於常見用例如RAG和代理的低代碼自動化。
IBM is focused on developing AI agent technologies which are capable of greater autonomy, sophisticated reasoning and multi-step problem solving. The initial release of the Granite 3.0 8B model features support for key agentic capabilities, such as advanced reasoning and a highly-structured chat template and prompting style for implementing tool use workflows. IBM also plans to introduce a new AI agent chat feature to IBM watsonx Orchestrate, which uses agentic capabilities to orchestrate AI Assistants, skills, and automations that help users increase productivity across their teams.8 IBM plans to continue building agent capabilities across its portfolio in 2025, including pre-built agents for specific domains and use-cases.
英偉達專注於開發能夠擁有更大的自主性、複雜推理和多步問題解決能力的AI代理技術。Granite 3.0 80億型的首次發佈具有對關鍵代理能力的支持,如高級推理和用於實現工具使用工作流的高度結構化的聊天模板和提示風格。英偉達還計劃向英偉達watsonx Orchestrate引入新的AI代理聊天功能,利用代理能力協調AI助手、技能和自動化,幫助用戶在團隊中提高生產率。英偉達計劃在2025年繼續跨其組合構建代理能力,包括特定領域和用例的預構建代理。
Expanded AI-powered delivery platform to supercharge IBM consultants with AI
擴展的人工智能交付平台將爲IBm諮詢顧問提供AI支持,以加速他們的工作效率
IBM is also announcing a major expansion of its AI-powered delivery platform, IBM Consulting Advantage. The multi-model platform contains AI agents, applications, and methods like repeatable frameworks that can empower 160,000 IBM consultants to deliver better and faster client value at a lower cost.
IBm還宣佈了其基於人工智能的交付平台IBm Consulting Advantage的重大擴展。這個多模型平台包含AI代理、應用程序和可重複使用的框架等方法,可以讓16萬名IBm顧問以更低的成本提供更好更快的客戶價值。
As part of the expansion, Granite 3.0 language models will become the default model in Consulting Advantage. Leveraging Granite's performance and efficiency, IBM Consulting will be able to help maximize the return-on-investment for the generative AI projects of IBM clients.
作爲擴展的一部分,Granite 3.0語言模型將成爲Consulting Advantage的默認模型。藉助Granite的性能和效率,IBm Consulting將能夠幫助最大程度地提高IBm客戶生成式人工智能項目的投資回報。
Another key part of the expansion is the introduction of IBM Consulting Advantage for Cloud Transformation and Management and IBM Consulting Advantage for Business Operations. Each includes domain-specific AI agents, applications, and methods infused with IBM's best practices so IBM consultants can help accelerate client cloud and AI transformations in tasks, like code modernization and quality engineering, or transform and execute operations across domains, like finance, HR and procurement.
擴展的另一個關鍵部分是推出IBm Consulting Advantage for Cloud Transformation and Management以及IBm Consulting Advantage for Business Operations。每個都包括領域特定的AI代理、應用程序和融入了IBM最佳實踐的方法,使IBm顧問能夠幫助加速客戶在雲和人工智能轉型方面的工作,比如代碼現代化和質量工程,或在財務、人力資源和採購等領域轉型和執行操作。
To learn more about Granite and IBM's AI for Business strategy, visit .
要了解更多關於Granite和IBM的企業AI戰略,請訪問。
1 Cost calculations are based on API cost per million tokens pricing of IBM watsonx for open models and openAI for GPT4 models (assuming blend of 80% inout, 20% output) for customer proofs-of-concept.
2 IBM Research technical paper: Granite 3.0 Language Models
3 IBM Research technical paper: Granite 3.0 Language Models
4 The Tiny Time Mixer: Fast Pre-Trained Models for Enhanced Zero/Few Shot Forecasting on Multivariate Time Series
5 Evaluation results published in Granite Guardian GitHub Repo
6 Planned availability for Q4 2024
7 Planned availability for Q4 2024
8 Planned availability for Q1 2025
成本計算基於IBm watsonx的API每百萬標記定價以及openAI的GPT4模型(假設80%輸入,20%輸出的混合)用於客戶概念驗證。
IBm研究技術論文:Granite 3.0語言模型
IBm研究技術論文:Granite 3.0語言模型
4 小型時間混合器:針對多變量時間序列的增強零/少樣本預測的快速預訓練模型
5 評估結果已發佈在Granite Guardian GitHub Repo
6 計劃於2024年第四季度推出
7 計劃於2024年第四季度推出
8 計劃於2025年第一季度推出
Media Contact:
Amy Angelini
[email protected]
媒體聯繫人:
Amy Angelini
[email protected]
SOURCE IBM
資料來源IBM