
2024.07.23 Launch of the "SLM Fine-Tuning" Custom Service, Which Uses Small Language Models Such as Phi-3, Llama-3, and GPT-4o mini to Improve the Answer Accuracy of Generative AI


Headwaters ·  Jul 22 11:00

Headwaters starts a custom SLM fine-tuning service to improve the answer accuracy of generative AI using small language models such as Phi-3, Llama-3, and GPT-4o mini.

Headwaters Co., Ltd. (Headquarters: Shinjuku-ku, Tokyo; Representative Director: Yosuke Shinoda; hereinafter "Headwaters"), which operates an AI solutions business, has begun offering a custom SLM fine-tuning service for companies promoting the business use of generative AI.
The service improves the answer accuracy of generative AI by using small language models, centered on open models such as Phi-3, Llama-3, and GPT-4o mini that can be selected from the Azure AI Model Catalog provided by Microsoft Corporation. It is intended for companies that have found generative AI difficult to use in business because of the accuracy of the text it produces.
Headwaters has developed many generative AI solutions, including LLM (Large Language Model) and RAG (Retrieval-Augmented Generation) systems as well as edge AI using SLMs (Small Language Models), and has expanded its lineup of corporate GPT services built on Azure OpenAI Service. From the many customers who use generative AI in their operations, common requests include "support for specialized, industry-specific, and in-house terminology," "suggestions and recommendations when specific keywords appear," and "improved answer accuracy," and Headwaters has been exploring solutions to these issues.
In response to these requests, Headwaters has launched a custom SLM fine-tuning service centered on Microsoft's SLM "Phi-3," Meta's "Llama-3," and OpenAI's "GPT-4o mini."
■ Issues with RAG
To use an LLM in business, each company usually needs to customize it with its own business data or prompts. Customization methods fall into two broad categories: RAG and fine-tuning. Because fine-tuning is technically difficult, requires well-prepared data, and is costly, it is common to start with RAG, which offers a good balance of cost and performance.
On the other hand, RAG has its own issues, such as "hallucinations (incorrect answers) caused by referencing too much data," "cases where the accuracy of the generative AI needs to be raised further," and "handling in-house, industry-specific, and specialized terms that are not widely known."
To address these issues, Headwaters uses SLM fine-tuning to handle "in-house, industry-specific, and specialized terms" and to "increase the accuracy of answers."
■ Features of SLM
The main feature of an SLM is that it is a lightweight counterpart to an LLM, but another key feature is the small amount of data it handles.
By preparing "industry-specific terms and nuances" and "answers that should take priority over other knowledge, such as answers that must never be wrong" as SLM training data, the risk of generating inaccurate or irrelevant information can be minimized; an illustrative sketch of such training data follows. Furthermore, compared with an LLM, an SLM saves computing resources, making it a cost-effective and efficient option that also reduces response time and energy consumption.
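The release does not describe the data format the service actually uses; as an illustration only, such training data is often prepared as prompt/response pairs in JSONL. The field names, file name, and example content below are assumptions.

# Illustrative only: in-house terms and "must never be wrong" answers written as
# prompt/response pairs in JSONL, a common format for supervised fine-tuning data.
# Field names, the file name, and the example content are assumptions.
import json

examples = [
    {"prompt": "What does the in-house term 'HW-Sync' refer to?",
     "response": "HW-Sync is the nightly batch that synchronizes sales data between regions."},
    {"prompt": "What is the expense approval limit for section managers?",
     "response": "Section managers can approve expenses of up to 300,000 yen."},
]

with open("inhouse_terms.jsonl", "w", encoding="utf-8") as f:
    for ex in examples:
        f.write(json.dumps(ex, ensure_ascii=False) + "\n")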
In addition, Headwaters uses Microsoft's "Phi-3" with an eye to affinity with Microsoft Azure and Copilot+ PC, a Japanese-trained model built on Meta's "Llama-3" to address Japanese-language support, which is considered a weak point of SLMs, and OpenAI's "GPT-4o mini" to address operating cost and speed.
Because an SLM handles only a small amount of data, fine-tuning, which would otherwise be expensive, can now be offered at a lower cost than LLM fine-tuning; a minimal sketch of one low-cost approach follows.
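The release does not state how the fine-tuning itself is performed. As a minimal sketch of one common low-cost approach, the snippet below applies parameter-efficient fine-tuning (LoRA) to a small model with the Hugging Face transformers, peft, and datasets libraries; the model ID, file name, and hyperparameters are assumptions for illustration, not the service's actual configuration.

# Minimal sketch, not Headwaters' actual pipeline: LoRA fine-tuning of a small
# causal language model on the JSONL data sketched above. Only the low-rank
# adapter weights are trained, which keeps compute cost far below full fine-tuning.
from datasets import load_dataset
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM, AutoTokenizer, Trainer, TrainingArguments

model_id = "microsoft/Phi-3-mini-4k-instruct"  # assumed model ID; any small causal LM works
tokenizer = AutoTokenizer.from_pretrained(model_id)
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token
model = get_peft_model(AutoModelForCausalLM.from_pretrained(model_id),
                       LoraConfig(r=16, lora_alpha=32, task_type="CAUSAL_LM"))

def tokenize(example):
    # Concatenate the prompt and the expected answer into one training sequence.
    enc = tokenizer(example["prompt"] + "\n" + example["response"],
                    truncation=True, max_length=512, padding="max_length")
    enc["labels"] = enc["input_ids"].copy()
    return enc

train_data = (load_dataset("json", data_files="inhouse_terms.jsonl")["train"]
              .map(tokenize, remove_columns=["prompt", "response"]))

Trainer(model=model,
        args=TrainingArguments(output_dir="slm-inhouse-lora",
                               per_device_train_batch_size=2,
                               num_train_epochs=3),
        train_dataset=train_data).train()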

Fine-tuning requires data science expertise, but by combining machine learning knowledge accumulated over many years and the expertise of Kaggle medalists with methods for isolating tacit knowledge and industry-specific terminology, Headwaters has built up several deployment cases in which accuracy improvements have been confirmed.
Headwaters will continue to pursue further cost-performance improvements by providing an Advanced RAG service that uses SLM fine-tuning and Microsoft Fabric, as well as "SyncLect Generative AI," a generative AI platform built on Microsoft Azure and SLM fine-tuning. It also supports the business use of generative AI at enterprise companies in manufacturing, finance, broadcasting, and healthcare, and customer-service platforms that use generative AI with relatively high answer accuracy.
■ Future prospects
Going forward, we will expand the SLM service lineup and roll out the following solutions:
- Multitask edge video analysis using multimodal SLMs such as "GPT-4o mini," "Phi-3 Vision," and "Florence-2"
- Generative AI x on-premises deployments that keep personal information out of the cloud
- Local SLMs that work in offline environments
- Windows AI applications that run on Copilot+ PCs
- On-device SLM applications that run on mobile devices, etc.
Headwaters regards its alliance strategy as one of the pillars of its medium- to long-term strategy and is working with customer companies to expand the generative AI economy. By incorporating generative AI into customers' businesses and introducing customers to one another, we aim to bring about a world where generative AI is used naturally and is close at hand.
An SLM (Small Language Model) is a language model that is smaller and lighter than an LLM (Large Language Model). It enables fast training and inference, improves resource efficiency, and excels in cost performance. It is also well suited to resource-constrained devices and edge computing, and can be operated securely with high confidentiality. The potential of smaller language models is attracting attention in the generative AI field, and their adoption is increasing.
Fine-tuning is a method of adding new layers to an already-trained model and retraining it. Because the trained model is reused, a model can be built with less data and in a shorter time than training from scratch.
Phi-3 is an open-source small language model (SLM) provided by Microsoft. It demonstrates top-level capability and cost efficiency, outperforming models of the same and the next-larger size class on a variety of language, reasoning, coding, and math benchmarks.
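As a minimal sketch of trying such a model locally (the Hugging Face model ID and prompt below are assumptions; the model can also be selected through the Azure AI Model Catalog):

# Minimal sketch: run a Phi-3 class small model locally with Hugging Face transformers.
# The model ID and prompt are illustrative assumptions.
from transformers import pipeline

slm = pipeline("text-generation", model="microsoft/Phi-3-mini-4k-instruct")
print(slm("Explain in one sentence what a small language model is.",
          max_new_tokens=60)[0]["generated_text"])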
GPT-4o mini is a small model in OpenAI's multimodal "GPT-4o" family. It is more than 60% cheaper for developers to use than GPT-3.5 Turbo and is significantly faster, in addition to offering improved accuracy.

The Azure AI Model Catalog is a catalog that packages leading foundation models from major generative AI providers such as OpenAI, Meta, Mistral AI, Stability AI, and Hugging Face to accelerate development.
What is RAG (Retrieval-Augmented Generation)?
Retrieval-Augmented Generation (RAG) is a technique that combines large language models (LLMs) with external databases and information sources; it searches those external knowledge sources to enrich the generated answers.
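A rough sketch of that retrieve-then-generate flow is shown below; the embedding model, the example documents, and the prompt wording are assumptions for illustration only.

# Rough sketch of RAG: embed the documents, retrieve the ones most similar to the
# question, and put only that context into the prompt sent to the language model.
# The embedding model name and example documents are assumptions.
from sentence_transformers import SentenceTransformer, util

docs = [
    "The in-house term 'HW-Sync' means the nightly sales-data synchronization batch.",
    "Section managers can approve expenses of up to 300,000 yen.",
]
embedder = SentenceTransformer("all-MiniLM-L6-v2")  # assumed embedding model
doc_vecs = embedder.encode(docs, convert_to_tensor=True)

def build_prompt(question, top_k=1):
    # Rank documents by similarity to the question and keep only the best matches.
    q_vec = embedder.encode(question, convert_to_tensor=True)
    hits = util.semantic_search(q_vec, doc_vecs, top_k=top_k)[0]
    context = "\n".join(docs[h["corpus_id"]] for h in hits)
    return f"Answer using only the context below.\nContext:\n{context}\nQuestion: {question}"

print(build_prompt("What does HW-Sync mean?"))
# The resulting prompt is then passed to an LLM or a fine-tuned SLM for the final answer.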
What is Copilot+ PC?
A Copilot+ PC is a new class of Windows 11 PC designed for AI-intensive tasks such as real-time translation and image generation, equipped with advanced chips featuring a very fast neural processing unit (NPU) capable of more than 40 trillion operations per second (TOPS).

■ Reference
Microsoft Fabric-based Advanced RAG service launched.
o.jp/news/gen_ai_microsoft_fabric_advanced_rag.html
Development of LLaVA Edge Vision, an industrial edge generative AI solution.

Verification of small language models (SLMs) and vision language models (VLMs) for generative AI x edge AI.

Advanced partner certification for Azure OpenAI Service reference architecture.

■ Trademarks
Microsoft, Windows and Azure are registered trademarks or trademarks of Microsoft Corporation in the United States and other countries.
The official name of Windows is Microsoft Windows Operating System.
Other proper nouns such as product names mentioned are trademarks or registered trademarks of their respective companies.
Company name: Headwaters Co., Ltd.
Location: Shinjuku Island Tower 4F, 6-5-1 Nishishinjuku, Shinjuku-ku, Tokyo 163-1304
Representative Director: Yosuke Shinoda
Established: November 2005
URL: https://www.headwaters.co.jp



Headwaters Co., Ltd.
Email: info@ml.headwaters.co.jp

