Jensen Huang's remarks ignite US stocks! Nvidia adds RMB 1.54 trillion in market value overnight. What did Jensen Huang say?

Gelonghui Finance · Sep 11 22:07

Blackwell is in such demand that customer relationships are strained

Jensen Huang's remarks have stirred up the capital markets once again.

After a brief slump at the open, US stocks staged a dramatic V-shaped reversal, and all three major indices closed higher. The Nasdaq rose 2.17%, its largest single-day gain since August 16.

The reversal was driven not only by the market's own correction but also by the strong performance of technology giants such as Nvidia and by subtle shifts in market expectations for Federal Reserve interest rate policy.

Nvidia briefly turned negative in early trading, slipping toward $107, then extended its gains as Jensen Huang spoke. It was still setting new intraday highs near the close, rising as much as 8.4% and briefly topping $117.

Nvidia ultimately closed up 8.15%, its largest gain in six weeks, and its market value jumped by an astonishing $216.1 billion (about RMB 1.54 trillion) overnight.

Led by Nvidia, semiconductor stocks rose broadly: Arm gained more than 10%, Broadcom more than 6%, and TSMC and Micron more than 4% each.

Jensen Huang: Blackwell is in hot demand, and Nvidia is able to shift orders away from TSMC!

Speaking at a Goldman Sachs conference, CEO Jensen Huang said Nvidia's products have become the most sought-after commodity in the technology industry, with customers competing for limited supply. In particular, supply of the Blackwell AI chip is growing only so fast, which has left some customers frustrated.

He also hinted that, if necessary, Nvidia would reduce its dependence on TSMC and turn to other chip manufacturers.

He told the audience, "Demand for our products is very strong. Everyone wants to be the first to receive them, and everyone wants to receive the most. We probably have more emotional customers today, and understandably so. Relationships are tense, but we are doing the best we can."

Huang said the company's latest chip, Blackwell (billed as the "most powerful AI chip"), is especially popular, and suppliers are working hard to meet demand.

Asked whether the enormous spending on AI is delivering a return on investment for customers, Huang said companies have no choice but to embrace "accelerated computing."

He explained that NVIDIA's technology not only accelerates traditional workloads, such as data processing, but also handles AI tasks that old technologies cannot cope with.

Huang also said Nvidia relies heavily on TSMC for chip production because TSMC is far ahead of the field in chip manufacturing.

He added, however, that Nvidia could switch to other suppliers if necessary, because it has a backup plan in place and has developed most of the technology in-house.

Such a change, he said, could lower the quality of its chips. "TSMC's agility and their ability to respond to our needs are just incredible."

In addition, the US government is considering allowing Nvidia to export advanced chips to Saudi Arabia. Nvidia's shares rose sharply after the news; the stock has more than doubled so far this year.

Appendix: Jensen Huang's conversation with Goldman Sachs, full text (translated from an edited Chinese version)

1. Let's start with some of your thinking when you founded the company 31 years ago. Since then, you have transformed it from a gaming-focused GPU company into one that provides a broad range of hardware and software to the data center industry. Can you talk about that journey? What were you thinking when you started? How has it evolved? What are your key priorities going forward, and how do you see the world ahead?

Jensen Huang: I would say the one thing we got right is that we foresaw another form of computing that could augment general-purpose computing and solve problems that general-purpose tools never could. That processor would start out doing something extremely difficult for CPUs: computer graphics.

But we would gradually expand into other areas. The first area we chose was, of course, image processing, which is complementary to computer graphics. We extended it to physics simulation, because in the video game domain we had chosen, you want the result not only to look beautiful but also to be dynamic, to create virtual worlds. We expanded step by step and took it into scientific computing. One of the first applications was molecular dynamics simulation; another was seismic processing, which is essentially inverse physics. Seismic processing is very similar to CT reconstruction, another form of inverse physics. So we worked through the problems one step at a time, expanded into adjacent industries, and eventually solved them.

The core idea we have always held to is that accelerated computing can solve interesting problems. Our architecture stays consistent, which means software developed today runs on the large installed base already in the field, and software developed in the past is accelerated by new technology. This way of thinking about architectural compatibility, building a large installed base, and growing together with the ecosystem dates back to 1993, and we have carried it through to today. That is why Nvidia's CUDA has such a huge installed base: we have always protected it. Protecting the investment of software developers has been our top priority from the very beginning.

Looking ahead, some of the problems we solved along the way, which of course include learning how to be a founder, how to be a CEO, how to run a business, and how to build a company, were all new skills. It was a bit like inventing the modern computer gaming industry. People may not know this, but Nvidia has the largest installed base of any video game architecture in the world. GeForce has about 300 million gamers and is still growing fast and is very vibrant. So I think every time we enter a new market, we have to learn new algorithms and market dynamics and create a new ecosystem.

The reason we have to do this is that, unlike a general-purpose computer, where everything eventually runs once the processor is built, ours is an accelerated computer, which means you have to ask yourself what you are going to accelerate. There is no such thing as a universal accelerator.

2. Let's talk in depth about the difference between general-purpose computing and accelerated computing.

Jensen Huang: If you look at software today, the programs you write contain a lot of file input and output, parts that set up data structures, and some magical algorithmic kernels. These algorithms differ depending on whether they are for computer graphics, image processing, or something else. It could be fluids, particles, inverse physics, or something in the image domain. So all of these algorithms are different. If you create a processor that is especially good at these algorithms and complements the CPU, which handles the tasks it is good at, then in theory you can speed up an application enormously. The reason is that typically 5% to 10% of the code accounts for 99.99% of the runtime.

So if you offload that 5% of the code to our accelerator, you can technically speed up the application 100 times. That is not unusual. We routinely accelerate image processing 500 times. Now we are doing data processing. Data processing is one of my favorite applications, because almost everything related to machine learning is evolving. It can be SQL data processing, Spark-style data processing, or vector database processing, handling unstructured or structured data, all of it in the form of data frames.
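The arithmetic behind this claim is usually expressed as Amdahl's law. The sketch below is an editorial illustration, not something presented in the interview: the function names are hypothetical, and it assumes the quoted 99.99% runtime share together with an offloaded kernel that runs 100 times faster on the accelerator.

```python
def amdahl_speedup(accelerated_fraction: float, kernel_speedup: float) -> float:
    """Overall speedup when a share of the original runtime is made faster.

    accelerated_fraction: share of original runtime spent in the offloaded code (0..1).
    kernel_speedup: how many times faster that code runs once offloaded (assumed).
    """
    if not 0.0 <= accelerated_fraction <= 1.0:
        raise ValueError("accelerated_fraction must be between 0 and 1")
    if kernel_speedup <= 0.0:
        raise ValueError("kernel_speedup must be positive")
    # Runtime that stays on the CPU, plus the shrunken accelerated part.
    new_runtime = (1.0 - accelerated_fraction) + accelerated_fraction / kernel_speedup
    return 1.0 / new_runtime


def speedup_ceiling(accelerated_fraction: float) -> float:
    """Upper bound on the speedup if the offloaded code took no time at all."""
    if not 0.0 <= accelerated_fraction <= 1.0:
        raise ValueError("accelerated_fraction must be between 0 and 1")
    if accelerated_fraction == 1.0:
        return float("inf")
    return 1.0 / (1.0 - accelerated_fraction)


if __name__ == "__main__":
    # Figure quoted in the interview: the hot code is 99.99% of runtime.
    print(round(amdahl_speedup(0.9999, 100.0), 1))
    # Hypothetical contrast: only 90% of runtime can be offloaded.
    print(round(amdahl_speedup(0.90, 100.0), 1))
```

Under these assumptions the whole application runs roughly 99 times faster, and the theoretical ceiling is about 10,000 times. If only 90% of the runtime could be offloaded, the same kernel would yield only about 9 times, which is why the share of runtime matters more than the share of code.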

We accelerate these enormously, but to do that you need to create a top-level library. In computer graphics we were fortunate to have Silicon Graphics' OpenGL and Microsoft's DirectX, but beyond those, no libraries really existed. So, for example, one of our most famous libraries is a library much like SQL. SQL is a library for storage computing; we created a library that was the world's first neural network computing library.

We have cuDNN (a library for neural network computing), cuOpt (a library for combinatorial optimization), cuQuantum (a library for quantum simulation and emulation), and many others, such as cuDF for data frame processing, which offers SQL-like functionality. All of these libraries had to be invented. They restructure the algorithms in an application so that our accelerators can run them. If you use these libraries, you can get a 100-fold speedup or more, which is amazing.

So the concept is simple and makes a great deal of sense, but the question is how you go about inventing these algorithms and getting the video game industry to use them, writing these algorithms and getting the entire seismic processing and energy industry to use them, writing new algorithms and getting the entire AI industry to use them. Do you see what I mean? For every one of these libraries, we first had to do the computer science research, and then we had to go through the process of developing the ecosystem.

We had to persuade everyone to use these libraries, and then think about which kinds of computers they run on, since every computer is different. So we moved step by step from one domain into the next. We created a very rich library for self-driving cars, an excellent library for robotics, an incredible library for virtual screening, whether physics-based or neural-network-based, and an amazing library for climate technology.

We had to go make friends and create markets. As it turns out, what Nvidia is really good at is creating new markets. We have been doing this for so long now that Nvidia's accelerated computing seems to be everywhere, but we really did have to do it one step at a time, developing markets one industry at a time.

3. Many investors here today are very focused on the data center market. Can you share your view of the medium- and long-term opportunity? Clearly, your industry is driving what you call the "next industrial revolution." How do you see the current state of the data center market and the challenges ahead?

Jensen Huang: Two things are happening at the same time, and they are often conflated, so it helps to discuss them separately. First, let's assume a world in which AI does not exist. In that world, general-purpose computing has stalled. As everyone knows, some of the principles of semiconductor physics, such as Moore's Law and Dennard scaling, have run their course. We no longer see CPU performance doubling every year. We are lucky if we see performance double in ten years. Moore's Law used to mean a tenfold performance gain in five years and a hundredfold gain in ten.
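To put the quoted growth figures on a common footing, they can be converted into yearly compound rates. The following is a minimal sketch added for illustration; the helper names are hypothetical, and it assumes growth is smooth and compounds once per year.

```python
import math


def annual_growth_factor(total_factor: float, years: float) -> float:
    """Constant yearly multiplier that compounds to `total_factor` over `years`."""
    if total_factor <= 0.0 or years <= 0.0:
        raise ValueError("total_factor and years must be positive")
    return total_factor ** (1.0 / years)


def doubling_time_years(total_factor: float, years: float) -> float:
    """Years needed to double at the growth rate implied by the inputs."""
    if total_factor <= 1.0 or years <= 0.0:
        raise ValueError("total_factor must exceed 1 and years must be positive")
    return years * math.log(2.0) / math.log(total_factor)


if __name__ == "__main__":
    # "Tenfold in five years" and "hundredfold in ten" imply the same yearly rate.
    print(round(annual_growth_factor(10.0, 5.0), 3))
    print(round(annual_growth_factor(100.0, 10.0), 3))
    # "Double in ten years" is a far slower pace.
    print(round(annual_growth_factor(2.0, 10.0), 3))
```

On these assumptions, a tenfold gain in five years works out to roughly 58% growth per year, or a doubling about every year and a half, while doubling once in ten years is only about 7% per year.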

All of that is over now, so we have to accelerate everything that can be accelerated. If you are doing SQL processing, accelerate it; if you are doing any kind of data processing, accelerate it; if you are building an internet company with a recommender system, you have to accelerate it. The largest recommender engines today are all accelerated. A few years ago they still ran on CPUs; now they are all accelerated. So the first dynamic is that the world's trillions of dollars' worth of general-purpose data centers will be modernized into accelerated computing data centers. This is inevitable.

In addition, because Nvidia's accelerated computing has driven costs down so dramatically, computing power over the past decade has grown not 100-fold but a million-fold. So the question becomes: if your plane could fly a million times faster, what would you do differently?

So people suddenly realized, "Why don't we let the computer write the software, instead of imagining the features or designing the algorithms ourselves?" We just hand all the data, all the predictive data, to the computer and let it work out the algorithm. That is machine learning, generative AI. So we have applied it at scale across many different domains of data, where the computer not only knows how to process the data but also understands what it means. Because it understands multiple modalities of data at once, it can translate between them.

Therefore, we can convert from English to images, from images to English, from English to proteins, and from proteins to chemicals. Because it understands all the data, it can perform all these translation processes, which we call generative AI. It can convert a large amount of text into a small amount of text, or expand a small amount of text into a large amount of text, and so on. We are now in the era of this computer revolution.

What is remarkable now is that the first trillions of dollars' worth of data centers will be accelerated, and we have also invented this new kind of software called generative AI. Generative AI is not just a tool; it is a skill. That is why new industries are being created.

Why is that? If you look at the entire IT industry up to now, we have been making tools and instruments that people use. For the first time, we are creating skills that augment human capabilities. That is why people think AI will reach beyond the trillions of dollars of data centers and the IT industry and into the world of skills.

So what is a skill? Digital currency is a skill, for example; a self-driving car is a skill; so is a digital assembly line worker, a robot, digital customer service, a chatbot, or a digital employee that plans Nvidia's supply chain. That could be a digital agent for SAP. Our company uses ServiceNow heavily, and we now have digital employee services. So we now have these digital humans, and that is the AI wave we are in right now.

4. There is an ongoing debate in the financial market about whether the investment return is sufficient as we continue to build AI infrastructure. How do you assess the return on investment that clients have received in this cycle? If you look back on history, look back on PCs and cloud computing, how was the return on investment in similar adoption cycles? What are the differences compared to now?

Jensen Huang: That is a very good question. Let's take a look. Before cloud computing, the biggest trend was virtualization, if you remember. Virtualization basically meant that we virtualized all the hardware in the data center into a virtual data center, so that we could move workloads across the data center without tying them to a specific computer. As a result, data center utilization went up, and we saw data center costs fall by a factor of two to two and a half, almost overnight.

Then we put those virtual computers into the cloud. As a result, not just one company but many companies could share the same resources, so costs fell again and utilization rose again.

All of the progress in those years masked the fundamental change underneath, which was the end of Moore's Law. We got a twofold or greater cost reduction from higher utilization, but that, too, ran into the limits of transistor and CPU performance.

Then all of those utilization gains hit their limits, which is why we are now seeing data center and computing inflation. So the first thing that is happening is accelerated computing. When you process data, for example with Spark, which is one of the most widely used data processing engines in the world today, and you accelerate Spark with Nvidia accelerators, you can see a 20-fold speedup. That means you cut your cost by a factor of 10.

Of course, your computing bill goes up a bit because you have to pay for the Nvidia GPUs. The cost of compute might double, but you cut the computing time by a factor of 20, so you end up reducing cost by a factor of 10. That kind of return on investment is not unusual for accelerated computing. So my advice is to accelerate every workload that can be accelerated and to use GPUs to do it; the return on investment is immediate.
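The cost argument here reduces to one line of arithmetic. The sketch below is illustrative only: the function names and the dollar figures are hypothetical, and it assumes that the cost of a job is simply the hourly price of the machine multiplied by the hours it runs, with no other overheads.

```python
def job_cost(hourly_price: float, hours: float) -> float:
    """Cost of one job, assuming cost = hourly price x runtime."""
    if hourly_price < 0.0 or hours < 0.0:
        raise ValueError("hourly_price and hours must be non-negative")
    return hourly_price * hours


def savings_factor(price_multiplier: float, speedup: float) -> float:
    """How many times cheaper the accelerated job is than the baseline.

    price_multiplier: accelerated hourly price divided by baseline hourly price.
    speedup: baseline runtime divided by accelerated runtime.
    """
    if price_multiplier <= 0.0 or speedup <= 0.0:
        raise ValueError("price_multiplier and speedup must be positive")
    return speedup / price_multiplier


if __name__ == "__main__":
    # Hypothetical baseline: a CPU job at $1 per hour that runs for 100 hours.
    baseline = job_cost(1.0, 100.0)
    # Quoted figures: the hourly price doubles and the runtime shrinks 20-fold.
    accelerated = job_cost(2.0, 100.0 / 20.0)
    print(baseline, accelerated, savings_factor(2.0, 20.0))
```

With a doubled hourly price and a 20-fold shorter runtime, the hypothetical $100 job costs $10, which matches the factor-of-10 saving quoted in the interview.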

Beyond that, the generative AI discussion concerns the first wave of AI today, in which infrastructure players (such as ourselves and all the cloud service providers) put infrastructure in the cloud so that developers can use these machines to train models, fine-tune models, safeguard models, and so on. Demand is so strong that for every $1 spent with us, cloud service providers can earn $5 in rental revenue. This is happening all over the world, and everything is in short supply. So demand for this capacity is enormous.

Some of the applications we have already seen, including well-known ones such as OpenAI's ChatGPT, GitHub's Copilot, and the code generators we use inside our company, deliver incredible productivity gains. Every software engineer in our company now uses code generators, whether the ones we built ourselves for CUDA, the ones for USD (another language we use in the company), or the ones for Verilog, C, and C++.

Therefore, I believe the days when every line of code was written by software engineers are over. In the future, every software engineer will have a digital engineer by their side, available 24/7 to assist with work. That's the future. So, when I look at NVIDIA, we have 32,000 employees, but there will be many more digital engineers around them, possibly 100 times more digital engineers.

5. Many industries are embracing these changes. Which use cases and industries are you most excited about?

Jensen Huang: At our company, we use AI in computer graphics. Without AI, we could no longer do computer graphics. We compute just one pixel and then infer the other 32. In other words, we "hallucinate" the other 32 pixels, in a sense, and they are visually stable and look photorealistic. Both image quality and performance are excellent.

Calculating one pixel requires a lot of energy, while inferring the other 32 pixels requires very little energy and can be done very quickly. So, AI is not just about training models, that's just the first step. What's more important is how you use the models. When you use models, you save a lot of energy and time.

Without AI, we could not serve the autonomous driving industry. Without AI, our work in robotics and digital biology would not be possible either. Almost every techbio company now is built around Nvidia. They use our data processing tools to generate new proteins, and areas such as small-molecule generation and virtual screening will also be completely reshaped by AI.

6. Let's talk about competition and your competitive barriers. Currently, there are many public and private companies that hope to break your leadership position. How do you view your competitive barriers?

Jensen Huang: First of all, I think several things set us apart. The first thing to remember is that AI is not just about chips. AI is about the entire infrastructure. Today's computer is not a matter of making a chip that people buy and put into a computer. That model belongs to the 1990s. Today's computers are developed under the names supercomputing cluster, infrastructure, or supercomputer; they are not a chip, and not quite a computer either.

So we are in effect building entire data centers. If you look at one of our supercomputing clusters, you will find that the software needed to manage the system is extremely complex. There is no "Microsoft Windows" you can simply use for these systems. This custom software is something we develop for these superclusters, so it makes sense for the company that designs the chips, the company that builds the supercomputers, and the company that develops this complex software to be one and the same. That ensures optimization, performance, and efficiency.

Secondly, AI is fundamentally an algorithm. We are very good at understanding how algorithms work and how the computing stack distributes computations, and how to run on millions of processors for days, maintaining the stability of the computer, energy efficiency, and the ability to complete tasks quickly. We are very good at this.

Finally, the key to AI computing is the installed base. It is important to have a unified architecture that spans across all cloud computing platforms and on-premise deployments. Whether you are building supercomputer clusters in the cloud or running AI models on a device, there should be the same architecture to run all the same software. This is called the installed base. And this consistency in architecture since 1993 is one of the key reasons why we have achieved what we have today.

Therefore, if you want to start an AI company today, the most obvious choice is to use Nvidia's architecture because we are already present on all cloud platforms. No matter which device you choose, as long as it has the Nvidia logo, you can run the same software directly.

7. Blackwell is 4 times faster in training and 30 times faster in inference speed compared to its predecessor product, Hopper. With such a fast pace of innovation, can you maintain this rhythm? Can your partners keep up with your pace of innovation?

Jensen Huang: Our basic approach to innovation is to make sure we keep driving architectural innovation. The innovation cycle for each chip is about two years; two years is the best case. We also give them a mid-cycle upgrade every year, but the overall architecture is renewed roughly every two years, which is already very fast.

We have seven different chips that collectively act on the entire system. We can introduce a new AI supercomputing cluster every year, which is more powerful than the previous generation. This is because we have multiple parts that can be optimized. Therefore, we can deliver higher performance very quickly, and this performance improvement directly translates into a decrease in total cost of ownership (TCO).

Blackwell's performance gain means that a customer with 1 gigawatt of power can earn three times the revenue. Performance translates directly into throughput, and throughput translates into revenue. If you have 1 gigawatt of power available, you can earn three times the revenue.

So the return on this performance gain is unmatched, and no reduction in chip cost could make up for that threefold revenue gap.

8. How do you view your dependence on the Asian supply chain?

Jensen Huang: The supply chain in Asia is very complex and highly interconnected. Nvidia's GPU is not just a chip, it is a complex system made up of thousands of components, similar to the structure of an electric car. Therefore, the supply chain network in Asia is extensive and complex. We strive to design diversity and redundancy in every link, ensuring that even if there are problems, we can quickly shift production to other locations. Overall, even if there is a disruption in the supply chain, we have the ability to adapt and ensure continuity of supply.

We currently manufacture at TSMC because it is the best in the world, not by a small margin but by a wide one. We have a long history of working with them, and their flexibility and ability to scale are impressive.

Last year our revenue grew substantially, which would not have been possible without the supply chain's rapid response. TSMC's agility and their ability to meet our needs are remarkable. In less than a year we increased capacity substantially, and we will keep expanding next year and again the year after. So their agility and capability are both excellent. That said, if we need to, we can of course turn to other suppliers.

9. Your company is in a very favorable market position. We have covered a lot of great topics. What worries you most?

Jensen Huang: Our company now works with every AI company in the world and with every data center. I don't know of a cloud service provider or computer maker that we don't work with. With that kind of scale comes enormous responsibility. Our customers are very emotional, because our products directly affect their revenue and their competitiveness. Demand is so great, and the pressure to meet it is great too.

We are currently in full production of Blackwell and plan to start shipping and expanding further in the fourth quarter. The demand is so high that everyone wants to get the product as soon as possible and get the maximum share. Such a tense and intense atmosphere is unprecedented.

While it is very exciting to create the next generation of computer technology and see the innovation of various applications, we feel a huge responsibility and significant pressure. But we strive to do our best. We have adapted to this intensity and will continue to work hard.

Disclaimer: This content is for informational and educational purposes only and does not constitute a recommendation or endorsement of any specific investment or investment strategy.