English
Back
Download
Log in to access Online Inquiry
Back to the Top

There are reports that the Meta H100 is experiencing malfunctions every 3 hours.

$NVIDIA (NVDA.US)$During the training of the Llama 3 large-scale language model, it became apparent that Meta was plagued by frequent failures of the H100 GPU. With 16,384 H100 80GB GPUs in training, unexpected component failures were occurring at an average rate of once every 3 hours. More than half of these astonishingly frequent failures were attributable to the GPU or memory.
Disclaimer: Community is offered by Moomoo Technologies Inc. and is for educational purposes only. Read more
5
2
3
+0
2
See Original
Report
15K Views
Comment
Sign in to post a comment
136Followers
2Following
253Visitors
Follow
Discussing
Trump 2.0 countdown: What's the next big opportunity in the markets?
Trump is gearing up for a return to the political stage, and his "America First" tariff policies, along with his stance on cryptocurrency an Show More