@IAMAL_PHARIUS @Codeki @PopulistRight @snappler @RustyCrab @Inginsub I think what's especially pernicious about the scheme is the complete lack of apparent elegance in the hardware solutions on offer. Nvidia's original tensor cores *happened* to be good for AI, but their actual capabilities were clearly aimed at scientific/HPC workloads. There's been scarce improvement in any area except deepening the same IP block.
There are plenty of ASICs that outperform Nvidia's offerings on a per-watt basis, but they can't compete with Nvidia's wafer purchasing power or the industry-wide adoption of PyTorch and CUDA. Nvidia's shipped libraries have repeatedly been shown to be inefficient and haphazard, and doing things the Nvidia way, while typically good enough, is rarely the right way to extract maximum performance. But you can't really tap into the hardware's resources without blindly relying on Nvidia's "trust me bro" middleware.
I call it pernicious because the entire industry buying into this one vendor is pigeonholing millions of man-hours of engineering into Nvidia's black-box toolchain.
Nvidia knows this is a gravy train. They have no intention of meaningfully innovating or fixing their flawed foundation. Maybe it's partly because doing things right could break over a decade of CUDA and now TensorFlow work, but I think it's more likely that Nvidia is in a downward talent spiral. Nobody there now worked on laying the groundwork or architecting the original IP blocks. Nvidia's total reliance on proprietary code for its value-add features and workflows means any deviation from the current path is a massive Pandora's box.
I know this thread is more about the "AI" side of things, and how the infinite money printer depended on infinite growth specifically in needing Nvidia's miraculous one true computational solution, and how it turns out Nvidia is as much in the way as it is helping. However, I thought it was worth dredging up the past to point out that Nvidia's entire valuation today is built on an IP block that *happens* to be *competent* at AI, and at this point Nvidia can't risk a pivot. I know the models coming out of China still used Nvidia hardware, but the first chink in the armor was the fact that you don't need a gigawatt of GPU power to compete with o1. I think the next one will be an IP block built specifically for this kind of work - likely a hybrid of FPGA-in-memory plus matrix-ASIC-in-memory: being able to constantly propagate forward in memory by staging FPGAs, rather than bouncing back and forth between memory levels and program levels. Nvidia can't just up and invent any of this; they're already behind, and they even lost their ARM bid.
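The appeal of staging weights in memory is basically an amortization argument, and you can sanity-check it with toy arithmetic. Every number below is made up purely for illustration (hypothetical layer sizes, not any real chip); the only point is that re-fetching weights every pass dominates traffic, while a weights-resident design only moves activations after the initial load:

```python
# Toy data-movement comparison. All sizes are hypothetical, made-up numbers
# chosen only to illustrate the amortization argument, not any real hardware.
layer_weight_bytes = [64e6, 64e6, 32e6, 16e6]  # hypothetical per-layer weights
activation_bytes = 2e6                          # hypothetical activation tensor
passes = 1000                                   # forward passes to amortize over

# Conventional hierarchy: weights are re-fetched from off-chip memory on
# every forward pass, plus one activation transfer per layer.
conventional = passes * (sum(layer_weight_bytes)
                         + len(layer_weight_bytes) * activation_bytes)

# Weights staged in-memory once; afterwards only activations propagate
# forward between compute stages.
in_memory = (sum(layer_weight_bytes)
             + passes * len(layer_weight_bytes) * activation_bytes)

print(f"conventional traffic: {conventional / 1e9:.1f} GB")
print(f"in-memory traffic:    {in_memory / 1e9:.2f} GB")
print(f"reduction: {conventional / in_memory:.1f}x")
```

Under these (invented) numbers the weights-resident layout moves over an order of magnitude fewer bytes; the real engineering question is whether staged FPGAs can keep the pipeline full, which this sketch doesn't model.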
You'd think that companies spending hundreds of billions on leadership in a field wouldn't want their entire business model hinged on a single vendor lmao