Embed Notice
HTML Code
Corresponding Notice
- Embed this notice@IAMAL_PHARIUS @Codeki @PopulistRight @RustyCrab @Inginsub Given the 50 series uplift is only about 8%, they were definitely counting on 24/7 A100s (or H100s in China) burning out and continued fever pitch from Big Tech and get-rich-quick start ups. I think they intended the Digits platform to sort of placate the prosumer home user/trainer, which is basically GDDR in a little shell with a mediocre processor. NVidia seemed to intentionally resist bumping VRAM on the consumer cards for this reason, especially considering the increasing VRAM use of AAA games.
If anyone wants to run real R1 at home (not the Ollama distills), grab a few Macs with their unified RAM and you can do network distributed inference of a decent quality quant with llama.cpp. Since it's only 27B active parameters, it should run at a decent clip as long as you can get it loaded up.