GNU social JP
  • FAQ
  • Login
GNU social JPは日本のGNU socialサーバーです。
Usage/ToS/admin/test/Pleroma FE
  • Public

    • Public
    • Network
    • Groups
    • Featured
    • Popular
    • People

Conversation

Notices

  1. Embed this notice
    Gabriele Svelto (gabrielesvelto@mas.to)'s status on Thursday, 05-Mar-2026 11:47:30 JST Gabriele Svelto Gabriele Svelto

    A few years ago I designed a way to detect bit-flips in Firefox crash reports and last year we deployed an actual memory tester that runs on user machines after the browser crashes. Today I was looking at the data that comes out of these tests and now I'm 100% positive that the heuristic is sound and a lot of the crashes we see are from users with bad memory or similarly flaky hardware. Here's a few numbers to give you an idea of how large the problem is. 🧵 1/5

    In conversation about a month ago from mas.to permalink
    • Embed this notice
      Gabriele Svelto (gabrielesvelto@mas.to)'s status on Thursday, 05-Mar-2026 11:47:28 JST Gabriele Svelto Gabriele Svelto
      in reply to

      And for the record I'm looking at this mostly on computers and phones, but this affects *every* device. Routers, printers, etc... you name it. That fancy ARM-based MacBook with RAM soldered on the CPU package? We've got plenty of crashes from those, good luck replacing that RAM without super-specialized equipment and an extraordinarily talented technician doing the job. 5/5

      In conversation about a month ago permalink
    • Embed this notice
      Roy Tam (roytam1@miniwa.moe)'s status on Thursday, 05-Mar-2026 11:47:28 JST Roy Tam Roy Tam
      in reply to
      @gabrielesvelto I wonder how many reports are from DDR5 platforms? as DDR5 comes with ECC I wonder if that helps?
      In conversation about a month ago permalink
    • Embed this notice
      Gabriele Svelto (gabrielesvelto@mas.to)'s status on Thursday, 05-Mar-2026 11:47:29 JST Gabriele Svelto Gabriele Svelto
      in reply to

      In the last week we received ~470000 crash reports, these do not represent all crashes because it's an opt-in system, the real number of crashes will be several times larger. Still, out of these ~25000 crashes have been detected as having a potential bit-flip. That's one crash every twenty potentially caused by bad/flaky memory, it's huge! And because it's a conservative heuristic we're underestimating the real number, it's probably going to be at least twice as much. 2/5

      In conversation about a month ago permalink
    • Embed this notice
      Gabriele Svelto (gabrielesvelto@mas.to)'s status on Thursday, 05-Mar-2026 11:47:29 JST Gabriele Svelto Gabriele Svelto
      in reply to

      In other words up to 10% of all the crashes Firefox users see are not software bugs, they're caused by hardware defects! If I subtract crashes that are caused by resource exhaustion (such as out-of-memory crashes) this number goes up to around 15%. This is a bit skewed because users with flaky hardware will crash more often than users with functioning machines, but even then this dwarfs all the previous estimates I saw regarding this problem. 3/5

      In conversation about a month ago permalink
    • Embed this notice
      Gabriele Svelto (gabrielesvelto@mas.to)'s status on Thursday, 05-Mar-2026 11:47:29 JST Gabriele Svelto Gabriele Svelto
      in reply to

      And to reinforce this estimate I've looked at the numbers we got from the users who run the memory tester after having experienced a crash: for every two crashes we think are caused by a bit-flip the memory tester found one genuine hardware issue. Keep in mind that this is not doing an extensive test of all the machine's RAM, it only checks up to 1 GiB of memory and runs for no longer than 3 seconds... and it has found lots of real issues! 4/5

      In conversation about a month ago permalink
    • Embed this notice
      Gabriele Svelto (gabrielesvelto@mas.to)'s status on Thursday, 05-Mar-2026 19:30:17 JST Gabriele Svelto Gabriele Svelto
      in reply to
      • Roy Tam

      @roytam1 DDR5 has ECC only over transfers from the DRAM chips to the memory controller, unfortunately it doesn't have ECC covering the memory itself

      In conversation about a month ago permalink
      Roy Tam likes this.
    • Embed this notice
      Samantaz Fox (samantazfox@infosec.exchange)'s status on Friday, 06-Mar-2026 08:07:06 JST Samantaz Fox Samantaz Fox
      in reply to
      • Roy Tam

      @gabrielesvelto @roytam1 It's the other way around: DDR5 has on-chip ECC, but no ECC lines between the RAM chips and the memory controller. That's why "real" ECC needs compatible sticks, motherboard *and* CPU.

      In conversation about a month ago permalink
      Roy Tam likes this.

Feeds

  • Activity Streams
  • RSS 2.0
  • Atom
  • Help
  • About
  • FAQ
  • TOS
  • Privacy
  • Source
  • Version
  • Contact

GNU social JP is a social network, courtesy of GNU social JP管理人. It runs on GNU social, version 2.0.2-dev, available under the GNU Affero General Public License.

Creative Commons Attribution 3.0 All GNU social JP content and data are available under the Creative Commons Attribution 3.0 license.