10% of Firefox crashes are caused by bitflips

73 points by glacambre


viraptor

I asked a question in that thread because I'm either missing something about the claim or the claim is too strong, but didn't get a response. The author says the estimate is conservative and likely lower than reality, but the process summarised in the bug tracker is: If the bad pointer access is for an unmapped page and the there's a mapped page one bit-flip away, the access is classified as a hardware memory issue.

But that seems to ignore a whole class of issues where a valid pointer gets corrupted by either bad_ptr->flag=value or *bad_ptr += 2**x which also result in a bit flip.

lonjil

How about we all switch to ECC. Then we could tell if it's the hardware or the software at fault.

doug-moen

Note that they don't claim that 10% of all machines running Firefox have bad memory. The claim is that 10% of all reported crashes are caused by bad memory, which is quite different. Machines with bad memory presumably generate more crash reports.

glacambre

The hackernews thread also contains very interesting anecdotes : https://news.ycombinator.com/item?id=47252971

NicoElbera

I think a lot of those failures are on specific machines. Currently I have some corrupted memory that still works just good enough that I don't want to buy new sticks, but I have probably crashed firefox 50 times in the last month or so (likely all bitflips).