An analysis of the PHP 7.0.33 random number generation bug
According to Kevan, BlogNomic (and thus its dice roller) are currently running on PHP 7.0.33. This version of PHP contained a random number generation bug, which I’ll use this post to try to explain in case it informs voting on any of the relevant CFJs. (We were a bit unlucky with the version choice here; the very next version of PHP contained a partial fix for the bug, and it was fully fixed in 7.2.)
PHP 7.0.33’s random number generators, rand() and mt_rand(), suffer from a bug known as “modulo bias”. This often occurs naturally when trying to simulate a dice roll using a roll of a larger dice, if the number of sides of the smaller dice doesn’t divide into the number of sides of the larger dice, and don’t allow for the remainder of the division. For example, suppose we tried to simulate a DICE3 using a DICE100. The correct way to do that looks like this:
DICE100 result is 1-33: DICE3 result is 1
DICE100 result is 34-66: DICE3 result is 2
DICE100 result is 67-99: DICE3 result is 3
DICE100 result is 100: reroll
However, random number generators with modulo bias don’t have the reroll case, and instead treat the “remainder” results as one of the other cases, usually the lowest ones. If a random number generator with modulo bias tries to simulate a DICE3 using a DICE100, it does something like this:
DICE100 result is 1-34: DICE3 result is 1
DICE100 result is 35-67: DICE3 result is 2
DICE100 result is 68-100: DICE3 result is 3
This avoids rerolling, but the probabilities aren’t quite the same – there’s a very small bias towards low numbers (in this case, the result of 1 is ⅔% more likely than it should be, and the results of 2 and 3 are ⅓% less likely than they should be).
In PHP 7.0.33, rand() and mt_rand() simulate a random number by first rolling a DICE4294967296, and then mapping that onto the range of desired outputs using an algorithm with modulo bias. That means that if the number of sides on the dice doesn’t exactly divide into 4294967296, then some of the outcomes (usually the lowest) will be marginally more likely than others (usually the highest). However, the probabilities are only wrong in cases which would otherwise be a reroll. For example, 4294967296 ÷ 237 has a remainder of 208, so 4294967088/4294967296 of the time (i.e.99.999995% of the time), you get a fair result on a DICE237; the other 208/4294967296 of the time (i.e. 0.000005% of the time) you get a bias towards certain outcomes. (I’ll leave it up to you to decide whether you think such a small bias is relevant.)
It’s almost impossible to observe this sort of issue experimentally unless you roll a dice with a very large number of sides (which is why Clucky’s tests showed no obvious issue); the issue is that the probability of any given result can’t be wrong by more than 1 in 4,294,967,296, so the only way to observe the problem is to have a very large number of possible results so that the tiny differences in probability can add up (one 1-in-4-billion mistake isn’t observable, a billion of them cumulatively are). Even then, some numbers are better at exposing the issue than others. One of the best examples is DICE1717986918, which has a 60% chance to roll the “more likely” half of numbers and a 40% chance to roll the “less likely” half of numbers – this number is chosen due to being (approximately) 4294967296 ÷ 2½ and thus gives over 800 million possible reroll cases, wiith half the possible outputs selectable in a reroll case and the other half not selectable when the random number generator incorrectly fails to reroll.
That said, although modulo bias usually favours low results, PHP 7.033 seems to use a different pattern for which results are the “more likely” ones, i.e. the results it returns when it’s supposed to reroll – although I initially assumed that it would favour low results (due to being a modulo-bias bug and due to the way those normally work), this particular bug doesn’t follow that trend. In the case of DICE1717986918, the “more likely” values are the values that give a remainder of 2 or 3 when dividing by 4, and the “less likely” values are the values that give a remainder of 0 or 1 when dividing by 4. This is a more complex pattern than I was expecting, and it’s hard to tell how it would generalise to other numbers (it’s also almost impossible to determine a pattern experimentally, without already knowing it, because the amounts by which the probabilities are incorrect is so small).
Further reading about the specific bug in PHP, including some simulations, can be found in this offsite forum post that I found via an Internet search.