An overview of how Twitch uses staged rollouts to perform experiments, biases we found, and methods we use for significance testing.