Charlie BlakeinGraphcoreSimple FP16 & FP8 Training with Unit ScalingUnit Scaling is a new low-precision machine learning method able to train language models in FP16 and FP8 without loss scaling.Mar 29, 2023Mar 29, 2023