Jan Joseph MalininTowards Data ScienceUnderstanding Fixup initializationHow to train residual networks without normalization layers.Oct 12, 2019Oct 12, 2019