Shiv VidhyutA Brief Introduction to Distributed Training with Gradient DescentGradient descent is one of the most popular algorithms to perform optimization and by far the most common way to optimize neural networks…6 min read·Dec 13, 2019--1--1