While reading the “Deep Learning” book by Ian Goodfellow, I came to realize how important the choice of batch size is and how it helps optimization algorithms converge efficiently. Earlier, like most people, I chose the batch size to maximize GPU memory usage, without being aware of how it affects the optimization of the network. I have tried to summarize how batch size affects learning in the following points; I hope you find them as enlightening as I did.

Yash Upadhyay

I help machines learn, see and talk. https://www.linkedin.com/in/yashupadhyay1997/

