Kingma, Diederik P., and Jimmy Ba. arXiv preprint arXiv:1412.6980 (2014) Link to original paper You pick up a paper and you come across this at the end of the first page, you know it is going to be an interesting read. Abstract: We introduce Adam, an algorithm for first-order gradient-based…