My Recommendations to Learn Mathematics for Machine Learning

Here is a compilation of books, videos, and papers to get you started.

Published in

DAIR.AI

3 min readOct 17, 2020

I have always emphasized on the importance of mathematics in machine learning. Here is a compilation of resources (books, videos, and papers) to get you going.

This is not an exhaustive list but I have carefully curated it based on my experience and observations.

This is a repost of my Twitter thread that you can find here. I will keep updating the list here as I come across more useful resources.

Mathematics for Machine Learning

by Marc Peter Deisenroth, A. Aldo Faisal, and Cheng Soon Ong

Source: https://mml-book.github.io

Note: this is probably the place you want to start. Start slowly and work on some examples. Pay close attention to the notation and get comfortable with it.

Pattern Recognition and Machine Learning

by Christopher Bishop

Source: https://www.microsoft.com/en-us/research/publication/pattern-recognition-machine-learning/

Note: Prior to the book above, this is the book that I used to recommend to get familiar with math-related concepts used in machine learning. A very solid book in my view and it’s heavily referenced in academia.

The Elements of Statistical Learning

by Jerome H. Friedman, Robert Tibshirani, and Trevor Hastie

Note: machine learning deals with data and in turn uncertainty which is what statistics teach. Get comfortable with topics like estimators, statistical significance,…

Probability Theory: The Logic of Science

by E. T. Jaynes

Source: https://bayes.wustl.edu/etj/prob/book.pdf

Note: In machine learning, we are interested in building probabilistic models and thus you will come across concepts from probability theory like conditional probability and different probability distributions.

Multivariate Calculus by Imperial College London

by Dr. Sam Cooper & Dr. David Dye

Video Playlist: https://www.youtube.com/playlist?list=PLiiljHvN6z193BBzS0Ln8NnqQmzimTW23

Note: backpropagation is a key algorithm for training deep neural nets that rely on Calculus. Get familiar with concepts like chain rule, Jacobian, gradient descent,.

The Matrix Calculus You Need For Deep Learning

by Terence Parr & Jeremy Howard

Source: https://arxiv.org/abs/1802.01528

Note: In deep learning, you need to understand a bunch of fundamental matrix operations. If you want to dive deep into the math of matrix calculus this is your guide.

Mathematics for Machine Learning — Linear Algebra

by Dr. Sam Cooper & Dr. David Dye

Video Playlist: https://www.youtube.com/playlist?list=PLiiljHvN6z1_o1ztXTKWPrShrMrBLo5P3

Note: a great companion to the previous video lectures. Neural networks perform transformations on data and you need linear algebra to get better intuitions.

Information Theory, Inference and Learning Algorithms

by David J. C. MacKay

Note: When you are applying machine learning you are dealing with information processing which in essence relies on ideas from information theory such as entropy and KL Divergence,…