How to get started with Machine Learning

Published in

Code Heroku

6 min readMar 3, 2019

Are you someone who is constantly trying to learn the concepts of machine learning and data science but don’t know from where to start?

If your answer is “YES”, then this is the right place for you.

Don’t worry, most people feel the same way as you when they first get introduced to machine learning. They have the enthusiasm to learn but, they get confused from where to start learning, which resources to follow and which prerequisites to learn at first.

So, we at Code Heroku, felt that it’s much needed for us to write an article about getting started with machine learning so that, aspiring machine learning engineers can follow a guided path to pursue their dream :)

Let’s start with the most common questions.

How much coding knowledge is required and what programming languages are best for machine learning?

So, exactly how much knowledge should you have about programming?

Just the basic syntax and knowledge about using libraries is enough.

Why?

Because, in majority of the cases, machine learning engineers don’t build their models from scratch. They would rather use some machine learning library such as Scikit Learn or TensorFlow in order to implement their models. These libraries provide well-documented high level APIs for implementing machine learning models. So, most of the hard work is already taken care by them. You just need to know how to use these libraries in order to implement your model.

To answer the second part of the question, there is really no specific answer to that. While different people have different opinions over this, we still tried to compare some of the widely used programming languages for machine learning (Python, R, MATLAB) in terms of their execution speed, available libraries and ease of use.

Now, let’s discuss some of the advantages and disadvantages of these languages.

Python:

Advantages:

Multipurpose programming language. Can be used to build actual commercial products and standalone applications.
A large number of Open Source Packages and libraries available for machine learning.
Though it is 5 times slower than C, it is still much faster than R and MATLAB in terms of execution speed.
Can be easily integrated with other languages.

Disadvantages:

More lines of code needs to be written as compared to R or MATLAB for performing the same operation.

Advantages:

Primarily developed for statistical computation and data analytics.
A large number of Open Source Packages and libraries available for machine learning.
Can be easily integrated with other languages such as C/C++.
It requires less number of codes to be written as compared to Python and MATLAB for performing the same task.

Disadvantages:

Slow execution speed as compared to Python.
Difficult to implement interactive charts.
Limited capabilities while creating standalone applications.

MATLAB:

Advantages:

Fastest mathematical and computational platform specially for vectorized operations, linear algebra, matrix manipulation.
Best for visualization of plots and interactive charts.

Disadvantages:

Slightly less number of Open Source Packages available.
Slow execution speed as compared to Python and R.
Not a good choice for implementing standalone applications.

From our point of view, we would recommend you to get started with Python if you are a beginner. Because, it’s an easy-to-learn multipurpose programming language with (comparatively) fast execution speed and wide range of open source libraries available for machine learning. We, at Code Heroku also use Python to teach our Introduction to Machine Learning course.

How much knowledge of mathematics is required?

In majority of the cases, people get scared of machine learning when they get to know about the mathematics part. Because, students are not well informed about which part of mathematics to learn and how much to learn.

So, here we have tried to provide you with a guided path of mathematical foundations for machine learning. Having a sound knowledge on the following topics is sufficient (but not limited to) for a machine learning engineer.

Linear Algebra
Analytic geometry
Matrix Decompositions
Vector Calculus
Probability and Distribution
Continuous Optimization

Don’t worry, you don’t need to learn all of this at once. Here are some useful tips for getting started:

Don’t try to learn all topics at once, learn them one by one.
While exploring a particular topic of machine learning, try to learn maths related to that topic only.

Our Machine Learning Course at Code Heroku is designed in such a way that it explains the maths behind every algorithm in our syllabus. So, even if you are someone from a non-mathematical background, you will still be able to understand the concepts without any problem.

What syllabus to follow?

We have tried to provide you with a simple yet powerful syllabus for machine learning which is sufficient (but not limited to) for a beginner. The same syllabus is being taught at our Introduction to Machine Learning course at Code Heroku.

Section 1: Introduction and Getting Started

Introduction to Machine Learning
Setup Python
Scikit-Learn and other libraries installation
An introduction to Python, Numpy, SciPy and Matplotlib

Section 2: Supervised Machine Learning

Introduction to Supervised Learning
Linear Regression using Scikit-Learn
Naive Bayes using Scikit Learn
Understanding Recommendation Engines
Understanding Gradient Descent Optimization

Section 3: Unsupervised Machine Learning

Understanding Support Vector Machines(SVM)
Understanding K-Means
Principal Component Analysis (PCA)
Understanding the Application of PCA
Model Selection and Boosting

Section 4: Reinforcement Learning

Introduction to Reinforcement Learning
Introduction to Open AI Gym
Deep Reinforcement Learning and Q-Learning
Understanding the Application of Deep Reinforcement Learning

Section 4: Neural Networks

Introduction to Neural Networks
Supervised Machine Learning using Neural Networks
Object Recognition with Neural Networks

What project ideas to follow?

While learning the theory is important, it is more important to apply that knowledge and build something useful out of it. Implementing projects while learning will give you hands-on experience on the things that you will learn. We at Code Heroku implement the following projects in our Introduction to Machine Learning course and would recommend you to try to implement the same.

Supervised Machine Learning Projects:

Student Exam Score Prediction System
Email Classifier System (Spam Detection System)
Movie Recommendation System

Unsupervised Machine Learning Projects:

Reinforcement Learning Projects:

Solving Mouse Cat Maze
Cartpole Balancing with Q Learning

Neural Network Projects:

Object Recognition with Neural Networks

Apart from the list above, you can also try to implement tons of other cool projects on your own. You can get some good project ideas from ml-showcase.com and try to implement them on your own.

What learning strategy to follow?

Most people are left out in the learning process because they do not follow a learning strategy. Though this thing depends from person to person according to their own preferences, we still tried to give you a generalized view over the learning process.

When you try to learn a topic related to machine learning, try to understand the theory behind it.
Once you have learned the theory, apply this knowledge to make a small project out of it.
Once you have explored enough topics, try to compare their performance while implementing the same project. For example, try to implement spam detection system using both Naive Bayes and SVM and then compare the results.
This will give you a better idea to understand which algorithm to use in which scenario.

Apart from this, try to manage your time effectively and learn the topics one by one. Do not rush to learn everything at once.

If this article was helpful to you, check out our Introduction to Machine Learning Course at Code Heroku for a complete guide to Machine Learning.

How to get started with Machine Learning

How much coding knowledge is required and what programming languages are best for machine learning?

How much knowledge of mathematics is required?

What syllabus to follow?

What project ideas to follow?

What learning strategy to follow?

Written by Code Heroku