Machine Learning, Data Science and Artificial Intelligence

Walid Hadri
Geek Culture
Published in
5 min readMar 7, 2021

The purpose of this article is to emphasize the meaning of these three big buzz words, the differences, the relationship and the links connecting them. A lot of people confuse these three different domains. Even some ML engineers and some Data Scientists have trouble drawing a line between them. I hope by the end of reading this article, we will clear that out.

In 2018, a report released regarding the misuse from companies claiming to use artificial intelligence on their products and services. According to the Verge, 40% of European startups claimed to use AI don’t use the technology. In 2017 TechTalks, also stumbled upon such misuse by companies claiming to use machine learning and advanced artificial intelligence to gather and examine thousands of users’ data to enhance user experience in their products and services.

I really recommand taking time to examine where they meet and where they don’t. And along this reading phase, take some time also to answer your questions about the work done by a Data Scientist and a ML engineer, where Deep Learning and Data Analysis are in all of this, what are the tools and the educational background needed for each job/technology.

Let us start with this quote that summarizes the relationship between the three technologies

“AI is a bigger concept to create intelligent machines that can simulate human thinking capability and behavior, whereas, machine learning is an application or subset of AI that allows machines to learn from data without being programmed explicitly. Data science tackles big data and includes data cleansing, preparation, and analysis. A data scientist gathers data from multiple sources and applies machine learning”.

It is obvious that the three domains overlap, but to see the difference, let’s take a look at the end-goal for each one them:

  1. Data science is about using data to provide value (money, growth, reputation, etc.) to an organization.
  2. Machine learning is about using data to make optimized inferences and predictions.
  3. Artificial intelligence is about using data to impart human-like decision making to machines.

Artificial Intelligence

The English mathematician Alan Turing created the phrase “artificial intelligence” in the 1950s. The goal was (and is) to build machines (computers) that we consider “smart,” and that can perform various tasks. AI is a science like mathematics or biology. It studies ways to build intelligent programs and machines that can creatively solve problems, which has always been considered a human prerogative.

AI is different from Machine Learning, because it is working to create an intelligent system which can perform various complex tasks, whereas ML is working to create machines that can perform only those specific tasks for which they are trained.

AI is divided into Weak AI, General AI, and Strong AI.

Machine Learning

Machine learning is the study of computer algorithms that allow computer programs to automatically improve through experience. Machine learning is a subset of artificial intelligence (AI) that provides systems the ability to automatically learn and improve from experience without being explicitly programmed. The basic premise here is to develop algorithms that can receive input data and leverage statistical models to predict an output while updating outputs as new data becomes available.

ML is mainly divided into Supervised learning, Unsupervised learning, and Reinforcement learning.

In business ML ≈ AI, Artificial Intelligence and Machine Learning usually refer to the same thing in business. Why? Because most business applications of AI amount to Supervised Learning, which is a subfield of Machine Learning.

“99% of the economic value created by AI today is through one type of AI, which is learning A to B or input to output mappings.” Andrew Ng.

However, let me give you an example of a problem that can be solved using ML and without ML (thus AI without ML): object detection. For example, giving an image and an object, I want to find on the image where this object appears, for this problem we can use ML (object detection, score, probabilistic approach) or without ML (template matching, score, OpenCV).
Machine Learning works on datasets, AI not necessarily.

Data Science

Data science is a concept used to tackle big data and includes data cleansing, preparation, and analysis. A data scientist gathers data from multiple sources and applies machine learning, predictive analytics, and sentiment analysis to extract critical information from the collected data sets.

Because data science is a broad term for multiple disciplines, machine learning fits within data science. Machine learning uses various techniques, such as regression and supervised clustering. On the other hand, the data in data science may or may not evolve from a machine or a mechanical process. The main difference between the two is that data science as a broader term not only focuses on algorithms and statistics but also takes care of the entire data processing methodology.

Data Scientist vs Machine Learning Engineer

In few words, data science is the researching, building, and interpretation of the model you have built, while machine learning is the production of that model.

Overall, a data scientist can be many things, but the main functions are to:

  1. Meet with stakeholders to define the business problem
  2. Pull data (SQL)
  3. EDA, feature engineering, model building, & prediction (Python and Jupyter Notebook)
  4. Depending on workplace, compile code to .py format and/or pickled model

Whereas the work of Machine Learning engineer could be:

  1. A pkl_file of data science model
  2. Storage bucket (GCP — Google Cloud Composer)
  3. DAG (for scheduling the trainer and evaluator of the model)
  4. Airflow (visualizes the process — ML pipeline)
  5. Docker (containers and virtualization)

Data Science: focuses on statistics and algorithms, unsupervised and supervised algorithms, regression and classification, interprets results, presents and communicates results.

Machine Learning: focus on software engineering and programming, automation, scaling, scheduling, incorporating model results into a table/warehouse/UI.

So the Machine Learning engineer deploys the model constructed by the Data Scientists.

Deep Learning

When we talk about Deep Learning, we are talking about algorithms capable of mimicking the actions of the human brain thanks to neural networks, hence the term Artificial Intelligence. It is often explained that the difference between Machine Learning and Deep Learning is that Machine Learning algorithms will process quantitative and structured data (numeric values), while Deep Learning algorithms will process unstructured data, such as sound, text, image. So the difference is Feature Extraction. Although, Deep Learning is a subset of Machine Learning.

This was a brief overview to show that these buzz words, often misused, don’t refer to the same thing. As I said before, some ML engineers and Data Scientists can’t tell the difference or they don’t have a clear idea about the differences, ready to be explained in simple words. So before jumping to Machine Learning, Data Science, AI, Deep Learning, Data Analysis… take some time and have a clear view about the world around you.

--

--