Published in Analytics Vidhya

Correlation Vs Causation

We know that correlation and causation both describe the relationship between 2 variables, but there is an important difference between them.

Image made by Author using

Correlation is a statistical measure of the strength of the relationship between 2 variables.

It is measured using the equation below.
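Assuming the measure intended here is the Pearson correlation coefficient r, it is usually written as follows, where x_i and y_i are the paired observations, x̄ and ȳ are their means, and n is the number of pairs:

```latex
r = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}
         {\sqrt{\sum_{i=1}^{n} (x_i - \bar{x})^2}\;\sqrt{\sum_{i=1}^{n} (y_i - \bar{y})^2}}
```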

To learn more about this equation, please check the article "Why the Correlation Coefficient r ranges between -1 and +1?".

This formula measures the strength of the linear relationship between 2 variables.
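To make this concrete, here is a minimal Python sketch that computes r from sums of deviations, assuming the Pearson coefficient is the measure meant here. The function name pearson_r and the sample numbers are made up for illustration only.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length with at least 2 points")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Numerator: sum of the products of deviations from the means
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Denominator: square root of the product of the summed squared deviations
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("r is undefined when one variable is constant")
    return cov / math.sqrt(var_x * var_y)

# Illustrative numbers only: a perfectly linear relationship
print(pearson_r([1, 2, 3], [2, 4, 6]))                 # → 1.0

# y = x squared is a perfect relationship, but it is not linear
print(pearson_r([-2, -1, 0, 1, 2], [4, 1, 0, 1, 4]))   # → 0.0
```

The second call shows why the word "linear" matters: y depends entirely on x, yet r is 0 because the relationship is a curve rather than a straight line.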

We might assume that a relationship means one variable has an impact on the other variable. This is called Causation.

But a high correlation coefficient does not mean that there is always Causation.

Correlation does not mean Causation.

Correlation with Causation:

A change in one variable does have an impact on the other variable.

For example, height and weight: a taller person generally tends to weigh more.

Correlation without Causation:

Sometimes the Correlation Coefficient is high, but in a real-world scenario it does not mean anything. The pattern in the data is purely a coincidence.

For example, the correlation coefficient between the movie releases of a famous actor and rain at the time of release is 0.9. But we know that this is purely a coincidence, and that the actor or the movie has nothing to do with the weather.

In this case, Correlation does not imply any Causation. If we rely on the correlation coefficient and build a model based on this statistic, the model will most likely fail on a new data set.
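A small simulation can illustrate this kind of failure. The sketch below is an assumption-laden toy, not real movie or weather data: it draws pure random noise, picks whichever of 2000 unrelated series happens to correlate best with a 10-point "rainfall" series, and then checks the correlation on new data. The names rainfall, candidates, and noise are hypothetical.

```python
import math
import random

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

def noise(rng, n):
    """n independent draws from a standard normal distribution."""
    return [rng.gauss(0.0, 1.0) for _ in range(n)]

rng = random.Random(42)  # fixed seed so the run is repeatable

# Simulated stand-in for rainfall on 10 release days (random noise, not real data)
rainfall = noise(rng, 10)

# 2000 candidate "predictors", all generated independently of rainfall
candidates = [noise(rng, 10) for _ in range(2000)]

# Keep the candidate that happens to correlate best with rainfall
best = max(candidates, key=lambda c: pearson_r(c, rainfall))
r_small_sample = pearson_r(best, rainfall)

# New data: the two quantities are still independent noise
r_new_data = pearson_r(noise(rng, 1000), noise(rng, 1000))

print(f"r on the 10 original points: {r_small_sample:.2f}")
print(f"r on 1000 new points:        {r_new_data:.2f}")
```

With only 10 points and many candidates to choose from, a high r turns up by chance alone. On new data, where there is still no real link between the two quantities, the correlation drops back toward 0, which is why a model built on the first number would not hold up.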

I hope this makes it clear that Correlation does not necessarily imply Causation.

Thank you! 👍

Like to support? Just click the clap icon 👏 as much as you like.

Happy Programming!🎈




Asha Ponraj


Data Science & Machine Learning Enthusiast | Software Developer | Blogger
