Must Know Mathematical Measures For Every Data Scientist
There are a large number of mathematical measures that every data scientist needs to be aware of. This article outlines the must-know statistical measures in a concise and succinct manner. Please read FinTechExplained disclaimer.
Mean
- Sum all values.
- Divide it by the total number of observations.
Mode
Take the most occurring value in the sample.
Median
- Sort the numbers in ascending order.
- Take the middle value.
Variance
- Calculate mean.
- Take difference between each value and the mean
- Square this difference.
- Sum all differences
- Finally, divide by the total number of observations.
Variance gives us dispersion of the values around the mean.
Standard Deviation
Square root of variance.
Standard deviation gives us dispersion of the values around the mean in the same units as the values (instead of squared value as variance)
Covariance
Covariance is used to find relationship between two variables. For each variable:
- Calculate mean.
- Take difference between each…