Statistics in Data Science

vishal singh
1 min readDec 24, 2022

--

Statistics is an important field in data science, as it allows you to analyze and interpret data in order to extract useful insights and make informed decisions. It involves collecting, organizing, analyzing, and interpreting data, and using statistical methods to draw conclusions and make predictions. Some common statistical techniques used in data science include:

  • Descriptive statistics: This involves summarizing and describing the characteristics of a dataset, such as calculating the mean, median, and standard deviation.
  • Inferential statistics: This involves using statistical techniques to make inferences about a population based on a sample of data.
  • Hypothesis testing: This involves using statistical tests to determine whether a certain relationship or difference between variables is likely to be true in the population, or if it is simply due to chance.
  • Regression analysis: This involves fitting a mathematical model to a dataset in order to understand the relationships between variables and make predictions.
  • Machine learning: This involves using algorithms to learn patterns in data and make predictions or decisions without explicit programming.

--

--

vishal singh

data science, SQL, Python, R, Numpy, Pandas and many more