Scipy — Python for Data Science

Numpy— Python for Data Science

Type of Distributions

Statistics Skew

Distribuição Normal Padronizada

Z-Score

Teorema Central do Limite

SCIPY.Stats

Naïve Bayes e distribuições

Conversão Atributos Categóricos => Numéricos Discreto

Aprendizagem Baseada em Distâncias — KNN

Linear Regression

Skewed Data with Machine Learning

Neural Networks Initiators

Normality Tests

- Parametric statistics: the data is in some distribution, usually the normal distribution.
- Non-parametric statistics: data is in another (or unknown) distribution
- If the data is “normal”, we use parametric statistics. Otherwise, we use non-parametric statistics.

Shapiro-Wilk Test

Shapiro-Wilk Test
p-value is used to interpret the statistical test.
p <= alpha: rejects hypothesis, not normal
p > alpha: don’t reject the hypothesis, it’s normal

Python Notebook Colab

--

--

Andre Vianna
My Data Science Journey

Software Engineer & Data Scientist #ESG #Vision2030 #Blockchain #DataScience #iot #bigdata #analytics #machinelearning #deeplearning #dataviz