Generalised Linear Model — Part 1:

Stefano Bosisio
Analytics Vidhya
Published in
7 min readMar 8, 2020

--

I will try to review and extend more and more this post, in order to show the power and importance of Generalised Linear Model and giving clear practical examples

Generalised Linear Model or GLM are a vast class of models, which try to fit a distribution of points (observations), independently from the distribution function of the observations under study. Such a statement sounds quite strong, but it has remarkable applications from biology to finance, creating a solid foundations for nowadays machine learning world. We often use models derived from GLM in our data science projects (e.g. Linear Regression, Logistic Regression, Poisson Regression), but we spend just very little time to think how everything has started and what is the mathematical background. Furthermore, we tend to define all these models as classification algorithm, while on the paper they are just fitting our data points. In this post I will try to give an introduction of GLM, underlining the most salient features and mathematical aspects.

The foundations

As a first step, we need to assess two main constraints to GLM. Primarly, a generalised linear model is characterised by the independence — or un-correlation — of the observations. As a consequence, data with autocorrelations of time series or spatial processes cannot be fitted using GLMs (e.g. we cannot use our…

--

--

Stefano Bosisio
Analytics Vidhya

Machine Learning Engineer, PhD in Computational Chemistry. My writing covers neuroscience research, coding tutorial and social-media analyses