We will interpret the coefficients of the following linear models using intuitive and concrete examples.
Simple Linear Regression
Used when the response variable is continuous and there is only one explanatory variable → models the value of the response variable
When X is continuous, the model is: sales = 0.1 + 0.2 * TV spending
- Intercept: sales is $0.1 when TV spending is $0
- Slope: sales increases by $0.2 for a one-unit increase in TV spending
When X is categorical, the model is: sales = 0.1 + 0.2 * is TV spending (1 for TV spending, 0 for no TV spending)
- Intercept: sales is $0.1 when there is no TV spending
- Slope: sales increases by $0.2 for TV spending relative to no TV spending
Multiple Linear Regression
Used when the response variable is continuous and there are multiple explanatory variables → models the value of the response variable
When X is continuous, the model is: sales = 0.1 + 0.2 * TV spending + 0.3 * radio spending
- Intercept: sales is $0.1 when TV spending is $0 and radio spending is $0
- Slope 1: sales increases by $0.2 for a one-unit increase in TV spending, all else being equal
- Slope 2: sales increases by $0.3 for a one-unit increase in radio spending, all else being equal
When X is categorical, the model is: sales = 0.1 + 0.2 * is TV spending + 0.3 * is radio spending (1 for TV spending, 0 for no TV spending, 1 for radio spending, 0 for no radio spending)
- Intercept: sales is $0.1 when there is no TV spending and no radio spending
- Slope 1: sales increases by $0.2 for TV spending relative to no TV spending, all else being equal
- Slope 2: sales increases by $0.3 for radio spending relative to no radio spending, all else being equal
Multiple Linear Regression with Interactions
Used when the response variable is continuous and there are multiple explanatory variables and interaction term(s) → models the value of the response variable
When X is continuous, the model is: sales = 0.1 + 0.2 * TV spending + 0.3 * radio spending + 0.4 * TV spending * radio spending
The model simplifies to: sales = 0.1 + (0.2 + 0.4 * radio spending) * TV spending + 0.3 * radio spending
- Slope 1: sales increases by $(0.2 + 0.4 * radio spending) for a one-unit increase in TV spending, all else being equal
- Slope 2: sales increases by $0.3 for a one-unit increase in radio spending, all else being equal
- Interaction: effectiveness of TV spending increases by $0.4 for a one-unit increase in radio spending; effectiveness of radio spending increases by $0.4 for a one unit increase in TV spending, all else being equal
When X is both categorical and continuous, the model is: sales = 0.1 + 0.2 * TV spending + 0.3 * is radio spending + 0.4 * TV spending * is radio spending
The model simplifies to: sales = (0.1 + 0.3) + (0.2 + 0.4) * TV spending if there is radio spending; sales = 0.1 + 0.2 * TV spending if there is no radio spending
- Interaction: a one-unit increase in TV spending impacts sales with and without radio spending differently, all else being equal
Polynomial Regression
Used when the response variable is continuous and there is a non-linear relationship between the explanatory variable(s) and the response variable → models the value of the response variable
The model is: sales = 0.1 + 0.2 * TV spending + 0.3 * TV spending ^ 2
- Slope: a one-unit increase in TV spending impacts sales differently depending on its value, all else being equal. The slope is not constant but changing with the value of TV spending
Logistic Regression
Used when the response variable is categorical → models the probability that the response variable belongs to a certain class
The model is: log odds of default = -0.1 + 0.2 * balance — 0.3 * income — 0.4 * is student (1 for student and 0 for non-student)
Slope 1: log odds of default increases by 0.2 for a one-unit increase in balance
Slope 2: log odds of default decreases by 0.3 for a one-unit increase in income
Slope 3: log odds of default decreases by 0.4 for students relative to non-students
References
An Introduction to Statistical Learning with Applications in R, Gareth James, Daniela Witten, Trevor Hastie, Robert Tibshirani