Thank you for reading. Here y, is the variable for which you are trying to predict (label) and x, z are the predictors. If you don’t have any label, and you can see that feature having missing values is dependent on other features from correlation matrix, then I would advise you to try predicting the missing…
I am glad you read it. I agree some of the features you suggested will definitely improve the predictability, the only problem is that I don’t have that data. Whatever analysis I have done currently is based on the scraped data of HTML. If only I could get more information, I will try to incorporate those features. Anyways, thanks for suggestions.
Hi Sai! The reason why I wrote it as variance of residuals is because I was assuming the mean of residuals to be zero. Therefore to write variance of residuals, I can use the below formula which shows that sum of squares of residuals can be written as variance of residuals.