

Ph.D. candidate in Sociology. Computational social science, statistics, text mining, #rstats.
…s also important not to drop too many important features, as that might lead to a drop in performance. Also, you can’t identify these noisy features using feature importance, because a feature can be fairly important and still be very noisy!
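To make that last point concrete, here is a rough sketch on synthetic data. It rests on two assumptions that are not from the text above: "noisy" is taken to mean that a feature's relationship with the target does not hold up on held-out data, and absolute correlation with the target on the training set stands in for a model's feature importance. The feature names `x1` and `x2` and the helper `make_split` are made up for the illustration.

```python
import random


def pearson(xs, ys):
    """Plain Pearson correlation between two equal-length lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / (vx * vy) ** 0.5


def make_split(rng, n, x2_weight):
    """Hypothetical data generator: y = x1 + x2_weight * x2 + small noise."""
    x1 = [rng.gauss(0, 1) for _ in range(n)]
    x2 = [rng.gauss(0, 1) for _ in range(n)]
    y = [a + x2_weight * b + rng.gauss(0, 0.1) for a, b in zip(x1, x2)]
    return {"x1": x1, "x2": x2}, y


rng = random.Random(0)

# x2 drives the target strongly in training, but its effect flips sign
# in the held-out split. x1 behaves the same way in both splits.
X_train, y_train = make_split(rng, 500, x2_weight=2.0)
X_test, y_test = make_split(rng, 500, x2_weight=-2.0)

# Stand-in for feature importance: correlation with the target.
train_corr = {name: pearson(col, y_train) for name, col in X_train.items()}
test_corr = {name: pearson(col, y_test) for name, col in X_test.items()}

for name in ("x1", "x2"):
    print(f"{name}: train r = {train_corr[name]:+.2f}, "
          f"test r = {test_corr[name]:+.2f}")
```

Ranked on training data alone, `x2` looks like the more important feature, yet it is the one whose relationship with the target reverses on the held-out split. An importance score computed on the training set cannot show that; you only see it by comparing the two splits.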