Forget train_test_split: Pipeline, ColumnTransformer, FeatureUnion and FunctionTransformer are indispensable even if you use XGBoost or LGBM — Data preparation is famously the least-loved aspect of Data Science. If done right, however, it needn’t be such a headache. While scikit-learn has fallen out of vogue as a modelling library in recent years given the meteoric rise of PyTorch, LightGBM, and XGBoost, it’s still easily one of the best…