Role of OneHotEncoder and Pipelines in PySpark ML Feature — Part 2
Part 1 — What is StringIndexer?
We have already discussed regarding StringIndexer (link)
What is OneHotEncoder?
class pyspark.ml.feature.OneHotEncoder(inputCols=None, outputCols=None, handleInvalid=’error’, dropLast=True, inputCol=None, outputCol=None) — One Hot…