Role of OneHotEncoder and Pipelines in PySpark ML Feature — Part 2

Nutan
4 min readNov 6, 2020

Part 1 — What is StringIndexer?

We have already discussed regarding StringIndexer (link)

What is OneHotEncoder?

class pyspark.ml.feature.OneHotEncoder(inputCols=None, outputCols=None, handleInvalid=’error’, dropLast=True, inputCol=None, outputCol=None) — One Hot…

--

--

Nutan

knowledge of Machine Learning, React Native, React, Python, Java, SpringBoot, Django, Flask, Wordpress. Never stop learning because life never stops teaching.