Multi-Class Text Classification with Scikit-Learn
Susan Li
5.9K67

Hi Susan,

Thank you for such a good tutorial though I have a concern about a single point in the code. As far as I see, you create tf-idf values based on the whole data on this line: features = tfidf.fit_transform(df.Consumer_complaint_narrative).toarray()

Isn’t it a little bit problematic ?