Turkish Text Preprocessing with Zemberek in Python

Basic Turkish text pre-processing

Ayşe Kübra Kuyucu
Technology Hits

--

Screenshot by Ayşe Kübra Kuyucu

This article will go through the following main pre-processing steps for Turkish texts. I will be sharing all the source links for necessary files and Python codes.

  • Normalization
  • Tokenization
  • Stopword elimination

--

--