NLTK Short Course: An Intro to NLP Libraries

Brandon Gomes
1 min read · Oct 14, 2023


Series: NLP to LLM from Basic Principles

NLTK, the Natural Language Toolkit, is a Python library for natural language processing and text analysis.

NLTK supports key NLP tasks such as tokenization, parsing, tagging, stemming, and semantic reasoning. The course covers all of these, although some sections use different names for them.

Please review the course overview below before starting the course.

Here is my short but comprehensive NLTK course, written in Jupyter Notebooks.

Course Overview:

Each section is labeled, and the code is documented and paired with examples to make it easier to follow. The course includes everything you need to run the code, including every import and download. After the download information, it begins with tokenizing text, tagging tokens, and analyzing tokenized text. After a quick activity, I introduce stop words (what they are and how to remove them) and then cover stemming. The course also covers finding the sentiment of a sentence. Finally, I show you how to extract information from words and introduce a simple form of semantic reasoning.
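To give a feel for the first few topics, here is a minimal sketch of tokenizing, removing stop words, stemming, and counting the results. It is not taken from the course notebooks. The `preprocess` helper, the sample sentence, and the tiny `STOP_WORDS` set are assumptions made up for illustration, and the regex-based `wordpunct_tokenize` is used so that the snippet runs without any NLTK data downloads.

```python
from nltk import FreqDist
from nltk.stem import PorterStemmer
from nltk.tokenize import wordpunct_tokenize

# Hypothetical mini stop-word list, for illustration only. NLTK ships a
# fuller list in nltk.corpus.stopwords (needs nltk.download("stopwords")).
STOP_WORDS = {"the", "a", "and", "are", "is", "in", "on", "of", "to"}


def preprocess(text):
    """Tokenize, drop stop words and punctuation, then stem each word."""
    stemmer = PorterStemmer()
    # Split lowercase text into runs of word characters and punctuation.
    tokens = wordpunct_tokenize(text.lower())
    # Keep alphabetic tokens that are not in the stop-word set.
    words = [t for t in tokens if t.isalpha() and t not in STOP_WORDS]
    # Reduce each remaining word to its Porter stem.
    return [stemmer.stem(w) for w in words]


sample = "The cats are running and the dogs are jumping in the garden."
stems = preprocess(sample)
freq = FreqDist(stems)  # counts how often each stem occurs

print(stems)
print(freq.most_common(3))
```

The stems come out as shortened forms such as "cat" and "run", which is why stemming is usually applied after stop-word removal and before counting.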

Summary (every topic is explained within the course):

  • Setup
  • Tokenization
  • Tagging
  • Analyzing tokenized text
  • Quick activity
  • Stop words
  • Stemming
  • Sentiments
  • Extracting information from words
  • Semantic reasoning
  • And a quick thank you!


Brandon Gomes

Entry-level software engineer interning and working toward a Bachelor's degree.