In this article we will explore Gensim, a very popular Python library for training text-based machine learning models, to train a Word2Vec model from scratch — Word2Vec is a machine learning algorithm that allows you to create vector representations of words. These representations, called embeddings, are used in many natural language processing tasks, such as word clustering, classification, and text generation.