A Machine Learning Approach to Author Identification of Horror Novels from Text Snippets

Let’s Get Started…

Navoneel Chakrabarty
Towards Data Science
7 min read · Jan 23, 2019

Many novels are written, but only some acquire cult status and are remembered for generations. Novels span several genres and cross-genres (mixtures of several genres), and horror is one of them. Many famous horror novels remain readers’ favourites decades after their release. For example, the Goosebumps series by R. L. Stine (first published in 1992) has become a household name and one of the most celebrated horror series of modern times. But many classic horror works appeared well before the 21st century. For instance, The Dream-Quest of Unknown Kadath (1943) by H. P. Lovecraft is one of the must-read horror novels of the 20th century. Rewinding further to the 19th century, how can anyone forget Mary Shelley’s Frankenstein (1818 & 1823) and Edgar Allan Poe’s The Fall of the House of Usher (1839)? But one thing is quite clear:

Every author, whether Lovecraft, Mary Shelley, or Poe, had their own style of writing, including a signature way of using certain words, which makes their work unique and recognisable.

So, let’s use this fact to identify the author (Lovecraft/Mary Shelley/Poe) from…




