Take a sample of 10 phishing e-mails and find the most common words.

Karthika A
Guvi
Published in
1 min readJan 7, 2020

The variables ph1 to ph10 represents 10 different real-time phishing emails. Note that these can be saved within a list as a list of values.

words1 contains all the words which are present in 10 phishing mails

These may contain stopwords so remove the stopwords by hardcoding the stopwords

Now use the counter to count the words and its times of occurrences. Finally, print the top-five or any customized numbers to see them frequently occurring words by phishers.

--

--