Take a sample of 10 phishing e-mails and find the most common words.
1 min read · Jan 7, 2020
The variables ph1 to ph10 hold 10 different real phishing e-mails. They can also be stored together as the elements of a single list.
words1 contains all the words that appear in the 10 phishing e-mails.
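As a rough sketch of how words1 might be built, assuming Python and a simple regular-expression tokenizer (the original tokenization is not shown): the two e-mail texts below are made-up placeholders, and the list name emails is hypothetical.

```python
import re

# Hypothetical placeholder texts; substitute the real phishing e-mails
# (ph1 to ph10) here.
ph1 = "Your account has been suspended. Verify your account now."
ph2 = "Click here to verify your password and restore your account."

# Keeping the e-mails in one list makes it easy to loop over them.
emails = [ph1, ph2]

words1 = []
for email in emails:
    # Lowercase the text, then keep runs of letters/apostrophes as words.
    words1.extend(re.findall(r"[a-z']+", email.lower()))

print(len(words1), words1[:5])
```

Lowercasing first means "Verify" and "verify" are counted as the same word later on.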
These words may include stopwords, so remove them by checking each word against a hardcoded list of stopwords.
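One way the filtering step could look, assuming Python: the stopword set here is a deliberately short, hypothetical example rather than the list used in the original, and the sample words1 values are invented for illustration.

```python
# Hypothetical, deliberately short stopword set; extend it as needed.
stopwords = {"a", "an", "the", "to", "and", "of", "is",
             "your", "has", "been", "here", "now"}

# Invented sample of already-tokenized, lowercased words.
words1 = ["your", "account", "has", "been", "suspended",
          "verify", "your", "account", "now"]

# Keep only the words that are not in the stopword set.
filtered = [word for word in words1 if word not in stopwords]

print(filtered)
```

A set is used instead of a list because membership checks on a set stay fast as the stopword collection grows.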
Now use a counter to count how many times each word occurs. Finally, print the top five (or any other number you choose) to see the words phishers use most frequently.
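The counting step can be sketched with collections.Counter from the Python standard library, assuming that is the counter meant here. The filtered list and the name top_n are illustrative assumptions, not data from the original e-mails.

```python
from collections import Counter

# Invented sample of words left after stopword removal.
filtered = ["account", "verify", "account", "password",
            "verify", "account", "click"]

# Count how often each word occurs.
counts = Counter(filtered)

# Number of top words to show; change this to any value you like.
top_n = 2
top_words = counts.most_common(top_n)

print(top_words)  # [('account', 3), ('verify', 2)]
```

most_common returns (word, count) pairs sorted from the most frequent to the least frequent, so raising top_n to 5 gives the top-five list described above.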