Geek Culture
Published in

Geek Culture

An easy way to determine similarity between two strings of text using Python

The last two posts I have written have been about sklearn’s TfidfVectorizer and cosine_similarity. The reason for this is because it is important to thoroughly learn these concepts in order to get to grips with natural language processing, or NLP. I had known about TfidfVectorizer because there have been a few Kaggle competitions that focused on NLP and it is…




A new tech publication by Start it up (

Recommended from Medium

Modular Health System in Unity

Spark 3.0: First hands-on approach with Adaptive Query Execution (Part 2)

Deploying Django API & Vue.js App (with NGINX)using Docker

TestNet reliability testing and improvements

Building an IoT Product — The Product Production Feedback Loop

How to divide the image into 4 parts using OpenCV

C++20 coroutines for asynchronous gRPC services

Fully on-chain trading bot using Balancer v2 pool

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store


I have close to five decades experience in the world of work, being in fast food, the military, business, non-profits, and the healthcare sector.

More from Medium

Agglomerative Hierarchical Clustering Using SciPy

How I used sklearn’s LabelEncoder function to solve Kaggles February 2022 tabular competition

Pipeline and Custom Transformer with a Hands-On Case Study in Python

Installing and setting up Apache Mahout