Feature extraction and similar image search with OpenCV for newbies

Andrey Nikishaev
Machine Learning World
3 min read · Feb 15, 2018

I think all of you have seen Google Image Search and asked yourself “How does it work?”. Today I’ll answer that question, and we’ll build a simple script that performs image search right in your console.

Want to become an expert in Computer Vision & Object Detection?

Subscribe to the New Practical Course

Image features
For this task, first of all, we need to understand what an image feature is and how we can use it.
An image feature is a simple image pattern, based on which we can describe what we see in the image. For example, a cat’s eye will be a feature in an image of a cat. The main role of features in computer vision (and not only there) is to transform visual information into vector space. This gives us the possibility to perform mathematical operations on the vectors, for example, finding a similar vector (which leads us to a similar image or object in the image).
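To make that concrete, here is a tiny sketch of comparing two feature vectors with cosine similarity (the numbers are made up purely for illustration):

```python
import numpy as np

# Two hypothetical feature vectors (made-up values, just for illustration).
a = np.array([0.9, 0.1, 0.4])
b = np.array([0.8, 0.2, 0.5])

# Cosine similarity: 1.0 = same direction (very similar images),
# 0.0 = orthogonal (unrelated images).
cos_sim = np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))
print(cos_sim)  # ~0.98, so these two vectors are very close
```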

Ok, but how do we get these features from an image?
There are two ways of getting features from an image: the first is image descriptors (white-box algorithms), the second is neural nets (black-box algorithms). Today we will be working with the first one.

There are many algorithms for feature extraction; the most popular of them are SURF, ORB, SIFT, and BRIEF. Most of these algorithms are based on image gradients.
Today we will use the KAZE descriptor, simply because it ships with the base OpenCV library while the others do not, which simplifies installation.

So let’s write our feature extractor:
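A minimal sketch of such an extractor might look like this (the grayscale read, error handling, and zero-padding of failed extractions are my assumptions; the 32-keypoint / 64-value sizes match the description below):

```python
import os
import pickle

import cv2
import numpy as np


def extract_features(image_path, vector_size=32):
    """Build a fixed-size KAZE feature vector for one image."""
    image = cv2.imread(image_path, cv2.IMREAD_GRAYSCALE)
    try:
        alg = cv2.KAZE_create()
        # Detect keypoints and keep only the strongest vector_size of them,
        # so every image yields a vector of the same dimension.
        kps = alg.detect(image)
        kps = sorted(kps, key=lambda kp: -kp.response)[:vector_size]
        # Compute a 64-value descriptor for each kept keypoint.
        kps, dsc = alg.compute(image, kps)
        if dsc is None:
            # No detectable keypoints: fall back to an empty descriptor.
            dsc = np.zeros((0,), dtype=np.float32)
        dsc = dsc.flatten()
        # Pad with zeros if the image produced fewer keypoints,
        # so the result is always 32 * 64 = 2048 values.
        needed_size = vector_size * 64
        if dsc.size < needed_size:
            dsc = np.concatenate([dsc, np.zeros(needed_size - dsc.size)])
    except cv2.error as e:
        print('Error:', e)
        return None
    return dsc


def batch_extractor(images_path, pickled_db_path='features.pck'):
    """Extract features for every image in a folder and pickle them."""
    files = [os.path.join(images_path, p) for p in sorted(os.listdir(images_path))]
    result = {}
    for f in files:
        print('Extracting features from image %s' % f)
        result[os.path.basename(f).lower()] = extract_features(f)
    with open(pickled_db_path, 'wb') as fp:
        pickle.dump(result, fp)
```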

Most feature extraction algorithms in OpenCV share the same interface, so if you want to use, for example, SIFT, just replace KAZE_create with SIFT_create.

So extract_features first detects keypoints on the image (the center points of our local patterns). Their number can differ from image to image, so we add a clause that keeps our feature vector at a constant size (this is needed for the computation, because you can’t compare vectors of different dimensions).
Then we build vector descriptors based on our keypoints. Each descriptor has size 64 and we keep 32 of them, so our feature vector is 2048-dimensional.
batch_extractor simply runs our feature extractor in a batch over all our images and saves the feature vectors in a pickled file for further use.

Now it’s time to build our Matcher class, which will match our search image against the images in our database.
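A minimal sketch of such a Matcher, reusing extract_features and the features.pck file from above (the cos_cdist helper name is illustrative):

```python
import pickle

import numpy as np
from scipy.spatial import distance


class Matcher(object):
    def __init__(self, pickled_db_path='features.pck'):
        with open(pickled_db_path, 'rb') as fp:
            data = pickle.load(fp)
        # Skip images whose extraction failed, then stack the remaining
        # vectors into one big (n_images x 2048) matrix.
        items = [(k, v) for k, v in data.items() if v is not None]
        self.names = np.array([k for k, _ in items])
        self.matrix = np.array([v for _, v in items])

    def cos_cdist(self, vector):
        # Cosine distance between the query vector and every database row.
        v = vector.reshape(1, -1)
        return distance.cdist(self.matrix, v, 'cosine').reshape(-1)

    def match(self, image_path, topn=5):
        features = extract_features(image_path)
        img_distances = self.cos_cdist(features)
        # Smallest cosine distance = most similar image.
        nearest_ids = np.argsort(img_distances)[:topn].tolist()
        return self.names[nearest_ids].tolist(), img_distances[nearest_ids].tolist()
```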

Here we load the feature vectors from the previous step, create one big matrix from them, compute the cosine distance between the feature vector of our search image and the feature-vector database, and then output the top N results.
Of course, this is just a demo. For production it’s better to use an algorithm for fast (approximate) nearest-neighbor search over millions of images. I would recommend Annoy Index, which is simple to use and pretty fast (a search across 1M images takes about 2 ms).
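For reference, here is a rough sketch of how the same top-N lookup could be done with Annoy (the images.ann and query.jpg names and the number of trees are arbitrary choices):

```python
import pickle

from annoy import AnnoyIndex

VECTOR_DIM = 2048  # 32 keypoints * 64 descriptor values

# Build the index once, offline ('angular' is Annoy's cosine-like metric).
with open('features.pck', 'rb') as fp:
    data = pickle.load(fp)
names = [k for k, v in data.items() if v is not None]

index = AnnoyIndex(VECTOR_DIM, 'angular')
for i, name in enumerate(names):
    index.add_item(i, data[name])
index.build(10)  # more trees: slower build, better accuracy
index.save('images.ann')

# At query time: top 5 nearest neighbours of the search image's vector.
query = extract_features('query.jpg')
ids = index.get_nns_by_vector(query, 5)
print([names[i] for i in ids])
```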

Now just put it all together and run it:
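A minimal driver might look like this, assuming the sample images live in a local resources/images/ folder (the path and the number of query samples are arbitrary):

```python
import os
import random


def run():
    images_path = 'resources/images/'
    files = [os.path.join(images_path, p) for p in sorted(os.listdir(images_path))]

    # Index the whole folder, then query with 3 random images from it.
    batch_extractor(images_path)
    ma = Matcher('features.pck')

    for s in random.sample(files, 3):
        print('Query image:', s)
        names, match = ma.match(s, topn=3)
        for i in range(3):
            # Cosine distance is 0 for identical vectors, so 1 - distance
            # reads as an intuitive similarity score.
            print('Match %.2f: %s' % (1 - match[i], names[i]))


run()
```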

You can download this code from my GitHub.
Or run it right away in Google Colab (a free service for online computation, even with GPU support): https://colab.research.google.com/drive/1BwdSConGugBlGzPLLkXHTz2ahkdzEhQ9

Conclusion

When you run this code, you will see that the similar images are not always similar in the way we humans understand it. That’s because these algorithms are context-unaware: they are better at finding the same image, even a modified one, than at finding similar images. If we want to find contextually similar images, we should use a Convolutional Neural Network, and the next article will be about those, so don’t forget to follow me :)

Support

Become a Patron and support our community so we can make more interesting articles & tutorials

Get interesting articles every day: subscribe to the Telegram Channel

Read my other fresh articles
