A Comprehensive Exploration of Computer Vision: Past, Present, and Future (Part 4)

Swanand Katdare
3 min readAug 17, 2023

--

Photo by Conor Luddy on Unsplash

Emergence of Machine Learning:

Welcome back to our exploration of the evolution of computer vision! In this part of our blog, we’ll delve into an exciting era when machine learning techniques began to shape the world of computer vision. This era marked a significant step forward as computers started learning patterns directly from data, reducing the need for handcrafted features. Let’s dive into how machine learning techniques were introduced to computer vision and the challenges they encountered.

Imagine teaching a computer to recognize objects in images, not by explicitly telling it what features to look for, but by showing it thousands of examples. This is the power of machine learning! In the mid-2000s, researchers started applying machine learning techniques to computer vision. They believed that by feeding computers a massive amount of data, they could learn to recognize objects on their own, just as we do.

One of the first tasks tackled using machine learning in computer vision was image classification. Image classification involves teaching computers to categorize images into different classes. For example, recognizing whether an image contains a cat or a dog. Traditional machine learning algorithms like Support Vector Machines (SVMs) and Random Forests were used for this purpose.

Researchers would first gather a dataset of images, with each image labelled according to its content. For instance, an image of a cat would be labelled as “cat.” The computer would then learn from these labelled examples to distinguish between different classes.

Limited Success Due to Feature Engineering Challenges:

Remember how we discussed extracting features from images in the pre-deep learning era? Well, even in the machine learning era, feature engineering remained a challenge. While machine learning allowed computers to learn patterns, it still relied heavily on handcrafted features as inputs. These features needed to be carefully chosen to ensure the algorithms could recognize objects accurately.

Another challenge was the curse of dimensionality. Imagine you’re trying to teach a computer to recognize faces. Faces have many features, like eyes, nose, and mouth. But when you feed all these features into a machine learning algorithm, the number of dimensions (or features) can become overwhelming. This led to a phenomenon known as the curse of dimensionality, where algorithms struggled to perform well due to the high number of dimensions.

Overfitting was another hurdle. Imagine teaching a computer so well that it recognizes the images it learned from perfectly but struggles with new, unseen images. This is overfitting — the algorithm becomes too specific to the training data and fails to generalize to new data. Balancing between fitting the training data and generalizing to new data was a delicate task.

The emergence of machine learning techniques marked a pivotal era in the evolution of computer vision. Researchers began to explore ways in which computers could learn patterns directly from data, opening up new possibilities for image recognition and classification. Traditional machine learning algorithms like SVMs and Random Forests were introduced to tackle tasks like image classification. However, challenges such as feature engineering, the curse of dimensionality, and the balance between overfitting and generalization limited the success of these approaches.

Little did the world know that these challenges would pave the way for the revolutionary breakthroughs of deep learning. In the next part of our journey, we’ll dive into the era that changed everything — the era of deep learning and neural networks. Stay tuned for an exciting exploration of how these technologies transformed the field of computer vision!

--

--