Opening the eyes of the machine: Computer vision with AutoML (Part 1)

Gavita Regunath
Advancing Analytics
8 min readMay 25, 2022

By Gavita Regunath and Dan Lantos

Automated Machine Learning, or AutoML, has gained a lot of popularity and traction in both the research and business sectors.

Why?

61% OF DATA AND ANALYTICS DECISION-MAKERS WHOSE FIRMS ARE ADOPTING ARTIFICIAL INTELLIGENCE SAID THEY HAD IMPLEMENTED AUTOML SOFTWARE OR ARE IN THE PROCESS OF IMPLEMENTING IT”

FORRESTER RESEARCH

AutoML has proven to empower citizen data scientists and/or boost existing data scientists’ productivity whilst enabling businesses to gain valuable insights. There is no doubt that AutoML has helped democratise and accelerate the development of machine learning models in businesses. Recently cloud providers have introduced the capability of AutoML for Computer Vision tasks.

This article consists of two parts, part one aims to set the scene by introducing the concept of computer vision, how it is used widely in many industries, and how AutoML for computer vision aims to democratise machine learning.

In part two, we will provide a step-by-step guide on using AutoML for computer vision to jumpstart your computer vision journey.

WHAT IS AUTOML?

When you consider a typical machine learning lifecycle, many crucial steps need to be taken before developing a successful machine learning model. AutoML aims to do a lot of the heavy lifting by automating the most tedious parts of the machine learning lifecycle to produce the best model. Until recently, most AutoML tools were able to solve one of the three machine learning problems: regression, classification, and time-series forecasting problems.

However, more recently, cloud vendors have responded to AI and machine learning trends and have provided the capability to perform AutoML in computer vision. The computer vision market is expected to grow at a CAGR of 7.7% to reach USD 18.24 billion by 2025, according to a report by Grand View Research.

This is not surprising when you consider how tightly integrated computer vision is with our daily lives. Unlocking mobile devices through facial images, authorising payments through facial recognition when using mobile banking, using Google lens to translate content in real-time, and unlocking the front door using facial recognition are examples of computer vision in action. Having the ability to leverage AutoML in computer vision will be a game-changer for many businesses.

WHAT IS COMPUTER VISION? 💻👓

Computer vision is a field of artificial intelligence that uses machine learning to analyse images captured by cameras or satellites to derive meaningful information. Essentially, computer vision aims to mimic how humans process information from images.

In part two, we will demonstrate how to perform image classification using Azure’s new capability of AutoML for computer vision. At the time of writing this article, the AutoML for computer vision functionality (in preview) offers support for computer vision tasks such as multi-label image classification, image classification, object detection and image segmentation.

If you’re curious what the distinctions are, the diagram below might help. Beginning on the right, multi-label image classification and image classification are where the model identifies what is in an image. Because there are four distinct animals in the first image, it is classified as a multi-label image classification issue, as opposed to the second image, which only has a dog. Object detection models generate a bounding box around specific objects, whereas image segmentation models provide the exact outline of the object within an image.

WHY SHOULD WE CARE ABOUT COMPUTER VISION? 🤷🏽‍♀️🤷🏽‍♂️

The field of computer vision is progressing rapidly and is creating a lot of excitement. With ongoing research and technological advancement, computer vision is transforming industries globally, from autonomous vehicles to face recognition and diagnosing from x-ray images. Let’s explore the most popular computer vision application use cases across a number of industries to provide more context. Note that the market for computer vision is continually expanding and is being adopted in more industries than what is listed below.

1. Finance

Computer vision technology is beginning to significantly impact the financial services industry. Banks like the Spanish bank BBVA already use face recognition to onboard their customers remotely, reducing their onboarding times from hours to minutes. Facial recognition and retina scanning are also helping financial institutions to improve security procedures and therefore reducing fraud.

In the insurance business, Tokio Marine, a Japanese-based property and casualty insurer use computer vision to analyse and evaluate damaged cars, speeding up the evaluation process.

Auto-insurer Tokio Marine use computer vision system for examining damaged vehicles. Source: isurancejournal.com

2. Healthcare

The use of computer vision applications for healthcare is often regarded as a turning point in medical image processing and diagnosis. It has already proven to be highly effective at saving hundreds of patient lives. Some examples of how computer vision is used in the healthcare industry is the detection of various illness such as cancerous cells, the accurate detection of blood loss during childbirth and generating accurate and precise reports based on medical imaging for example detecting lung disease from X-Ray imaging. More recently, one popular way in which computer vision was used in the healthcare industry was utilised in the COVID-Net device developed by Darwin AI, Canada. COVID-Net was used successfully for detecting COVID-19 cases using digital chest x-ray radiography (CXR).

Machine Learning and Computer Vision play an important part here in detecting breast cancers well on time. Source: New York Times

3. Manufacturing

Some of the common applications of computer vision in the manufacturing industry are for predictive maintenance where cameras and sensors are used to monitor and take corrective actions on machinery. Other popular examples include the use of computer vision to monitor the quality of goods on a production line, to detect if workers are wearing suitable protective equipment and for the identification of defective products.

Computer vision used to detect hard hats on workers

4. Retail

Computer vision is being utilised in the retail business to analyse consumer behaviour and activity, which eventually delivers valuable insight back to the retailer. Retailers such as Sephora and Samsonite, for example, use computer vision in their stores to better understand their customers’ behaviours, test new merchandising strategies, and experiment with more successful layouts. Amazon Go, a cashier-less checkout uses computer vision to provide their customer with a hassle-free shopping experience, where customers walk in, pick items they want and simply walk out of the store.

Amazon Go uses computer vision to detect when a customer taken an item from the shelf and automatically calculates the prices. Source: Amazon.com

5. Transportation

Computer vision has been used in transportation for at least a decade now. Lane tracking, vehicle detection, traffic signs detection and pedestrian detection are core areas embedded within self-driving cars that utilise computer vision. Apart from self-driving cars, other examples of where computer vision is being used in the transportation sector are for parking occupancy detection, automated license plate recognition and road condition monitoring.

Tesla cars’ Autopilot enables the driver to steer the car, accelerate and brake automatically within its lane. Source: Tesla Autopilot

6. Agriculture

In agriculture, there are several examples of how computer vision is being used to improve the industry. RSIP Vision, has developed software to predict crop yield using computer vision and machine learning. Performing automatic weeding within large crops, which requires precision identification and classification of weeds, is another example of where computer vision is being used. Livestock monitoring, irrigation management, plant disease detection, and insect detection are a few other popular examples of how computer vision is used in agriculture.

RSIP vision uses computer vision and deep learning to perform agricultural yield prediction. Source: rsipvision.com

7. Biodiversity conservation

There are many ways in which computer vision is used for biodiversity conservation efforts. According to a recent Wildlabs.net analysis, computer vision is one of the top three promising conservation technologies. Camera trap imaging which is a way to automatically photograph animal species in the field is quickly becoming the gold standard in biodiversity conservation. Gramener and Microsoft — AI for Earth have devised a solution using computer vision to monitor the Emperor penguin population that is at serious risk.

Microsoft — AI for Earth and Gramener are helping save the penguins in Antarctica!🐧🐧. Source: Gramener

8. Security and Surveillance

Facial detection and recognition are some of the most prominent computer vision technology used to detect and prevent crime. In fact, in most public places like car parks, bus stations, underground stations, and motorways, computer vision is used to monitor and prevent criminal activities. On social media platforms, face detection algorithms are used to detect and stop the spread of fake news.

9. Fitness

Recently, computer vision has become an extremely popular application in the sports and fitness industry. Some examples of computer vision are tracking players in a match to offer insight, pose estimation of athletes to provide sports analytics, and using video assistant referees (VARs) to review decisions made.

In fact, Peloton recently released the Peloton Guide, where is uses machine learning and smart camera technology to track movements and offer recommendations. It is similar to having a personal trainer at home providing the right advice to gain maximum benefit.

Peloton Guide allows you to track your movements using computer vision technology. Source: onepeloton.com

10. Advertising

When it comes to advertising, the success of an advert is measured by analysing customers’ engagement. With computer vision, advertising agencies can now recognise customers’ emotional reactions and use this information to deliver personalised and targeted ads. One of the examples of this is the AI poster by M&C Saatchi, London, which uses computer vision to capture people’s emotions and provide targeted content.

Artificial Intelligence Poster, Oxford Street London by M&C Saatchi created the first ever artificially intelligent poster campaign in the world, which evolves unique ads based on how people react to it.

Summary

So far, we’ve covered the basics of AutoML and computer vision, as well as numerous examples of its use in different industries.

If you want to get started with computer vision but lack the technical know-how, or if you wanted to increase the productivity of your data science team, utilising AutoML for computer vision is a great place to start. It uses minimal code to expedite the creation of computer vision models for tasks such as image classification, object detection and instance segmentation. In the following article, we will provide a step-by-step guide that will guide you through the process of using Azure AutoML in computer vision capability for object detection.

Originally published at https://www.advancinganalytics.co.uk on May 23, 2022.

--

--

Gavita Regunath
Advancing Analytics

MVP AI | Principal Data Scientist at Advancing Analytics