Microsoft Azure: Understand computer vision

Gani Çalışkan
Turk Telekom Bulut Teknolojileri
4 min readAug 4, 2022

Definition of computer vision

Computer Vision is an area of AI that deals with visual processing. Let’s explore some of the possibilities that computer vision brings.

Computer Vision models and capabilities

Most computer vision solutions are based on machine learning models that can be applied to visual input from cameras, videos, or images. There are some few tasks that supports these models particularly.

This is the demo of to analyze the image by using computer vision which you can see every phases of the applications. You can try the demo by clicking the link has shown below.

1)Image classification

Image classification involves training a machine learning model to classify images based on their contents. For instance, in a traffic monitoring solution you might use an image classification model to classify images based on the type of vehicle they include, such as taxis, buses, cyclists etc.

2)Object detection

Object detection machine learning models are trained to classify individual objects within an image, and identify their location with a bounding box. For example, a traffic monitoring solution might use object detection to identify the location of different classes of vehicle.

3)Semantic segmentation

Semantic segmentation is an advanced machine learning technique in which individual pixels in the image are classified according to the object to which they belong. For example, a traffic monitoring solution might overlay traffic images with “mask” layers to highlight different vehicles using specific colors.

4)Image analysis

You can create solutions that combine machine learning models with advanced image analysis techniques to extract information from images, including “tags” that could help catalog the image or even descriptive captions that summarize and conclude the scene shown in the image.

5) Face detection, analysis, and recognition

Face detection is a specialized form of object detection that locates human faces in an image. This can be combined with classification and facial geometry analysis techniques to recognize individuals based on their facial features.

6)Optical character recognition (OCR)

Optical character recognition is a technique used to detect and read text in images. You can use OCR to read text in photographs (for example, road signs or store fronts) or to extract information from scanned documents such as letters, invoices, or forms.

Computer vision services in Microsoft Azure

a) Computer Vision

You can use this service to analyze images and video, and extract descriptions, tags, objects, and text.

b) Custom Vision

Use this service to train custom image classification and object detection models using your own images.

c) Face

The Face service enables you to build face detection and facial recognition solutions.

d) Form Recognizer

Use this service to extract information from scanned forms and invoices.

This topic inspired me while I’m taking various courses include role based and specialization pathway subjects which are supportive to IT and Software careers. Although you are not a IT and software graduate or graduated from IT related field, begineer level courses would assist you to start your new career. You can check the links to read these kind of topics and start to using Azure cloud computing applications on workspaces.

Resources

--

--