Applications of Deep Learning for real-time Object Detection
The global computer vision market was valued at $27.3 Billion in 2019 with a CAGR of 19% from 2020 to 2027 . Object detection is one of the core computer vision tasks that has a broad range of industrial applications such as -
- Cancer detection in radiology-based images [Healthcare]
- Detection of manufacturing defects, factory floor surveillance [Manufacturing]
- Detection of seat belts, parking in restricted areas [Public Safety]
- Stock level analysis and inventory management [Retail]
What is object detection?
There are four types of visual recognition tasks in computer vision. First, image classification, which is the assignment of labels to images, for example, labelling cows in a picture of farm as cows. Second, object detection, which is to not only label the cow but also to locate the cow using a bounding box. Third, semantic segmentation, which predicts the labels for each pixel of an image without differentiation between objects with the same label. Fourth, instance segmentation, which involves labelling as well as segmentation.
What is Deep Learning?
Deep learning is a subset of machine learning, that can process data from a very wide variety of sources. Compared to traditional machine learning, it requires lesser data preprocessing by humans and can often produce more accurate predictions from the data. In deep learning, interconnected layers of software-based calculators known as neurons, form a neural network. There are layers of such neurons, hence the word “deep” neural network. The network ingests data and processes them through each layer of the neural network, which each layer learning increasingly complex features of the data.
Once a deep neural network has learned how to make determinations from input data correctly, it can then use what it has learned to make determinations about new data. For example, once it learns what an object looks like, it can recognize the object in a new image. In other words, a deep neural network that has learned how to recognize cows, can quick detect cows in new images.
Technical detail: How does deep learning for object detection work?
Recent trends in applications of deep learning for object detection
Overall, the accuracy and performance of state-of-the-art deep learning models reported in 2019 are significantly higher than those of the previous years. Higher accuracy has a profound impact on application of the technology in medical imaging as well as surveillance systems. Improvement in performance means results can be inferred much faster on modern edge-based computing systems, paving the way for applications such as real-time drone based video analytics.
Specifically, the new improvements to deep learning models came by way of the following advancements:
- Face Detection Mean Average Precision went above 90%
Face detection is a computer vision problem to detect human faces in images, which is the first step to applications such as face verification, face alignment and facial recognition. Face detection is different from generic object detection in two ways. First, the range of scale of objects is larger in face detection and blurring is more common in face detection. Second, face detection has a single target and depends strongly on the structural characteristics of the face.
WIDER FACE is currently the most commonly used benchmark for evaluating face detection algorithms. The high variance of face scales and large number of faces per image make WIDER FACE the hardest benchmark for face detection, with three evaluation metrics (easy, medium and hard).
In 2019, PyramidBox++ , VIM-FD , ISRN , Retinaface , AlnnoFace  and RefineFace  all reported mAP scores of greater than 90% for the easy, medium and hard metrics. This is a significant improvment over the previous years.
2. Recent trends in pedestrian detection
CityPersons is a new and challenging benchmark for pedestrian detection. The dataset is split into different subsets according to the height and visibility level of the objects, and thus it’s able to evaluate the detectors in a more comprehensive manner. In 2019, the APD model reported a 30% improvement in the object detection performance over 2018 .
These and many more applications of this concept that can help enterprises in various types of monitoring and analysis. ‘Object Detection in Videos’ is the first topic to be covered in our upcoming workshop “Applied AI for Beginners” where we will train engineers from enterprises about the basics of AI along with hands-on implementation.
To participate in our future e-workshops and learn more about AI products & solutions for your enterprise (in any industry), please visit unitx.io & feel free to contact us.
Digital Transformation through Enterprise AI
We provide AI solutions, products and services for various industrial use cases. Grow your sales & revenues, reduce…
#ComputerVision #Artificial Intelligence #MachineLearning
 Xiongwei Wu Doyen S, Steven C.H. Hoi, Recent advances in deep learning for object detection, Neurocomputing, 396 (2020).
 Z. Li, X. Tang, J. Han, J. Liu, Pyramidbox ++ : high performance detector for finding tiny face, 2019. arXiv: 1904.00386 .
 Y. Zhang, X. Xu, X. Liu, Robust and high performance face detector, 2019. arXiv: 1901.02350 .
 S. Zhang, R. Zhu, X. Wang, H. Shi, T. Fu, S. Wang, T. Mei, S.Z. Li, Improved selective refinement network for face detection, 2019. arXiv: 1901.06651 .
 J. Deng, J. Guo, Y. Zhou, J. Yu, I. Kotsia, S. Zafeiriou, Retinaface: single-stage dense face localisation in the wild, 2019. arXiv: 1905.00641 .
 F. Zhang, X. Fan, G. Ai, J. Song, Y. Qin, J. Wu, Accurate face detection for high performance, 2019. arXiv: 1905.00641 .
 S. Zhang, C. Chi, Z. Lei, S.Z. Li, Refineface: refinement neural network for high performance face detection, 2019. arXiv: 1909.04376 .
 Y. Hu, J. Xie, J. Zhang, L. Lin, Y. Li, S.C. Hoi, Attribute-aware pedestrian detec- tion in a crowd, 2019. arXiv: 1910.09188 .