DL : CNN & Computer vision

Part 4.2 of Deep Learning Specialization

Published in

Sum up As A Service

3 min readApr 3, 2020

--

1. Classification vs Localization vs Detection

1.1 Object Localization

boundary center (bx, by)
boundary frame (bw, bh)

1.2 Loss Function

- p = 1 (object in image) : Sum(predict-actual)²
- p = 0 (no object in image) : (prob.predict — prob.actual)²

1.3 Anchor box for Overlapping objects

For each sliding window,
you predict N Anchor boxes for N class of objects

1.4 Object Detection — YOLO algorithm (You Look Only Once)

1. Slide image into small windows

2. Predict class & boundary box of each window

Classes of objects found in each window

Boundary of objects found in each window

3. Non-Max Suppression
- Remove low possible classes of objects
- Remove boundary boxes of the same objects

2. Face Verification vs Face Recognition

3. Neural Style Transfer

Content image + Style image = Generated image

Loss Function = (Wc* Lcontent) + (Ws * Lstyle)

Related Articles

Reference

Deep Learning Specialization: Convolutional Neural Network (Coursera) (Youtube)

Convolutional Network

Computer Vision

Machine Learning

Pisit J.

Written by Pisit J.

Editor for

Sum up As A Service

Software Engineer.

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams