DL : CNN & Computer vision

Part 4.2 of Deep Learning Specialization

Pisit J.
Sum up As A Service
3 min readApr 3, 2020

--

1. Classification vs Localization vs Detection

1.1 Object Localization

boundary center (bx, by)

boundary frame (bw, bh)

1.2 Loss Function

- p = 1 (object in image) : Sum(predict-actual)²

- p = 0 (no object in image) : (prob.predict — prob.actual)²

1.3 Anchor box for Overlapping objects

For each sliding window,

you predict N Anchor boxes for N class of objects

1.4 Object Detection — YOLO algorithm (You Look Only Once)

1. Slide image into small windows

2. Predict class & boundary box of each window

Classes of objects found in each window

Boundary of objects found in each window

3. Non-Max Suppression

- Remove low possible classes of objects

- Remove boundary boxes of the same objects

2. Face Verification vs Face Recognition

3. Neural Style Transfer

Content image + Style image = Generated image

Loss Function = (Wc* Lcontent) + (Ws * Lstyle)

Reference

Deep Learning Specialization: Convolutional Neural Network (Coursera) (Youtube)

--

--