Stable Diffusion — ControlNet Clearly Explained!

Generating images from line art, scribble, or pose key points using Stable Diffusion and ControlNet.

Steins
6 min read · Jun 6, 2023
An image generated using Stable Diffusion with ControlNet

ControlNet is a neural network that controls a pretrained image diffusion model (e.g. Stable Diffusion). It accepts a conditioning image, such as line art, a scribble, or pose key points, as an additional input, and that image is then used to guide the image generation.
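To make the idea of "controlling" a pretrained model concrete, here is a toy NumPy sketch. It is not the real ControlNet code: `frozen_block`, `ToyControlBranch`, and `controlled_block` are hypothetical names, and small matrices stand in for convolution layers. It assumes the zero-initialized connections described in the ControlNet paper, so the side branch starts out having no effect on the pretrained model and only begins to steer the output as its weights are trained.

```python
import numpy as np


def frozen_block(x):
    """Stand-in for a locked, pretrained layer (hypothetical toy function)."""
    return np.tanh(x)


class ToyControlBranch:
    """Trainable side branch whose output is added to the frozen block's output."""

    def __init__(self, dim, seed=0):
        rng = np.random.default_rng(seed)
        # Zero-initialized projection that injects the conditioning signal.
        self.w_in = np.zeros((dim, dim))
        # Stand-in for the trainable copy of the pretrained weights.
        self.w_copy = rng.normal(size=(dim, dim)) * 0.1
        # Zero-initialized projection on the way back into the main model.
        self.w_out = np.zeros((dim, dim))

    def __call__(self, x, cond):
        hidden = np.tanh((x + cond @ self.w_in) @ self.w_copy)
        return hidden @ self.w_out


def controlled_block(x, cond, branch):
    """Frozen output plus the control branch's contribution."""
    return frozen_block(x) + branch(x, cond)


x = np.ones((2, 4))            # toy feature map
cond = np.full((2, 4), 0.5)    # toy conditioning features
branch = ToyControlBranch(dim=4)

# Before any training, the branch contributes exactly zero.
print(np.allclose(controlled_block(x, cond, branch), frozen_block(x)))
```

Once `w_in` and `w_out` move away from zero during training, different conditioning inputs produce different outputs, which is what lets an edge map or a pose steer the generated image.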

Table of Contents

├─ What Does ControlNet Do?
├─ Inner Architecture
│ ├─ Feedforward
│ └─ Backpropagation
├─ Architecture with Stable Diffusion
│ ├─ Encoder
│ └─ Overall Architecture
├─ Training
├─ Conditioning
│ ├─ Canny Edge
│ ├─ Line Art
│ ├─ Scribble
│ ├─ Hough Line
│ ├─ Semantic Segmentation
│ ├─ Depth
│ ├─ Normal Map
│ └─ Open Pose
├─ Summary
└─ References

What Does ControlNet Do?

