Generative Adversarial Nets Summary

Akash Bangera
2 min readSep 9, 2018

--

Link to the paper

Contribution

This paper introduces a new generative model that overcomes the difficulties faced by older models like Deep Belief Networks, Variational Autoencoders, etc.

Background

  1. Multilayer Perceptron Networks: A multilayer perceptron is a class of feedforward artificial neural network.
  2. Backpropagation: Backpropagation is a method used in artificial neural networks to calculate a gradient that is needed in the calculation of the weights to be used in the network.

Description

  1. This paper proposes the idea of adversarial training, in which the generative network competes with the discriminative network, which leads to both the networks becoming more efficient in their tasks.
  2. The generative network tries to learn a data distribution, and the discriminative network tries to classify between samples from original data and the generative network.

Methodology

  1. Two multilayer perceptron networks are used as Generator and Discriminator.
  2. The generator takes in noise z and gives as output a sample x=G(z).
  3. Discriminator takes in sample x and gives as output a probability D(x).
  4. D(x) represents whether sample x came from the generator or original data. D(x) will be closer to 1 if sample x came from original data, and it will be closer to 0 if sample x came from the generator.
  5. Generator tries to maximize the probability of Discriminator making a mistake.
  6. Discriminator tries to maximize the probability of assigning correct labels to the samples.
  7. Only backpropagation is used for obtaining gradients.
  8. After several iterations, the distribution of generated samples will overlap the distribution of original data.
  9. At this point, the discriminator fails to classify, giving output D(x)=0.5.

Experiments

  1. The networks were trained on datasets like MNIST, the Toronto Face Database, and CIFAR-10.
  2. The authors believe that the results obtained are at least competitive with other models in the literature.

Area of Applications

  1. Interactive Image Generation
  2. Text to Image Generation
  3. Image Editing
  4. Domain Transfer

--

--