[PR113] D3: MISO Multimodal Image-to-Image Translation

Sieun Park
Aug 19, 2021

--

Original Paper: MISO: Mutual Information Loss with Stochastic Style Representations for Multimodal Image-to-Image Translation

  1. Image-to-image translation is fundamentally a multimodal problem. Previous methods
  2. The authors assume a hierarchy between domain-invariant features(content) and domain-specific features(style) instead of a strictly dividing style and content. MISO pipeline is designed on this idea. The Mutual Information LOss(MILO) loss maximizes the mutual information between the feature and the image generated from that feature.
  3. MISO was able to outperform other unpaired multimodal translation models in variety and quality.
User study results

--

--

Sieun Park

17-year-old Korean, loves reading and writing about AI, DL💘. Passionate️ 🔥 about learning new technology. Original blog: https://medium.com/@sieunpark77