�ݺ�ߣ

Image to Image Translation with
Conditional Adversarial Networks
Presented by: Shahbaz Ali Khan

Image to Image Translation
? Different representations of scenes
? Example: One concept in English and Urdu
? RGB Edges
? RGB Gray-scale
2

Image to Image Translation
? Primary objective remains the same
�C Map pixels to pixels
? Why use different approaches?
? Why not a general purpose solution?
? Problem: Hand-crafted loss functions
�C Telling the network what we want to minimize
3

Discriminative vs. Generative
? Discriminative: identify objects i.e. determine
class scores
? Generative: capture the underlying distribution
and produce samples from that distribution
4

Generative Adversarial Networks
? The solution to this problem is provided by
GANs
? GANs learn a loss function
? Two neural networks (example Art forgery)
�C Generator (G): synthesizes images
�C Discriminator (D): identifies fakes
? A zero sum game is played to reach equilibrium
5

U-Net Architecture
? Underlying structure doesn��t change during
translation
? Allow low level info to shuttle across the network
? Motivated by use of L1 loss
8

Experiments
Architecture labels Semantic labels to street
to photo scene
10

Experiments
Map to aerial photo BW to color
11

Evaluation of Results
? Amazon Mechanical Turk
�C Is this image real or fake?
? FCN score
�C Object recognition by off-the-shelf recognition
systems
15

Conclusion
? Investigation of Conditional GANs as a general
purpose solution for image to image translation
tasks.
? Presentation of effective architecture for Generator
and Discriminator.
? Lesser accuracy for tasks like semantic segmentation
? Failure cases: (highly) sparse inputs
21

�ݺ�ߣ

Image-to-Image Translation pix2pix

More Related Content

Image-to-Image Translation pix2pix

Editor's Notes