ºÝºÝߣ

ºÝºÝߣShare a Scribd company logo
Image Colorization Using CycleGAN
Zahil Shanis Yash Saraf Sai Varun
Generative Adversarial Networks
¡ñ Generative models with two competing differentiable functions, represented by
neural networks.
¡ñ Generator: Generates data from random noise using feedback from discriminator.
¡ñ Discriminator: A classifier to identify real data from fake (synthesized) data.
We train the generator to create data towards what the discriminator thinks is real.
CycleGAN
¡ñ Proposed by Jun-Yan Zhu, Taesung Park, Phillip Isola and Alexei A. Efros
¡ñ Performs unpaired image to image translation.
¡ñ Unpaired translation - doesn¡¯t require a training set of aligned image pairs.
¡ñ Cycle GAN can translate an image from a source domain X to a target domain Y in the
absence of paired examples.
Cycle GAN Architecture
¡ñ Architecture consists of two mappings: G : X -> Y and F : Y -> X.
¡ñ A generator G is used to translate real image from domain X to domain Y.
¡ñ A generator F is used to translate real image from domain Y to domain X.
¡ñ Discriminators (Dx and Dy) are used to discriminate real and fake images at respective
domains.
Cycle GAN Cost Function
¡ñ In addition to the Generator and Discriminator losses, CycleGAN uses one more type of
loss called Cycle Consistency Loss.
¡ñ This enforces that the input and generated output are recognizably the same.
¡ñ Final Objective Function is given by:
Image Colorization with Cycle GAN
¡ñ Colorize gray scale images using Cycle GAN architecture.
¡ñ Training on unpaired flowers dataset - domain X as gray scale images and domain Y as
color images.
Network Architecture
¡ñ Generator: A UNet like architecture with an encoder, transformer and decoder.
¡ñ Discriminator: PatchGANS which look at a ¡°patch¡± of the input image, and output the
probability of the patch being ¡°real¡±.
¡ñ Trained with a batch size of 1 with Adam as the optimizer.
Image Colorization Results
Network Modifications
1) Cycle GAN with Stochastic Generators
¡ñ Inter domain mapping from unpaired data need not always be one-to-one or
deterministic.
¡ñ Stochastic Cycle GAN - Generates multiple color images for a single grayscale image.
¡ñ Can be achieved by modifying the generator GAB to take a vector of noise and a sample
from the source domain, and generates a non-deterministic sample in the target domain.
¡ñ With different noise z ~ p(z), model can generate different domain B mappings.
¡ñ Inspired from Conditional Instance Normalization for Style Transfer paper by Huang et al.
¡ñ We are working on implementing this.
Network Modifications
2) Cycle GAN with Capsule Nets
¡ñ In CNN, Pooling layers are used to increase
the field of view and predict higher order
features by combining values.
¡ñ Use of Capsule Nets helps preserve
hierarchical pose relationships between
object parts.
Network Modifications
¡ñ Capsule Networks and GANs - Using a
Capsule Network as a discriminator to
better train the model to understand
spatial differences.
¡ñ Papers CapsGAN, and CapsuleGAN, takes
forward the idea by replacing the DCGAN
discriminator with CapsuleGANs.
Conditional GAN (pix2pix)
¡ñ Performs paired image to image translation.
¡ñ In an unconditioned generative model, there is no control on modes of the data being
generated.
¡ñ In the CGAN, the generator learns to generate a fake sample with a specific condition or
characteristics rather than a generic sample from unknown noise distribution.
Conditional GAN (pix2pix)
Training a conditional GAN
Combined Loss Function
References
¡ñ Cycle GAN paper by Zhu et al - https://arxiv.org/pdf/1703.10593.pdf
¡ñ Blog Cycle GAN - https://medium.com/@jonathan_hui/gan-cyclegan-6a50e7600d7
¡ñ Cycle GAN implementation - https://github.com/eriklindernoren/Keras-GAN
¡ñ Keras documentation - https://keras.io/
¡ñ CapsuleGAN implementation - https://github.com/gusgad/capsule-GAN/blob/master/capsule_gan.ipynb
¡ñ CapsGAN - https://arxiv.org/abs/1806.03968
¡ñ CapsuleGAN - https://arxiv.org/abs/1802.06167
¡ñ Capsule Networks - https://arxiv.org/abs/1710.09829
¡ñ Blog Capsule Networks - https://medium.com/ai%C2%B3-theory-practice-business/understanding-hintons-capsule-networks-
part-i-intuition-b4b559d1159b
¡ñ Conditional Instance Normalization - https://arxiv.org/pdf/1703.06868.pdf
¡ñ Pix2pix implemntation - https://github.com/eriklindernoren/Keras-GAN/tree/master/pix2pix
¡ñ Pix2pix - https://arxiv.org/abs/1611.07004

More Related Content

What's hot (20)

Blue brain
Blue brainBlue brain
Blue brain
Ishtdeep Hora
?
blue brain
blue brainblue brain
blue brain
Praveen Kumar Meena
?
Blue Brain - The Magic of Man
Blue Brain - The Magic of ManBlue Brain - The Magic of Man
Blue Brain - The Magic of Man
Pranith Chander
?
Using synthetic data for computer vision model training
Using synthetic data for computer vision model trainingUsing synthetic data for computer vision model training
Using synthetic data for computer vision model training
Unity Technologies
?
Blue Brain
Blue BrainBlue Brain
Blue Brain
Deepika Andol
?
A (Very) Gentle Introduction to Generative Adversarial Networks (a.k.a GANs)
 A (Very) Gentle Introduction to Generative Adversarial Networks (a.k.a GANs) A (Very) Gentle Introduction to Generative Adversarial Networks (a.k.a GANs)
A (Very) Gentle Introduction to Generative Adversarial Networks (a.k.a GANs)
Thomas da Silva Paula
?
Deep learning for 3-D Scene Reconstruction and Modeling
Deep learning for 3-D Scene Reconstruction and Modeling Deep learning for 3-D Scene Reconstruction and Modeling
Deep learning for 3-D Scene Reconstruction and Modeling
Yu Huang
?
Blue Brain-Nidhi Bisht
Blue Brain-Nidhi BishtBlue Brain-Nidhi Bisht
Blue Brain-Nidhi Bisht
Nidhi Bisht
?
Screenless Display PPT Presentation
Screenless Display PPT PresentationScreenless Display PPT Presentation
Screenless Display PPT Presentation
Sai Mohith
?
Machine Learning for Medical Image Analysis: What, where and how?
Machine Learning for Medical Image Analysis:What, where and how?Machine Learning for Medical Image Analysis:What, where and how?
Machine Learning for Medical Image Analysis: What, where and how?
Debdoot Sheet
?
Screenless display
Screenless display Screenless display
Screenless display
Thiruvarasan Varasan
?
ChatGPT_ppf.pdf
ChatGPT_ppf.pdfChatGPT_ppf.pdf
ChatGPT_ppf.pdf
ssuser693b9a
?
Neural Network Based Brain Tumor Detection using MR Images
Neural Network Based Brain Tumor Detection using MR ImagesNeural Network Based Brain Tumor Detection using MR Images
Neural Network Based Brain Tumor Detection using MR Images
Aisha Kalsoom
?
Imagen: Photorealistic Text-to-Image Diffusion Models with Deep Language Unde...
Imagen: Photorealistic Text-to-Image Diffusion Models with Deep Language Unde...Imagen: Photorealistic Text-to-Image Diffusion Models with Deep Language Unde...
Imagen: Photorealistic Text-to-Image Diffusion Models with Deep Language Unde...
Vitaly Bondar
?
Brain tumor detection using image segmentation ppt
Brain tumor detection using image segmentation pptBrain tumor detection using image segmentation ppt
Brain tumor detection using image segmentation ppt
Roshini Vijayakumar
?
Blue brain Technology
Blue brain TechnologyBlue brain Technology
Blue brain Technology
Monica Murarisetti
?
Generative adversarial networks
Generative adversarial networksGenerative adversarial networks
Generative adversarial networks
Yunjey Choi
?
A Three-Dimensional Representation method for Noisy Point Clouds based on Gro...
A Three-Dimensional Representation method for Noisy Point Clouds based on Gro...A Three-Dimensional Representation method for Noisy Point Clouds based on Gro...
A Three-Dimensional Representation method for Noisy Point Clouds based on Gro...
Sergio Orts-Escolano
?
Brain Tumor Segmentation using Enhanced U-Net Model with Empirical Analysis
Brain Tumor Segmentation using Enhanced U-Net Model with Empirical AnalysisBrain Tumor Segmentation using Enhanced U-Net Model with Empirical Analysis
Brain Tumor Segmentation using Enhanced U-Net Model with Empirical Analysis
MD Abdullah Al Nasim
?
Blue brain seminar by bhaumik patel
Blue brain seminar by bhaumik patelBlue brain seminar by bhaumik patel
Blue brain seminar by bhaumik patel
bhaumikpatel004
?
Blue Brain - The Magic of Man
Blue Brain - The Magic of ManBlue Brain - The Magic of Man
Blue Brain - The Magic of Man
Pranith Chander
?
Using synthetic data for computer vision model training
Using synthetic data for computer vision model trainingUsing synthetic data for computer vision model training
Using synthetic data for computer vision model training
Unity Technologies
?
A (Very) Gentle Introduction to Generative Adversarial Networks (a.k.a GANs)
 A (Very) Gentle Introduction to Generative Adversarial Networks (a.k.a GANs) A (Very) Gentle Introduction to Generative Adversarial Networks (a.k.a GANs)
A (Very) Gentle Introduction to Generative Adversarial Networks (a.k.a GANs)
Thomas da Silva Paula
?
Deep learning for 3-D Scene Reconstruction and Modeling
Deep learning for 3-D Scene Reconstruction and Modeling Deep learning for 3-D Scene Reconstruction and Modeling
Deep learning for 3-D Scene Reconstruction and Modeling
Yu Huang
?
Blue Brain-Nidhi Bisht
Blue Brain-Nidhi BishtBlue Brain-Nidhi Bisht
Blue Brain-Nidhi Bisht
Nidhi Bisht
?
Screenless Display PPT Presentation
Screenless Display PPT PresentationScreenless Display PPT Presentation
Screenless Display PPT Presentation
Sai Mohith
?
Machine Learning for Medical Image Analysis: What, where and how?
Machine Learning for Medical Image Analysis:What, where and how?Machine Learning for Medical Image Analysis:What, where and how?
Machine Learning for Medical Image Analysis: What, where and how?
Debdoot Sheet
?
Neural Network Based Brain Tumor Detection using MR Images
Neural Network Based Brain Tumor Detection using MR ImagesNeural Network Based Brain Tumor Detection using MR Images
Neural Network Based Brain Tumor Detection using MR Images
Aisha Kalsoom
?
Imagen: Photorealistic Text-to-Image Diffusion Models with Deep Language Unde...
Imagen: Photorealistic Text-to-Image Diffusion Models with Deep Language Unde...Imagen: Photorealistic Text-to-Image Diffusion Models with Deep Language Unde...
Imagen: Photorealistic Text-to-Image Diffusion Models with Deep Language Unde...
Vitaly Bondar
?
Brain tumor detection using image segmentation ppt
Brain tumor detection using image segmentation pptBrain tumor detection using image segmentation ppt
Brain tumor detection using image segmentation ppt
Roshini Vijayakumar
?
Generative adversarial networks
Generative adversarial networksGenerative adversarial networks
Generative adversarial networks
Yunjey Choi
?
A Three-Dimensional Representation method for Noisy Point Clouds based on Gro...
A Three-Dimensional Representation method for Noisy Point Clouds based on Gro...A Three-Dimensional Representation method for Noisy Point Clouds based on Gro...
A Three-Dimensional Representation method for Noisy Point Clouds based on Gro...
Sergio Orts-Escolano
?
Brain Tumor Segmentation using Enhanced U-Net Model with Empirical Analysis
Brain Tumor Segmentation using Enhanced U-Net Model with Empirical AnalysisBrain Tumor Segmentation using Enhanced U-Net Model with Empirical Analysis
Brain Tumor Segmentation using Enhanced U-Net Model with Empirical Analysis
MD Abdullah Al Nasim
?
Blue brain seminar by bhaumik patel
Blue brain seminar by bhaumik patelBlue brain seminar by bhaumik patel
Blue brain seminar by bhaumik patel
bhaumikpatel004
?

Similar to Image colorization (20)

Let's paint a Picasso - A Look at Generative Adversarial Networks (GAN) and i...
Let's paint a Picasso - A Look at Generative Adversarial Networks (GAN) and i...Let's paint a Picasso - A Look at Generative Adversarial Networks (GAN) and i...
Let's paint a Picasso - A Look at Generative Adversarial Networks (GAN) and i...
Catalina Arango
?
Volodymyr Lyubinets ¡°Generative models for images¡±
Volodymyr Lyubinets  ¡°Generative models for images¡±Volodymyr Lyubinets  ¡°Generative models for images¡±
Volodymyr Lyubinets ¡°Generative models for images¡±
Lviv Startup Club
?
Decomposing image generation into layout priction and conditional synthesis
Decomposing image generation into layout priction and conditional synthesisDecomposing image generation into layout priction and conditional synthesis
Decomposing image generation into layout priction and conditional synthesis
Naeem Shehzad
?
Brain Tumour Detection.pptx
Brain Tumour Detection.pptxBrain Tumour Detection.pptx
Brain Tumour Detection.pptx
RevolverRaja2
?
Deep Generative Models - Kevin McGuinness - UPC Barcelona 2018
Deep Generative Models - Kevin McGuinness - UPC Barcelona 2018Deep Generative Models - Kevin McGuinness - UPC Barcelona 2018
Deep Generative Models - Kevin McGuinness - UPC Barcelona 2018
Universitat Polit¨¨cnica de Catalunya
?
TransNeRF
TransNeRFTransNeRF
TransNeRF
NavneetPaul2
?
Domain adaptation for Image Segmentation
Domain adaptation for Image SegmentationDomain adaptation for Image Segmentation
Domain adaptation for Image Segmentation
Deepak Thukral
?
Alberto Massidda - Scenes from a memory - Codemotion Rome 2019
Alberto Massidda - Scenes from a memory - Codemotion Rome 2019Alberto Massidda - Scenes from a memory - Codemotion Rome 2019
Alberto Massidda - Scenes from a memory - Codemotion Rome 2019
Codemotion
?
Automatic Detection of Window Regions in Indoor Point Clouds Using R-CNN
Automatic Detection of Window Regions in Indoor Point Clouds Using R-CNNAutomatic Detection of Window Regions in Indoor Point Clouds Using R-CNN
Automatic Detection of Window Regions in Indoor Point Clouds Using R-CNN
Zihao(Gerald) Zhang
?
Deep Generative Modelling
Deep Generative ModellingDeep Generative Modelling
Deep Generative Modelling
Petko Nikolov
?
Generative adversarial network_Ayadi_Alaeddine
Generative adversarial network_Ayadi_AlaeddineGenerative adversarial network_Ayadi_Alaeddine
Generative adversarial network_Ayadi_Alaeddine
Deep Learning Italia
?
190410 sc fegan-face_editing_generative_adversarial_network_with_users_sketch...
190410 sc fegan-face_editing_generative_adversarial_network_with_users_sketch...190410 sc fegan-face_editing_generative_adversarial_network_with_users_sketch...
190410 sc fegan-face_editing_generative_adversarial_network_with_users_sketch...
ÁÁºê ÌÙ¾®
?
Face-GAN project report.pptx
Face-GAN project report.pptxFace-GAN project report.pptx
Face-GAN project report.pptx
AndleebFatima16
?
Face-GAN project report
Face-GAN project reportFace-GAN project report
Face-GAN project report
AndleebFatima16
?
Generation of Deepfake images using GAN and Least squares GAN.ppt
Generation of Deepfake images using GAN and Least squares GAN.pptGeneration of Deepfake images using GAN and Least squares GAN.ppt
Generation of Deepfake images using GAN and Least squares GAN.ppt
DivyaGugulothu
?
Implementing Neural Style Transfer
Implementing Neural Style Transfer Implementing Neural Style Transfer
Implementing Neural Style Transfer
Tahsin Mayeesha
?
Variational Autoencoders For Image Generation
Variational Autoencoders For Image GenerationVariational Autoencoders For Image Generation
Variational Autoencoders For Image Generation
Jason Anderson
?
Seeing what a gan cannot generate: paper review
Seeing what a gan cannot generate: paper reviewSeeing what a gan cannot generate: paper review
Seeing what a gan cannot generate: paper review
QuantUniversity
?
[NS][Lab_Seminar_250203]KAG-prompt (1).pptx
[NS][Lab_Seminar_250203]KAG-prompt (1).pptx[NS][Lab_Seminar_250203]KAG-prompt (1).pptx
[NS][Lab_Seminar_250203]KAG-prompt (1).pptx
thanhdowork
?
[NS][Lab_Seminar_250203]KAG-prompt (1).pptx
[NS][Lab_Seminar_250203]KAG-prompt (1).pptx[NS][Lab_Seminar_250203]KAG-prompt (1).pptx
[NS][Lab_Seminar_250203]KAG-prompt (1).pptx
thanhdowork
?
Let's paint a Picasso - A Look at Generative Adversarial Networks (GAN) and i...
Let's paint a Picasso - A Look at Generative Adversarial Networks (GAN) and i...Let's paint a Picasso - A Look at Generative Adversarial Networks (GAN) and i...
Let's paint a Picasso - A Look at Generative Adversarial Networks (GAN) and i...
Catalina Arango
?
Volodymyr Lyubinets ¡°Generative models for images¡±
Volodymyr Lyubinets  ¡°Generative models for images¡±Volodymyr Lyubinets  ¡°Generative models for images¡±
Volodymyr Lyubinets ¡°Generative models for images¡±
Lviv Startup Club
?
Decomposing image generation into layout priction and conditional synthesis
Decomposing image generation into layout priction and conditional synthesisDecomposing image generation into layout priction and conditional synthesis
Decomposing image generation into layout priction and conditional synthesis
Naeem Shehzad
?
Brain Tumour Detection.pptx
Brain Tumour Detection.pptxBrain Tumour Detection.pptx
Brain Tumour Detection.pptx
RevolverRaja2
?
Domain adaptation for Image Segmentation
Domain adaptation for Image SegmentationDomain adaptation for Image Segmentation
Domain adaptation for Image Segmentation
Deepak Thukral
?
Alberto Massidda - Scenes from a memory - Codemotion Rome 2019
Alberto Massidda - Scenes from a memory - Codemotion Rome 2019Alberto Massidda - Scenes from a memory - Codemotion Rome 2019
Alberto Massidda - Scenes from a memory - Codemotion Rome 2019
Codemotion
?
Automatic Detection of Window Regions in Indoor Point Clouds Using R-CNN
Automatic Detection of Window Regions in Indoor Point Clouds Using R-CNNAutomatic Detection of Window Regions in Indoor Point Clouds Using R-CNN
Automatic Detection of Window Regions in Indoor Point Clouds Using R-CNN
Zihao(Gerald) Zhang
?
Deep Generative Modelling
Deep Generative ModellingDeep Generative Modelling
Deep Generative Modelling
Petko Nikolov
?
Generative adversarial network_Ayadi_Alaeddine
Generative adversarial network_Ayadi_AlaeddineGenerative adversarial network_Ayadi_Alaeddine
Generative adversarial network_Ayadi_Alaeddine
Deep Learning Italia
?
190410 sc fegan-face_editing_generative_adversarial_network_with_users_sketch...
190410 sc fegan-face_editing_generative_adversarial_network_with_users_sketch...190410 sc fegan-face_editing_generative_adversarial_network_with_users_sketch...
190410 sc fegan-face_editing_generative_adversarial_network_with_users_sketch...
ÁÁºê ÌÙ¾®
?
Face-GAN project report.pptx
Face-GAN project report.pptxFace-GAN project report.pptx
Face-GAN project report.pptx
AndleebFatima16
?
Generation of Deepfake images using GAN and Least squares GAN.ppt
Generation of Deepfake images using GAN and Least squares GAN.pptGeneration of Deepfake images using GAN and Least squares GAN.ppt
Generation of Deepfake images using GAN and Least squares GAN.ppt
DivyaGugulothu
?
Implementing Neural Style Transfer
Implementing Neural Style Transfer Implementing Neural Style Transfer
Implementing Neural Style Transfer
Tahsin Mayeesha
?
Variational Autoencoders For Image Generation
Variational Autoencoders For Image GenerationVariational Autoencoders For Image Generation
Variational Autoencoders For Image Generation
Jason Anderson
?
Seeing what a gan cannot generate: paper review
Seeing what a gan cannot generate: paper reviewSeeing what a gan cannot generate: paper review
Seeing what a gan cannot generate: paper review
QuantUniversity
?
[NS][Lab_Seminar_250203]KAG-prompt (1).pptx
[NS][Lab_Seminar_250203]KAG-prompt (1).pptx[NS][Lab_Seminar_250203]KAG-prompt (1).pptx
[NS][Lab_Seminar_250203]KAG-prompt (1).pptx
thanhdowork
?
[NS][Lab_Seminar_250203]KAG-prompt (1).pptx
[NS][Lab_Seminar_250203]KAG-prompt (1).pptx[NS][Lab_Seminar_250203]KAG-prompt (1).pptx
[NS][Lab_Seminar_250203]KAG-prompt (1).pptx
thanhdowork
?

Recently uploaded (20)

Artificial intelligence based solar vehicle.pptx
Artificial intelligence based solar vehicle.pptxArtificial intelligence based solar vehicle.pptx
Artificial intelligence based solar vehicle.pptx
rrabin2
?
Wireless-Charger presentation for seminar .pdf
Wireless-Charger presentation for seminar .pdfWireless-Charger presentation for seminar .pdf
Wireless-Charger presentation for seminar .pdf
AbhinandanMishra30
?
IoT-based-Electrical-Motor-Fault-Detection-System.pptx
IoT-based-Electrical-Motor-Fault-Detection-System.pptxIoT-based-Electrical-Motor-Fault-Detection-System.pptx
IoT-based-Electrical-Motor-Fault-Detection-System.pptx
atharvapardeshi03
?
Defining the Future of Biophilic Design in Crete.pdf
Defining the Future of Biophilic Design in Crete.pdfDefining the Future of Biophilic Design in Crete.pdf
Defining the Future of Biophilic Design in Crete.pdf
ARENCOS
?
ESIT135 Problem Solving Using Python Notes of Unit-1 and Unit-2
ESIT135 Problem Solving Using Python Notes of Unit-1 and Unit-2ESIT135 Problem Solving Using Python Notes of Unit-1 and Unit-2
ESIT135 Problem Solving Using Python Notes of Unit-1 and Unit-2
prasadmutkule1
?
Taykon-Kalite belgeleri
Taykon-Kalite belgeleriTaykon-Kalite belgeleri
Taykon-Kalite belgeleri
TAYKON
?
INTERNET OF THINGSSSSSSSSSSSSSSSSSSSSSSSSS.pptx
INTERNET OF THINGSSSSSSSSSSSSSSSSSSSSSSSSS.pptxINTERNET OF THINGSSSSSSSSSSSSSSSSSSSSSSSSS.pptx
INTERNET OF THINGSSSSSSSSSSSSSSSSSSSSSSSSS.pptx
bmit1
?
Scientific_Thinking_and_Method_GE3791.pptx
Scientific_Thinking_and_Method_GE3791.pptxScientific_Thinking_and_Method_GE3791.pptx
Scientific_Thinking_and_Method_GE3791.pptx
aravym456
?
AIR FILTER system in internal combustion engine system.ppt
AIR FILTER system in internal combustion engine system.pptAIR FILTER system in internal combustion engine system.ppt
AIR FILTER system in internal combustion engine system.ppt
thisisparthipan1
?
-PPT-5-Wind-Energy conversion slides contents
-PPT-5-Wind-Energy conversion   slides contents-PPT-5-Wind-Energy conversion   slides contents
-PPT-5-Wind-Energy conversion slides contents
senthilkumarmamse
?
Dijkstra Shortest Path Algorithm in Network.ppt
Dijkstra Shortest Path Algorithm in Network.pptDijkstra Shortest Path Algorithm in Network.ppt
Dijkstra Shortest Path Algorithm in Network.ppt
RAJASEKARAN G
?
Failover System in Cloud Computing System
Failover System in Cloud Computing SystemFailover System in Cloud Computing System
Failover System in Cloud Computing System
Hitesh Mohapatra
?
Cloud Cost Optimization for GCP, AWS, Azure
Cloud Cost Optimization for GCP, AWS, AzureCloud Cost Optimization for GCP, AWS, Azure
Cloud Cost Optimization for GCP, AWS, Azure
vinothsk19
?
eng funda notes.pdfddddddddddddddddddddddd
eng funda notes.pdfdddddddddddddddddddddddeng funda notes.pdfddddddddddddddddddddddd
eng funda notes.pdfddddddddddddddddddddddd
aayushkumarsinghec22
?
ESIT135 Problem Solving Using Python Notes of Unit-3
ESIT135 Problem Solving Using Python Notes of Unit-3ESIT135 Problem Solving Using Python Notes of Unit-3
ESIT135 Problem Solving Using Python Notes of Unit-3
prasadmutkule1
?
Von karman Equation full derivation .pdf
Von karman Equation full derivation  .pdfVon karman Equation full derivation  .pdf
Von karman Equation full derivation .pdf
Er. Gurmeet Singh
?
The Golden Gate Bridge a structural marvel inspired by mother nature.pptx
The Golden Gate Bridge a structural marvel inspired by mother nature.pptxThe Golden Gate Bridge a structural marvel inspired by mother nature.pptx
The Golden Gate Bridge a structural marvel inspired by mother nature.pptx
AkankshaRawat75
?
Biases, our brain and software development
Biases, our brain and software developmentBiases, our brain and software development
Biases, our brain and software development
Matias Iacono
?
Design and Analysis of Algorithms Unit 5
Design and Analysis of Algorithms Unit 5Design and Analysis of Algorithms Unit 5
Design and Analysis of Algorithms Unit 5
sureshkumara29
?
direct current machine first part about machine.pdf
direct current machine first part about machine.pdfdirect current machine first part about machine.pdf
direct current machine first part about machine.pdf
sahilshah890338
?
Artificial intelligence based solar vehicle.pptx
Artificial intelligence based solar vehicle.pptxArtificial intelligence based solar vehicle.pptx
Artificial intelligence based solar vehicle.pptx
rrabin2
?
Wireless-Charger presentation for seminar .pdf
Wireless-Charger presentation for seminar .pdfWireless-Charger presentation for seminar .pdf
Wireless-Charger presentation for seminar .pdf
AbhinandanMishra30
?
IoT-based-Electrical-Motor-Fault-Detection-System.pptx
IoT-based-Electrical-Motor-Fault-Detection-System.pptxIoT-based-Electrical-Motor-Fault-Detection-System.pptx
IoT-based-Electrical-Motor-Fault-Detection-System.pptx
atharvapardeshi03
?
Defining the Future of Biophilic Design in Crete.pdf
Defining the Future of Biophilic Design in Crete.pdfDefining the Future of Biophilic Design in Crete.pdf
Defining the Future of Biophilic Design in Crete.pdf
ARENCOS
?
ESIT135 Problem Solving Using Python Notes of Unit-1 and Unit-2
ESIT135 Problem Solving Using Python Notes of Unit-1 and Unit-2ESIT135 Problem Solving Using Python Notes of Unit-1 and Unit-2
ESIT135 Problem Solving Using Python Notes of Unit-1 and Unit-2
prasadmutkule1
?
Taykon-Kalite belgeleri
Taykon-Kalite belgeleriTaykon-Kalite belgeleri
Taykon-Kalite belgeleri
TAYKON
?
INTERNET OF THINGSSSSSSSSSSSSSSSSSSSSSSSSS.pptx
INTERNET OF THINGSSSSSSSSSSSSSSSSSSSSSSSSS.pptxINTERNET OF THINGSSSSSSSSSSSSSSSSSSSSSSSSS.pptx
INTERNET OF THINGSSSSSSSSSSSSSSSSSSSSSSSSS.pptx
bmit1
?
Scientific_Thinking_and_Method_GE3791.pptx
Scientific_Thinking_and_Method_GE3791.pptxScientific_Thinking_and_Method_GE3791.pptx
Scientific_Thinking_and_Method_GE3791.pptx
aravym456
?
AIR FILTER system in internal combustion engine system.ppt
AIR FILTER system in internal combustion engine system.pptAIR FILTER system in internal combustion engine system.ppt
AIR FILTER system in internal combustion engine system.ppt
thisisparthipan1
?
-PPT-5-Wind-Energy conversion slides contents
-PPT-5-Wind-Energy conversion   slides contents-PPT-5-Wind-Energy conversion   slides contents
-PPT-5-Wind-Energy conversion slides contents
senthilkumarmamse
?
Dijkstra Shortest Path Algorithm in Network.ppt
Dijkstra Shortest Path Algorithm in Network.pptDijkstra Shortest Path Algorithm in Network.ppt
Dijkstra Shortest Path Algorithm in Network.ppt
RAJASEKARAN G
?
Failover System in Cloud Computing System
Failover System in Cloud Computing SystemFailover System in Cloud Computing System
Failover System in Cloud Computing System
Hitesh Mohapatra
?
Cloud Cost Optimization for GCP, AWS, Azure
Cloud Cost Optimization for GCP, AWS, AzureCloud Cost Optimization for GCP, AWS, Azure
Cloud Cost Optimization for GCP, AWS, Azure
vinothsk19
?
eng funda notes.pdfddddddddddddddddddddddd
eng funda notes.pdfdddddddddddddddddddddddeng funda notes.pdfddddddddddddddddddddddd
eng funda notes.pdfddddddddddddddddddddddd
aayushkumarsinghec22
?
ESIT135 Problem Solving Using Python Notes of Unit-3
ESIT135 Problem Solving Using Python Notes of Unit-3ESIT135 Problem Solving Using Python Notes of Unit-3
ESIT135 Problem Solving Using Python Notes of Unit-3
prasadmutkule1
?
Von karman Equation full derivation .pdf
Von karman Equation full derivation  .pdfVon karman Equation full derivation  .pdf
Von karman Equation full derivation .pdf
Er. Gurmeet Singh
?
The Golden Gate Bridge a structural marvel inspired by mother nature.pptx
The Golden Gate Bridge a structural marvel inspired by mother nature.pptxThe Golden Gate Bridge a structural marvel inspired by mother nature.pptx
The Golden Gate Bridge a structural marvel inspired by mother nature.pptx
AkankshaRawat75
?
Biases, our brain and software development
Biases, our brain and software developmentBiases, our brain and software development
Biases, our brain and software development
Matias Iacono
?
Design and Analysis of Algorithms Unit 5
Design and Analysis of Algorithms Unit 5Design and Analysis of Algorithms Unit 5
Design and Analysis of Algorithms Unit 5
sureshkumara29
?
direct current machine first part about machine.pdf
direct current machine first part about machine.pdfdirect current machine first part about machine.pdf
direct current machine first part about machine.pdf
sahilshah890338
?

Image colorization

  • 1. Image Colorization Using CycleGAN Zahil Shanis Yash Saraf Sai Varun
  • 2. Generative Adversarial Networks ¡ñ Generative models with two competing differentiable functions, represented by neural networks. ¡ñ Generator: Generates data from random noise using feedback from discriminator. ¡ñ Discriminator: A classifier to identify real data from fake (synthesized) data. We train the generator to create data towards what the discriminator thinks is real.
  • 3. CycleGAN ¡ñ Proposed by Jun-Yan Zhu, Taesung Park, Phillip Isola and Alexei A. Efros ¡ñ Performs unpaired image to image translation. ¡ñ Unpaired translation - doesn¡¯t require a training set of aligned image pairs. ¡ñ Cycle GAN can translate an image from a source domain X to a target domain Y in the absence of paired examples.
  • 4. Cycle GAN Architecture ¡ñ Architecture consists of two mappings: G : X -> Y and F : Y -> X. ¡ñ A generator G is used to translate real image from domain X to domain Y. ¡ñ A generator F is used to translate real image from domain Y to domain X. ¡ñ Discriminators (Dx and Dy) are used to discriminate real and fake images at respective domains.
  • 5. Cycle GAN Cost Function ¡ñ In addition to the Generator and Discriminator losses, CycleGAN uses one more type of loss called Cycle Consistency Loss. ¡ñ This enforces that the input and generated output are recognizably the same. ¡ñ Final Objective Function is given by:
  • 6. Image Colorization with Cycle GAN ¡ñ Colorize gray scale images using Cycle GAN architecture. ¡ñ Training on unpaired flowers dataset - domain X as gray scale images and domain Y as color images. Network Architecture ¡ñ Generator: A UNet like architecture with an encoder, transformer and decoder. ¡ñ Discriminator: PatchGANS which look at a ¡°patch¡± of the input image, and output the probability of the patch being ¡°real¡±. ¡ñ Trained with a batch size of 1 with Adam as the optimizer.
  • 8. Network Modifications 1) Cycle GAN with Stochastic Generators ¡ñ Inter domain mapping from unpaired data need not always be one-to-one or deterministic. ¡ñ Stochastic Cycle GAN - Generates multiple color images for a single grayscale image. ¡ñ Can be achieved by modifying the generator GAB to take a vector of noise and a sample from the source domain, and generates a non-deterministic sample in the target domain. ¡ñ With different noise z ~ p(z), model can generate different domain B mappings. ¡ñ Inspired from Conditional Instance Normalization for Style Transfer paper by Huang et al. ¡ñ We are working on implementing this.
  • 9. Network Modifications 2) Cycle GAN with Capsule Nets ¡ñ In CNN, Pooling layers are used to increase the field of view and predict higher order features by combining values. ¡ñ Use of Capsule Nets helps preserve hierarchical pose relationships between object parts.
  • 10. Network Modifications ¡ñ Capsule Networks and GANs - Using a Capsule Network as a discriminator to better train the model to understand spatial differences. ¡ñ Papers CapsGAN, and CapsuleGAN, takes forward the idea by replacing the DCGAN discriminator with CapsuleGANs.
  • 11. Conditional GAN (pix2pix) ¡ñ Performs paired image to image translation. ¡ñ In an unconditioned generative model, there is no control on modes of the data being generated. ¡ñ In the CGAN, the generator learns to generate a fake sample with a specific condition or characteristics rather than a generic sample from unknown noise distribution.
  • 12. Conditional GAN (pix2pix) Training a conditional GAN Combined Loss Function
  • 13. References ¡ñ Cycle GAN paper by Zhu et al - https://arxiv.org/pdf/1703.10593.pdf ¡ñ Blog Cycle GAN - https://medium.com/@jonathan_hui/gan-cyclegan-6a50e7600d7 ¡ñ Cycle GAN implementation - https://github.com/eriklindernoren/Keras-GAN ¡ñ Keras documentation - https://keras.io/ ¡ñ CapsuleGAN implementation - https://github.com/gusgad/capsule-GAN/blob/master/capsule_gan.ipynb ¡ñ CapsGAN - https://arxiv.org/abs/1806.03968 ¡ñ CapsuleGAN - https://arxiv.org/abs/1802.06167 ¡ñ Capsule Networks - https://arxiv.org/abs/1710.09829 ¡ñ Blog Capsule Networks - https://medium.com/ai%C2%B3-theory-practice-business/understanding-hintons-capsule-networks- part-i-intuition-b4b559d1159b ¡ñ Conditional Instance Normalization - https://arxiv.org/pdf/1703.06868.pdf ¡ñ Pix2pix implemntation - https://github.com/eriklindernoren/Keras-GAN/tree/master/pix2pix ¡ñ Pix2pix - https://arxiv.org/abs/1611.07004

Editor's Notes

  • #3: Conceptually speaking, discriminator in the network give guidance to generator on what data to create. As we train the networks arternatively, eventually, the discriminator identifies the tiny difference between the real and the generated, and the generator creates images that the discriminator cannot tell the difference. The GAN model eventually converges and produces data indistinguishable from real data.
  • #10: Max pooling loses important relevant information about translational and rotational relationship between the objects. Using Primary Caps, the digit caps are calculated using Dynamic routing algorithms. Now, these are fed to the decoder network. The model is trained using Reconstruction loss from decoder network.
  • #11: Promising results for MNIST and CIFAR datasets. Outperforms simple GAN by 2-3 percent.
  • #12: In the Conditional GAN (CGAN), the generator learns to generate a fake sample with a specific condition or characteristics (such as a label associated with an image or more detailed tag) rather than a generic sample from unknown noise distribution.
  • #13: Training a conditional GAN . The conditioning image, x is applied as the input to the generator and as input to the discriminator. The generator in this case is trying to learn how to colorize a black and white image. The discriminator is looking at the generator¡¯s colorization attempts and trying to learn to tell the difference between the colorizations the generator provides and the true colorized target image provided in the dataset. The discriminator, D, learns to classify between fake (synthesized by the generator) and real images. The generator, G, learns to fool the discriminator.