ºÝºÝߣ

ºÝºÝߣShare a Scribd company logo
Deep
Learning
Applications
for fun and profit
Hello!I am Roberto Silveira
EE engineer, ML enthusiast
rsilveira79@gmail.com
@rsilveira79
¡°Fearing a rise of killer
robots is like worrying about
overpopulation on Mars.¡±
(Andrew Ng)
{ A short intro }
Deep Learning (1)
Artificial Intelligence
e.g. knowledge bases
Source: Deep Learning
(Goodfellow, Bengio, Courville)
Machine Learning
e.g. logistic regression
Representation Learning
e.g. shallow autoencoders
Deep Learning
e.g. MLPs
Source: Deep Learning
(Goodfellow, Bengio, Courville)
Rule-based
INPUT
OUTPUT CAT
Classic ML Deep Learning
Hand
designed
program
CAT
Hand
designed
features
Mapping
from
features
CAT
CAT
Features
Mapping
from
features
Simple
Features
+ more
abstract
features
Mapping
from
features
Representation Learning
Deep Learning (2)
Visualizing distributed representation
Source: Convolutional networks and applications in vision
(Yan LeCun et al., 2010)
Visualizing distributed representation
Source: Visualizing and Understanding Convolutional Networks
(Zeiler et al., 2013)
Visualizing distributed representation
Source: Visualizing and Understanding Convolutional Networks
(Zeiler et al., 2013)
Visualizing distributed representation
Source: Visualizing and Understanding Convolutional Networks
(Zeiler et al., 2013)
Start of a revolution - why now?
Availability of data Processing power
(GPUs)
Improved models/
techniques
(activations, regularization,
initialization, attention)
{ Art }
A Neural Algorithm of Artistic Style
Source: A Neural Algorithm of Artistic Style
(Gatys et al., 2015)
Style Content
La?a-Style
La?a-Style
Deep Dream
Low-level layer enhancement
High-level layer enhancement
Source: https://goo.gl/U4xhM6
Deep Dream
La?a-Dreams
La?a-Dreams
La?a-Dreams
La?a-Dreams
La?a-Dreams
La?a-Dreams
{ Music, Sound }
WaveNet - Speech Generation
Parametric
Concatenative
WaveNet
Music #2
Music #1
Source: WaveNet: A Generative Model for Raw Audio
(Oord et al., 2016)
Music recommendation
Collaborative filtering vs content based
Genre
Instruments
Mood
Year
Geolocation of artist
Lyrical themes
Low level (max) - Vibrato singing
Low level (avg) - noise, distortion
Low level (avg) - Chinese pop
Similarity Playlists - Fleet Foxes
Source: Deep content-based music recommendation
(Oord et al., 2013)
{ Image, Photos }
{ GANs
Generative Adversarial Networks
Source: Thomas Paula (slideshare)
Goal: produce counterfeit money
that is as similar as real money
Goal: distinguish between real and
counterfeit money
Generated from Gaussian or
Normal distribution (usually)
Generated instance
Training image
generator discriminator
}
GANs-Image Generation
Source: Generative Visual Manipulation on the Natural Image Manifold
(Zhu et al., 2016)
GANs-Super Resolution
Source: Photo-Realistic Single Image Super-Resolution Using a Generative
Adversarial Network, (Zhu et al., 2016)
bicubic SRResNet SRGAN Original (HR)
GANs-Super Resolution
Source: Photo-Realistic Single Image Super-Resolution Using a Generative
Adversarial Network, (Zhu et al., 2016)
bicubic SRResNet SRGAN Original (HR)
GANs-Super Resolution
Source: Photo-Realistic Single Image Super-Resolution Using a Generative
Adversarial Network, (Zhu et al., 2016)
bicubic SRResNet SRGAN Original (HR)
Semantic/Instance Segmentation
Source: Mask R-CNN (He et al., 2017)
Semantic/Instance Segmentation
Source: Mask R-CNN (He et al., 2017)
Semantic/Instance Segmentation
Source: Mask R-CNN (He et al., 2017)
{ Language, Text }
Language Translation
Non-monotonic
alignment
Source: Learning Phrase Representations using RNN
Encoder-Decoder for Statistical Machine Translation
(Cho et al., 2014)
Text Summarization
Source: A Deep Reinforced Model for Abstractive Summarization
(Paulus et al., 2017)
Extractive vs Abstractive
{ Chatbots, Personal
Assistants }
Conversational Bots
Source: A Neural Conversational Model
(Vinyals et al., 2015)
Google Smart Reply
Visual Questioning and Answering
Sources: (1)Dynamic Memory Networks for Visual and Textual Question Answering (Xiong et al.,
2016), (2)Inferring and Executing Programs for Visual Reasoning , (Johnson et al., 2017)
2-level input mechanism (visual + text)
{ Healthcare }
Biomedical image segmentation
Source: U-Net: Convolutional Networks for Biomedical Image Segmentation
(Ronneberger et al., 2015)
Disease prediction
Source: https://goo.gl/jGK2Jp
ML improvements:
Random forest: +1.7 %
Logistic regression: +3.2 %
Gradient boosting: +3.3 %
Neural nets: +3.6%
{ Agriculture }
Cucumber ¡°Prediction¡±
Source: https://goo.gl/Tvp3om
Crop yield prediction
Source: http://www.descarteslabs.com/
RGB image
Infrared image
Crop yield prediction
Source: http://www.descarteslabs.com/
Corn Prediction (State)
Crop yield prediction
Source: http://www.descarteslabs.com/
Soy Prediction (State)
Kaggle - Understanding the Amazon
Source: https://goo.gl/5eBxnr
Kaggle - Understanding the Amazon
Source: https://goo.gl/5eBxnr
Cloudy
Kaggle - Understanding the Amazon
Source: https://goo.gl/5eBxnr
Rain forest
Kaggle - Understanding the Amazon
Source: https://goo.gl/5eBxnr
Water (rivers & lakes)
Kaggle - Understanding the Amazon
Source: https://goo.gl/5eBxnr
Habitation
Kaggle - Understanding the Amazon
Source: https://goo.gl/5eBxnr
Agriculture
{ Sports }
Ghosting with Deep Imitation
Learning
Source: Data-Driven Ghosting using Deep Imitation Learning
(Le et al., 2017)
Role and formation discovery
4-5-1
{ Future }
Social AI ¡ú AI + emotions
understanding synthesis/generation
Jibo Cozmo
{ Conclusions }
- Deep Learning is here to stay
- AI won¡¯t kill humans (for the time being)
- AI will revolutionize labor market (already is)
- Nothing to do (1) ¡ú Learn Python
- Nothing to do (2) ¡ú Learn TensorFlow/Keras/PyTorch
- Nothing to do (3) ¡ú ML Andrew Ng (Coursera), Deep Learning (Udacity),
Stanford CS231n (Youtube), Stanford CS224d (Youtube), Oxford NLP
(Github)
- Nothing to do (4) ¡ú Practice @ Kaggle
https://www.meetup.com/Machine-Learning-Porto-Alegre/

More Related Content

Deep learning applications - for fun and profit