ºÝºÝߣ

ºÝºÝߣShare a Scribd company logo
Global AI Bootcamp Seoul
3D Environment HomeNavi
Language,
Vision
& Action
???(Paul Kim)
HomeNavi Introduction
RL approach
- Value-based
- Policy Search
- Evolution Strategy
- ¡­.
RL approach? ??
????? ??? ???? ??¡­.
??? ????
Motivated paper
Target-driven Visual Navigation Model using
Deep Reinforcement Learning(Y Zhu, 2016)
??? ??!!
Mobile Robot
A mobile robot is a robot that is capable of locomotion
- wikipedia-
Mobile Robot
A mobile robot is a robot that is capable of locomotion
- wikipedia-
?, ???
Model-base? ??
RL????
??? ???
?????!!!
Domain skills
? Camera motion
? Robotics / Manipulation
? APIs
Language
ActionsVision
? Image / video
understanding
? 3D environment perception
? Instruction following
? Question answering
? Dialog
LV&A
LV&A
- Language
- Embedding
- RNN
- Attention
- ¡­
- Vision
- CNN
- YOLO extensions
- ¡­
- Action
- Actor-Critic
- Value Based Approach
- Policy Optimization
- HRL(Hierarchical RL)
- ¡­
Environments
Deepmind Lab
AI2-THOR
MINOS
Matterport3D
Environments & Tasks
Navigation with Vision & RL
unsupervised reinforcement and auxiliary learning agent
Environment
- Deepmind Lab
Sensory Inputs
- Image
Auxiliary Task
- Pixel Control
- Reward Prediction
- Value Function Replay
Control
- A3C
Navigation with Vision & RL
Learning to Navigate in Complex Environments
Environment
- Deepmind Lab
Sensory Inputs
- Image
Auxiliary Task
- Depth prediction
- Loop Closure prediction
Control
- A3C
Navigation with Vision & RL
Target-driven Visual Navigation in Indoor Scenes using Deep Reinforcement Learning
Environment
- AI2-THOR
Sensory Inputs
- Image
- Target image
Control
- Siamese Network
- Actor-critic
Navigation with Vision & RL
????
??? ?? ??? ?? ???
?? ???
? ?¡­
Navigation with Vision & RL
Example
reinforcement learning with unsupervised
auxiliary tasks(M Jaderberg et al, 2016)
??? ??!!
Navigation with Vision & RL
Example
reinforcement learning with unsupervised
auxiliary tasks(M Jaderberg et al, 2016)
????
?? ???
???
?? ???¡­
Navigation with Vision & RL
Example
reinforcement learning with unsupervised
auxiliary tasks(M Jaderberg et al, 2016)
Vision
based
What is Language Grounding?
?? Vision??? ?????
??? ???? Agent?
???? Language?
??? ? ?? ??? ???¡­
What is Language Grounding?
Pick up a cup
Go to the bedroom
Empty the trash can
Go to the kitchen
Wash dishes
¡­
¡­
Multi-Modality Representation
Language??? Vision???
??? ?? ???? ???? ?? ??? ? ??? ??!
Navigation with Vision,
Language(Instructions) & RL
Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning
Environment
- MALMO
Sensory Inputs
- Image : RGB image
- Instruction : Analogy Making
Hiearchical Structure
- Parameterized Skills
- Meta Contoller
- Pointer
Control
- Actor-Critic(GAE)
Navigation with Vision,
Language(Instructions) & RL
Grounded Language Learning in a Simulated 3D World
Environment
- Deepmind Lab
Sensory Inputs
- Image : RGB image
- Instruction
¡°green object next to the red object¡±
Auxiliary Tasks
- UNREAL
- Temporal AutoEncoder(tAE)
- Language Prediction(LP)
State Representation
- Concat
Control
- A3C
Navigation with Vision,
Language(Instructions) & RL
Gated-Attention Architectures for Task-Oriented Language Grounding
Environment
- VizDoom
Sensory Inputs
- Image : RGB image
- Instruction(templete)
¡°Go to the tallest red pillar¡±
State Representation
- Gated-Attention
Module(Attention based)
Control
- A3C
Navigation with Vision,
Language(Instructions) & RL
Building Generalizable Agents with a Realistic and Rich 3D Environment
Environment
- House3D(SUNCG base)
Sensory Inputs
- Image
- RGB only
- RGB + Depth
- Mask + Depth
- Instruction
¡°Go to Kitchen¡±
State Representation
Gated-Attention
Module(Attention based)
Control
- A3C, DDPG
Navigation with Vision,
Language(Instructions) & RL
?????
Language? ???
Agent? ???? ??
??? ? ??¡­
Navigation with Vision,
Language(Instructions) & RL Example
Zero-Shot Task Generalization with Multi-Task
Deep Reinforcement Learning(Oh et al, 2017)
??? ??!!
Navigation with Vision,
Language(Instructions) & RL Example
Zero-Shot Task Generalization with Multi-Task
Deep Reinforcement Learning(Oh et al, 2017)
?????
?? ????
???
????¡­
Navigation with Vision,
Language(Instructions) & RL Example
Zero-Shot Task Generalization with Multi-Task
Deep Reinforcement Learning(Oh et al, 2017)
Vision
Language
based
Question Answering
???? Agent?
Language? ???? ?? ??
??? ?? ???? ???
??? ??? ???¡­
Question Answering
??? ??!!
Question Answering
Navigation with Vision,
Language(QA) & RL
IQA: Visual Question Answering in Interactive Environments
Environment
- AI2-THOR
Sensory Inputs
- Image
- IQUAD dataset
¡°Is there a cup in the microwave?¡±
Hiearchical Structure
- Hierarchical Interactive Memory
Network
- Planner
- Semantic Memory
- Submodules
- ????? ??(ex. YOLO)
Control
- A3C, HIMN
Navigation with Vision,
Language(QA) & RL
Embodied Question Answering
Environment
- House3D(SUNCG base)
Sensory Inputs
- Image
- RGB image
- Segmentation mask
- Depth
- Instruction
¡°What color is the car?¡±
- Navigation
- Pretrain then fine tuning with
REINFORCE
- Question, Answering
- EQA dataset
Control
- A3C, PACMAN
- Imitation Learning
Navigation with Vision,
Language(QA) & RL
?? ?????
Question Answering
? ???? agent?
???? ?? ??? ? ?¡­
Navigation with Vision,
Language(QA) & RL Example
IQA: Visual Question Answering in
Interactive Environments(D Gordon et al, 2017)
??? ??!!
Navigation with Vision,
Language(QA) & RL Example
IQA: Visual Question Answering in
Interactive Environments(D Gordon et al, 2017)
Question Answering?
??? ????
?? ???
?? ???¡­
Navigation with Vision,
Language(QA) & RL Example
IQA: Visual Question Answering in
Interactive Environments(D Gordon et al, 2017)
Vision
Language(QA)
based
RL Korea HomaNavi
??? RL_Korea
HomeNavi????
? ?????
?????? ?? ??,
??? ??? ?? ??¡­..
?? ????!!!
????? ??? ??
??? ?????
(?? ?? ??¡­???? ??)
Reinforcement
Learning Korea
RL Korea & Modulabs
Reinforcement
Learning Korea
??? ???
LV&A Lab
Lab ??¡­
??? ???
LV&A Lab
??? ?????..
????
Thank You
Thank
You

More Related Content

Similar to 2018 global ai_bootcamp_seoul_HomeNavi(Reinforcement Learning, AI) (13)

PPTX
[AAAI21] Self-Domain Adaptation for Face Anti-Spoofing
KIMMINHA3
?
PPTX
White box in Computer Vision
Jaehyuk Heo
?
PPTX
Enhancing VAEs for collaborative filtering : flexible priors & gating mechanisms
seungwoo kim
?
PDF
ESM Machine learning 5?? Review by Mario Cho
Mario Cho
?
PDF
Koss 1605 machine_learning_mariocho_t10
Mario Cho
?
PPTX
Vision Transformer(ViT) / An Image is Worth 16*16 Words: Transformers for Ima...
changedaeoh
?
PDF
Deep learning image recognition for autonomous driving(classification, objec...
?? ?
?
PDF
Nlp and transformer (v3s)
H K Yoon
?
PPTX
PPT - Discovering Reinforcement Learning Algorithms
Jisang Yoon
?
PDF
?? ??? AI
NAVER Engineering
?
PPTX
Dream2Control paper review
taeseon ryu
?
PDF
Gen AI with LLM for construction technology
Tae wook kang
?
PPTX
Pytorch kr devcon
jaewon lee
?
[AAAI21] Self-Domain Adaptation for Face Anti-Spoofing
KIMMINHA3
?
White box in Computer Vision
Jaehyuk Heo
?
Enhancing VAEs for collaborative filtering : flexible priors & gating mechanisms
seungwoo kim
?
ESM Machine learning 5?? Review by Mario Cho
Mario Cho
?
Koss 1605 machine_learning_mariocho_t10
Mario Cho
?
Vision Transformer(ViT) / An Image is Worth 16*16 Words: Transformers for Ima...
changedaeoh
?
Deep learning image recognition for autonomous driving(classification, objec...
?? ?
?
Nlp and transformer (v3s)
H K Yoon
?
PPT - Discovering Reinforcement Learning Algorithms
Jisang Yoon
?
Dream2Control paper review
taeseon ryu
?
Gen AI with LLM for construction technology
Tae wook kang
?
Pytorch kr devcon
jaewon lee
?

More from Yechan(Paul) Kim (7)

PDF
????? LV&A ??? Navigation Agent
Yechan(Paul) Kim
?
PDF
Neural module Network
Yechan(Paul) Kim
?
PDF
Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Ne...
Yechan(Paul) Kim
?
PDF
Multiagent Cooperative and Competition with Deep Reinforcement Learning
Yechan(Paul) Kim
?
PPTX
Diversity is all you need(DIAYN) : Learning Skills without a Reward Function
Yechan(Paul) Kim
?
PDF
pyconkr 2018 RL_Adventure : Rainbow(value based Reinforcement Learning)
Yechan(Paul) Kim
?
PDF
pycon2018 "RL Adventure : DQN ?? Rainbow DQN??"
Yechan(Paul) Kim
?
????? LV&A ??? Navigation Agent
Yechan(Paul) Kim
?
Neural module Network
Yechan(Paul) Kim
?
Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Ne...
Yechan(Paul) Kim
?
Multiagent Cooperative and Competition with Deep Reinforcement Learning
Yechan(Paul) Kim
?
Diversity is all you need(DIAYN) : Learning Skills without a Reward Function
Yechan(Paul) Kim
?
pyconkr 2018 RL_Adventure : Rainbow(value based Reinforcement Learning)
Yechan(Paul) Kim
?
pycon2018 "RL Adventure : DQN ?? Rainbow DQN??"
Yechan(Paul) Kim
?
Ad

Recently uploaded (20)

PDF
Disk Evolution Study Through Imaging of Nearby Young Stars (DESTINYS): Eviden...
S¨¦rgio Sacani
?
PPTX
Microbes Involved In Malaria, Microbiology
UMME54
?
PDF
HOW TO DEAL WITH THREATS FROM THE FORCES OF NATURE FROM OUTER SPACE.pdf
Faga1939
?
PPTX
Clinical Toxicology- Drug antagonism and drug synergism
jasmine698677
?
PPT
rate of reaction and the factors affecting it.ppt
MOLATELOMATLEKE
?
PPTX
(Normal Mechanism)physiology of labour.pptx
DavidSalman2
?
PPTX
Cyclotron_Presentation_theory, designMSc.pptx
MohamedMaideen12
?
PPTX
Chromosomal Aberration (Mutation) and Classification.
Dr-Haseeb Zubair Tagar
?
PPTX
FACTORS PREDISPOSING TO MICROBIAL PATHOGENICITY.pptx
Remya M S
?
PDF
Can Consciousness Live and Travel Through Quantum AI?
Saikat Basu
?
PDF
The First Detection of Molecular Activity in the Largest Known Oort Cloud Com...
S¨¦rgio Sacani
?
PPTX
Operating_a_Microscope_Presentation.pptx
MerylVelardeCapapas
?
PDF
Global Health Initiatives: Lessons from Successful Programs (www.kiu.ac.ug)
publication11
?
PDF
Cultivation and goods of microorganisms-4.pdf
adimondal300
?
PPTX
Philippine_Literature_Precolonial_Period_Designed.pptx
josedalagdag5
?
PDF
Electromagnetism 3.pdf - AN OVERVIEW ON ELECTROMAGNETISM
kaustavsahoo94
?
PDF
seedproductiontechniques-210522130809.pdf
sr5566mukku
?
PPTX
MATTER.pptxBYUHNJMIK,O.LBYHNUJMIK,OL.PN8M9,0L.-;/
emelitamaranga
?
PDF
Agentic AI: Autonomy, Accountability, and the Algorithmic Society
vs5qkn48td
?
PPTX
PROTOCOL PREsentation.pptx 12345567890q0
jeevika54
?
Disk Evolution Study Through Imaging of Nearby Young Stars (DESTINYS): Eviden...
S¨¦rgio Sacani
?
Microbes Involved In Malaria, Microbiology
UMME54
?
HOW TO DEAL WITH THREATS FROM THE FORCES OF NATURE FROM OUTER SPACE.pdf
Faga1939
?
Clinical Toxicology- Drug antagonism and drug synergism
jasmine698677
?
rate of reaction and the factors affecting it.ppt
MOLATELOMATLEKE
?
(Normal Mechanism)physiology of labour.pptx
DavidSalman2
?
Cyclotron_Presentation_theory, designMSc.pptx
MohamedMaideen12
?
Chromosomal Aberration (Mutation) and Classification.
Dr-Haseeb Zubair Tagar
?
FACTORS PREDISPOSING TO MICROBIAL PATHOGENICITY.pptx
Remya M S
?
Can Consciousness Live and Travel Through Quantum AI?
Saikat Basu
?
The First Detection of Molecular Activity in the Largest Known Oort Cloud Com...
S¨¦rgio Sacani
?
Operating_a_Microscope_Presentation.pptx
MerylVelardeCapapas
?
Global Health Initiatives: Lessons from Successful Programs (www.kiu.ac.ug)
publication11
?
Cultivation and goods of microorganisms-4.pdf
adimondal300
?
Philippine_Literature_Precolonial_Period_Designed.pptx
josedalagdag5
?
Electromagnetism 3.pdf - AN OVERVIEW ON ELECTROMAGNETISM
kaustavsahoo94
?
seedproductiontechniques-210522130809.pdf
sr5566mukku
?
MATTER.pptxBYUHNJMIK,O.LBYHNUJMIK,OL.PN8M9,0L.-;/
emelitamaranga
?
Agentic AI: Autonomy, Accountability, and the Algorithmic Society
vs5qkn48td
?
PROTOCOL PREsentation.pptx 12345567890q0
jeevika54
?
Ad

2018 global ai_bootcamp_seoul_HomeNavi(Reinforcement Learning, AI)