狠狠撸

pyconkr 2018 RL_Adventure : Rainbow(value based Reinforcement Learning)

1 like245 views

Yechan(Paul) Kim

该文档介绍了 Rainbow 算法，结合了多个强化学习方法，如双重深度 Q 学习、对偶 DQN、多步 TD 和优先经验重放。文档中还提供了相关代码和超参数设置，以及对实验环境的说明。多个辅助链接指向相关的学术论文。

RL Adventure
RAINBOW
???
1

INDEX
1. Environment
2. Before RAINBOW
DDQN(Double Deep Q-Learning)
Dueling DQN
Multi-Step TD(Temporal Difference)
PER(Prioritized Experience Replay)
Noisy Network
Categorical DQN(C51)
3. RAINBOW
4. RAINBOW - Code
2

OPENAI GYM
HTTPS://GYM.OPENAI.COM
HTTPS://GITHUB.COM/OPENAI/GYM
1. EXPERIMENT ENVIRONMENT
3

2. BEFORE RAINBOW : DOUBLE DQN
4
HTTPS://ARXIV.ORG/ABS/1509.06461

2. BEFORE RAINBOW : DUELING DQN
HTTPS://ARXIV.ORG/ABS/1511.06581
5

2. BEFORE RAINBOW : DUELING DQN
6
HTTPS://ARXIV.ORG/ABS/1511.06581

2. BEFORE RAINBOW : MULTI-STEP LEARNING
7

2. BEFORE RAINBOW : PER
HTTPS://ARXIV.ORG/ABS/1511.05952
8

2. BEFORE RAINBOW : NOISY NETWORK
HTTPS://ARXIV.ORG/ABS/1706.10295
9

2. BEFORE RAINBOW : NOISY NETWORK
HTTPS://ARXIV.ORG/ABS/1706.10295
10

2. BEFORE RAINBOW : CATEGORICAL DQN(C51)
HTTPS://ARXIV.ORG/PDF/1707.06887.PDF
11

2. BEFORE RAINBOW : CATEGORICAL DQN(C51)
HTTPS://ARXIV.ORG/PDF/1707.06887.PDF
12

RAINBOW
3. RAINBOW
13

3. RAINBOW
RAINBOW
DDQN(Double Deep Q-Learning)
+
Dueling DQN
+
Multi-Step TD(Temporal Difference)
+
PER(Prioritized Experience Replay)
+
Noisy Network
+
Categorical DQN(C51)
14

3. RAINBOW
15

3. RAINBOW
HYPERPARAMETERS
16

3. RAINBOW
17

3. RAINBOW
18

PONG
4. RAINBOW - CODE
19

NOISY LINEAR
4. RAINBOW - CODE
20

DUELING + NOISY + C51
4. RAINBOW - CODE
21

PROJECTION STEP
4. RAINBOW - CODE
22

CROSS-ENTROPY LOSS
4. RAINBOW - CODE
23

TEST
4. RAINBOW - CODE
24

Thank you
RAINBOW
???
25

Ad

Recommended

PDF

人人网开发一站式体验zhen chen

?

PDF

????? LV&A ??? Navigation AgentYechan(Paul) Kim

?

PDF

Neural module NetworkYechan(Paul) Kim

?

PDF

Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Ne...Yechan(Paul) Kim

?

PDF

Multiagent Cooperative and Competition with Deep Reinforcement LearningYechan(Paul) Kim

?

PDF

2018 global ai_bootcamp_seoul_HomeNavi(Reinforcement Learning, AI)Yechan(Paul) Kim

?

PDF

3D Environment : HomeNavigationYechan(Paul) Kim

?

PPTX

Diversity is all you need(DIAYN) : Learning Skills without a Reward FunctionYechan(Paul) Kim

?

PDF

2024 Trend Updates: What Really Works In SEO & Content MarketingSearch Engine Journal

?

PDF

Storytelling For The Web: Integrate Storytelling in your Design ProcessChiara Aliotta

?

PDF

Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...OECD Directorate for Financial and Enterprise Affairs

?

PDF

How to Leverage AI to Boost Employee Wellness - Lydia Di Francesco - SocialHR...SocialHRCamp

?

PDF

2024 State of Marketing Report – by HubspotMarius Sescu

?

PDF

Everything You Need To Know About ChatGPTExpeed Software

?

PDF

Product Design Trends in 2024 | Teenage EngineeringsPixeldarts

?

PDF

How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow

?

PDF

AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork

?

PDF

Skeleton Culture CodeSkeleton Technologies

?

PDF

PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley

?

PDF

Content Methodology: A Best Practices Report (Webinar)contently

?

PPTX

How to Prepare For a Successful Job Search for 2024Albert Qian

?

PDF

Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)

?

PDF

Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal

?

PDF

5 Public speaking tips from TED - Visualized summarySpeakerHub

?

PDF

ChatGPT and the Future of Work - Clark Boyd Clark Boyd

?

PDF

Getting into the tech field. what next Tessa Mero

?

PDF

Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray

?

PDF

How to have difficult conversations Rajiv Jayarajah, MAppComm, ACC

?

More Related Content

Featured (20)

PDF

2024 Trend Updates: What Really Works In SEO & Content MarketingSearch Engine Journal

?

PDF

Storytelling For The Web: Integrate Storytelling in your Design ProcessChiara Aliotta

?

PDF

Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...OECD Directorate for Financial and Enterprise Affairs

?

PDF

How to Leverage AI to Boost Employee Wellness - Lydia Di Francesco - SocialHR...SocialHRCamp

?

PDF

2024 State of Marketing Report – by HubspotMarius Sescu

?

PDF

Everything You Need To Know About ChatGPTExpeed Software

?

PDF

Product Design Trends in 2024 | Teenage EngineeringsPixeldarts

?

PDF

How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow

?

PDF

AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork

?

PDF

Skeleton Culture CodeSkeleton Technologies

?

PDF

PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley

?

PDF

Content Methodology: A Best Practices Report (Webinar)contently

?

PPTX

How to Prepare For a Successful Job Search for 2024Albert Qian

?

PDF

Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)

?

PDF

Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal

?

PDF

5 Public speaking tips from TED - Visualized summarySpeakerHub

?

PDF

ChatGPT and the Future of Work - Clark Boyd Clark Boyd

?

PDF

Getting into the tech field. what next Tessa Mero

?

PDF

Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray

?

PDF

How to have difficult conversations Rajiv Jayarajah, MAppComm, ACC

?

2024 Trend Updates: What Really Works In SEO & Content MarketingSearch Engine Journal

?

Storytelling For The Web: Integrate Storytelling in your Design ProcessChiara Aliotta

?

Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...OECD Directorate for Financial and Enterprise Affairs

?

How to Leverage AI to Boost Employee Wellness - Lydia Di Francesco - SocialHR...SocialHRCamp

?

2024 State of Marketing Report – by HubspotMarius Sescu

?

Everything You Need To Know About ChatGPTExpeed Software

?

Product Design Trends in 2024 | Teenage EngineeringsPixeldarts

?

How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow

?

AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork

?

Skeleton Culture CodeSkeleton Technologies

?

PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley

?

Content Methodology: A Best Practices Report (Webinar)contently

?

How to Prepare For a Successful Job Search for 2024Albert Qian

?

Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)

?

Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal

?

5 Public speaking tips from TED - Visualized summarySpeakerHub

?

ChatGPT and the Future of Work - Clark Boyd Clark Boyd

?

Getting into the tech field. what next Tessa Mero

?

Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray

?

How to have difficult conversations Rajiv Jayarajah, MAppComm, ACC

?

pyconkr 2018 RL_Adventure : Rainbow(value based Reinforcement Learning)

1. RL Adventure RAINBOW ??? 1

2. INDEX 1. Environment 2. Before RAINBOW DDQN(Double Deep Q-Learning) Dueling DQN Multi-Step TD(Temporal Difference) PER(Prioritized Experience Replay) Noisy Network Categorical DQN(C51) 3. RAINBOW 4. RAINBOW - Code 2

3. OPENAI GYM HTTPS://GYM.OPENAI.COM HTTPS://GITHUB.COM/OPENAI/GYM 1. EXPERIMENT ENVIRONMENT 3

4. 2. BEFORE RAINBOW : DOUBLE DQN 4 HTTPS://ARXIV.ORG/ABS/1509.06461

5. 2. BEFORE RAINBOW : DUELING DQN HTTPS://ARXIV.ORG/ABS/1511.06581 5

6. 2. BEFORE RAINBOW : DUELING DQN 6 HTTPS://ARXIV.ORG/ABS/1511.06581

7. 2. BEFORE RAINBOW : MULTI-STEP LEARNING 7

8. 2. BEFORE RAINBOW : PER HTTPS://ARXIV.ORG/ABS/1511.05952 8

9. 2. BEFORE RAINBOW : NOISY NETWORK HTTPS://ARXIV.ORG/ABS/1706.10295 9

10. 2. BEFORE RAINBOW : NOISY NETWORK HTTPS://ARXIV.ORG/ABS/1706.10295 10

11. 2. BEFORE RAINBOW : CATEGORICAL DQN(C51) HTTPS://ARXIV.ORG/PDF/1707.06887.PDF 11

12. 2. BEFORE RAINBOW : CATEGORICAL DQN(C51) HTTPS://ARXIV.ORG/PDF/1707.06887.PDF 12

13. RAINBOW 3. RAINBOW 13

14. 3. RAINBOW RAINBOW DDQN(Double Deep Q-Learning) + Dueling DQN + Multi-Step TD(Temporal Difference) + PER(Prioritized Experience Replay) + Noisy Network + Categorical DQN(C51) 14

15. 3. RAINBOW 15

16. 3. RAINBOW HYPERPARAMETERS 16

17. 3. RAINBOW 17

18. 3. RAINBOW 18

19. PONG 4. RAINBOW - CODE 19

20. NOISY LINEAR 4. RAINBOW - CODE 20

21. DUELING + NOISY + C51 4. RAINBOW - CODE 21

22. PROJECTION STEP 4. RAINBOW - CODE 22

23. CROSS-ENTROPY LOSS 4. RAINBOW - CODE 23

24. TEST 4. RAINBOW - CODE 24

25. Thank you RAINBOW ??? 25