3. ??
??? ???
City University of New York -Baruch College
Data Science ??
ConnexionAI ???
Freelancer Data Scientist
?????? ???? ???
Github:
https://github.com/wonseokjung
Facebook:
https://www.facebook.com/ws.jung.798
Blog:
https://wonseokjung.github.io/
4. 1. Dynamic Programming
a. Policy iteration
b. Value iteration
2. Monte Carlo method
3. Temporal-Difference Learning
a. Sarsa
b. Q-learning
4. ??? ????? ??? ?? ? ????? ?? ??
5. DQN? ??? ???? ????? ???
??
5. 1. Dynamic Programming
a. Policy iteration
b. Value iteration
2. Monte Carlo method
3. Temporal-Difference Learning
a. Sarsa
b. Q-learning
4. ????? ?? ?? ? ??? ????? ??? ??
5. DQN? ??? ???? ????? ???
Model-free
Model-based
Deeplearning?
+?
RL
??
6. 1. Dynamic Programming
a. Policy iteration
b. Value iteration
2. Monte Carlo method
3. Temporal-Difference Learning
a. Sarsa
b. Q-learning
4. ????? ?? ?? ? ??? ????? ??? ??
5. DQN? ??? ???? ????? ???
Grid world
??
40. Policy iteration- Policy Evaluation
Update Rule? ???? Evaluation? ??.
Value update
Policy Transition
Probability
Reward Next State?
estimated value
138. References:
* Reinforcement Learning: An Introduction Richard S. Sutton and Andrew G. Barto Second Edition, in progress MIT Press, Cambridge,
MA, 2017
* https://github.com/rlcode/reinforcement-learning-kr