7. Reinforcement Learning is de?ned
not by characterizing learning methods,
but by characterizing a learning problem.
???? ???? ???? ?? ??? ???? ????? ??????.
???? (Reinforcement Learning) ??
Sutton
8. 1. Fly stunt manoeuvres in a helicopter
2. Defeat the world champion at Backgammon
3. Manage an investment portfolio
4. Control a power station
5. Make a humanoid robot walk
6. Play many di?erent Atari games better than humans
???? (Reinforcement Learning) ??
??? ?? ?? ??