Index
1. Introduction
2. Method
- 2.1 The Deep Q-Learning Algorithm
- 2.2 Adaptation of the Code for the Multiplayer Paradigm
- 2.3 Game Selection
- 2.4 Reward Schemes
- 2.4.1 Score More than the Opponent (Fully Competitive)
- 2.4.2 Losing the Ball Penalizes Both Players (Fully Cooperative)
- 2.4.3 Transition Between Cooperation and Competition
- 2.5 Training Procedure
- 2.6 Collecting the Game Statistics
3. Results
- 3.1 Emergence of Competitive Agents
- 3.2 Emergence of Collaborative Agents
- 3.3 Progression from Competition to Collaboration