ºÝºÝߣ

ºÝºÝߣShare a Scribd company logo
December 18 Fri., 2015, 09:10-09:30, Regular Session: Modeling 1, Frb04.3 @ 802
Transition ?Models ?of ?Equilibrium ?
Assessment ?in ?Bayesian ?Game
Kiminao Kogiso
University of Electro-Communications
Tokyo, Japan
The 54 Conference on Decision and Control
Osaka International Convention Center, Osaka, Japan
December 15 to 18, 2015
Supported by
JSPS Grant-in-Aid for Challenging Exploratory Research
2014 to 2016
Outline
2
Introduction ?
Static ?Bayesian ?Game ?
Novel ?Form ?in ?Bayesian ?Nash ?Equilibrium ?
Dynamics ?in ?Equilibrium ?Assessment ?
Simulation ?
Conclusion
Introduction
3
Strategic game enabling to consider uncertainties in player¡¯s decisions.
player: a reasonable decision maker
action: what a player chooses
utility: a player¡¯s preference over the actions
type: a label of player¡¯s private valuation (what the player really feels)
belief: a probability distribution over the types
(degree of feeling, tendency, proclivity,¡­)
Static Bayesian Game[1]
[1] Harsanyi, 1967. [2] Alpcan and Basar, et al., 2011, 2013. [3] Roy, et al., 2010. [4] Liu, et al., 2006. [5] Akkarajitsakul, et al., 2011.
A Bayesian game used in engineering problems to analyze a Bayesian
Nash equilibrium or to design a game mechanism.
network security[2,3], intrusion detection[4,5,6], belief learning[7]
electricity pricing[8,9], mechanism design[10]
[6] Sedjelmachi, et al., 2014, 2015. [7] Nachbar, 2008. [8] Li, et al., 2011, 2014. [9] Yang, et al., 2013. [10] Tao, et al., 2015.
Introduction
4
Insufficient tools and concepts[11]
Bayesian Nash equilibrium plays key roles in game analysis & design.
equilibrium analysis: for given belief, find a Bayesian Nash Equilibrium(BNE).
belief learning: for given BNE, find a corresponding belief.
mechanism design: for given utility, find rules to achieve a desired BNE.
Objective of this talk
Derive a dynamical state-space model whose state involves a BNE.
derive a novel condition related to the BNE,
discover a map (discrete-time system) defined by the novel condition,
confirm a time response of the map.
[11] Powell, 2011.
Challenge: prepare tools & concepts to apply our model-based fashion
to analysis and design of the game.
Bayesian Game
Player set
Action set
Type set
Utility
Strategy (mixed)
Belief
Static Bayesian Game: General
5
Two-player two-action Bayesian game w/ two types
G(N, A, ?, u, ?, S)
N := {1, 2}
A := A1 ? A2
? := ?1 ? ?2
u := (u1, u2)
? := (?1, ?2)
S := (S1, S2)
ai 2 Ai := {a, ?a} 8i 2 N
?i 2 ?i := {?, ??} 8i 2 N
?i 2 ?(?i) 8i 2 N
Si : ?i ! ?(Ai) 8i 2 N
si 2 Si(?i) 8i 2 N
?(X) : a probability distribution over a finite set X
Ui(?i, ? i) :=
?
ui(a, a, ?i, ? i) ui(a, ?a, ?i, ? i)
ui(?a, a, ?i, ? i) ui(?a, ?a, ?i, ? i)
: utility matrix8i 2 N, 8? 2 ?
ui : A ? ? ! < 8i 2 N
i 2 N
Static Bayesian Game: Example
6
Service of tennis
2, 2 0, 1
1, 21, 1
flat
spin
flat spin
0, 1 1, 2
0, 11, 2
flat
spin
flat spin
sideline
1, 0 1, 1
2, 00, 1
flat
spin
flat spin
1, 3 1, 2
0, 32, 2
flat
spin
flat spin
centerline
s1(a|?)
s1(?a|?)
s1(?a|??)
s1(a|??)
s2(a|??) s2(?a|??)s2(?a|?)s2(a|?)
center line ? side line ??
???
?a
?a ?a
?a
?a ?a
?a?aa
a a
a
?1(?)
?1(??)
?2(??)?2(?)
a
a a
a
type
belief
Bayesian Nash Equilibrium
7
Equilibrium assessment
definitions of Bayesian Nash Equilibrium(BNE)
using an ex-ante expected utility:
using a best response to opponent strategy:
EUi(si, s i) EUi(s0
, s i) 8s0
i 2 Si, s0
i 6= si
is denoted as the Bayesian Nash equilibrium.
A strategy profile , satisfying , is also a BNE.s = (si, s i)
Given a prior common probability , for any , the strategy satisfyingi 2 N sp(?)
si 2 BRi(s i, ?) 8i 2 N
the pair of considered as key variables of the Bayesian game.
Equilibrium Assessment : a pair of a belief and the corresponding BNE.(??, ?s)
(?, s)
equilibrium analysis[10]: find a BNE .?s9 ??,
[10] Y. Shoham and K. Leyton-Brown, Multiagent Systems, Cambridge University Press, 2009.
then the pair is an Equilibrium Assessment, where ,? :=
?
1 1
?
(??, ?s)
Novel Form Satisfying BNE
8
If the game satisfies the following condition (simultaneous polynomial in ):
Sufficient condition to be BNE
Lemma
8?i 2 ?i, 8i 2 N
G
??i(?s i, ?i) (?i)p(??) = 0
?i(?s i, ?i) :=
?
Ui(?i, ?)?s i(?) Ui(?i, ??)?s i(??)
?
,
(?) :=
?
1 0 0 0
0 1 0 0 , (??) :=
?
0 0 1 0
0 0 0 1 .
idea: derived from KKT condition of BNE by cancelation of Lagrangian variables.
point: # of the polynomials: 4, # of the variables: 6; D.O.F. in determining their values.
note: a BNE (mixed strategy) holds the above equation, but some of pure strategy BNEs
do not hold it.
??
all of EAs
Discover Dynamics!
9
Map from EA to EA
Idea to derive dynamics in EA
all of EAs
?
EA
(??, ?s)
satisfying
the Lemma
all of EAs
?
EA
(?? + ??, ?s + ?s)
satisfying
the Lemma
Given an initial EA, if there exists such that the game satisfies
the following condition w.r.t. utility matrices: ,
Dynamics in Equilibrium Assessment
10
Main result
Theorem
?
1 1
?
Ui(?i, ?)
?
1
1 1
= 0
?
1 1
?
Ui(?i, ??)
?
2
1 2
= 0
8?i 2 ?i8i 2 N
??(k + 1) = diag(A1, A2)??(k)
?s(k + 1) = A (ci(k))?si(k)
ci(k) :=
??i(?i, k + 1)
??i(?i, k)
, and is a row stochastic matrix.Ai 2 <2?2
8i 2 N
= [ 1 2]T
2 <2
then a nonlinear autonomous system in terms of the equilibrium assessment:
transfers from an EA to another EA , where(??(k), ?s(k)) (??(k + 1), ?s(k + 1))
ci(k) ! 1
A (1) = I
ci(k) :=
??i(?i, k + 1)
??i(?i, k)??(k + 1) = diag(A1, A2)??(k) ?s(k + 1) = A (ci(k))?si(k)
??(k)
??(k + 1)
stable linear system: time-varying system:
¡¤
?s(k)
??(k)
Simulation
11
Trajectory of equilibrium assessment
0.3
0.4
0.5
0.6
0.7
10
0.2
0.4
0.6
0.8
probabilityprobability
step
0 1 2 3 4 5 6 7 8 9
100 1 2 3 4 5 6 7 8 9
s1( )a|¦È?? s1( )a|¦È?
?
s1( )a|¦È
?
? s1( )a|¦È
??
s2( )a|¦È?? s2( )a|¦È?
?
s2( )a|¦È
?
? s2( )a|¦È
??
0.4
0.6
0.8
1.0
1.2
1.4
1.6
1.8
the proposed model
the computation of
the best respose (4)
10
expectedutilityvalue
step
0 1 2 3 4 5 6 7 8 9
EU1
EU2
0.4
0.6
0.8
1.0
1.2
1.4
1.6
1.8
10
step
0 1 2 3 4 5 6 7 8 9
valueofci()¦Èi
c ( )1 ¦È?
c ( )1 ¦È
?
c ( )2 ¦È?
c ( )2 ¦È
?
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
10
probability
step
0 1 2 3 4 5 6 7 8 9
1( )¦Ç ¦È?
1( )¦Ç ¦È
?
2( )¦Ç ¦È?
2( )¦Ç ¦È
?
belief strategy
c expected
utility
Conclusion
12
Introduction
Static Bayesian Game
two-players two-actions game with two-types
New Form in Bayesian Nash Equilib.
polynomial conditions in equilibrium assessment
Dynamics in Equilibrium Assessment
discrete-time autonomous time-varying system
convergence of the EA (stability)
Simulation
confirms states updated become EA and converge.
Future works
estimate player¡¯s belief for a given BNE, and
realize a control-theoretic mechanism design method.
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
10
probability
step
0 1 2 3 4 5 6 7 8 9
1( )¦Ç ¦È?
1( )¦Ç ¦È
?
2( )¦Ç ¦È?
2( )¦Ç ¦È
?
belief
0.3
0.4
0.5
0.6
0.7
10
0.2
0.4
0.6
0.8
probabilityprobability
step
0 1 2 3 4 5 6 7 8 9
100 1 2 3 4 5 6 7 8 9
s1( )a|¦È?? s1( )a|¦È?
?
s1( )a|¦È
?
? s1( )a|¦È
??
s2( )a|¦È?? s2( )a|¦È?
?
s2( )a|¦È
?
? s2( )a|¦È
??
strategy
??(k + 1) = diag(A1, A2)??(k)
?s(k + 1) = A (ci(k))?si(k)
Dynamics in EA:

More Related Content

Similar to Transition Models of Equilibrium Assessment in Bayesian Game (20)

Beyond Nash Equilibrium - Correlated Equilibrium and Evolutionary Equilibrium
Beyond Nash Equilibrium - Correlated Equilibrium and Evolutionary Equilibrium Beyond Nash Equilibrium - Correlated Equilibrium and Evolutionary Equilibrium
Beyond Nash Equilibrium - Correlated Equilibrium and Evolutionary Equilibrium
Jie Bao
?
CAGT-IST Student Presentations
CAGT-IST Student Presentations CAGT-IST Student Presentations
CAGT-IST Student Presentations
Prithviraj (Raj) Dasgupta
?
IIT JAM Mathematical Statistics - MS 2022 | Sourav Sir's Classes
IIT JAM Mathematical Statistics - MS 2022 | Sourav Sir's ClassesIIT JAM Mathematical Statistics - MS 2022 | Sourav Sir's Classes
IIT JAM Mathematical Statistics - MS 2022 | Sourav Sir's Classes
SOURAV DAS
?
The Skeptic¡¯s Argumentation Game or: Well-Founded Explanations for Mere Mortals
The Skeptic¡¯s Argumentation Game or: Well-Founded Explanations for Mere MortalsThe Skeptic¡¯s Argumentation Game or: Well-Founded Explanations for Mere Mortals
The Skeptic¡¯s Argumentation Game or: Well-Founded Explanations for Mere Mortals
Bertram Lud?scher
?
lect1207
lect1207lect1207
lect1207
webuploader
?
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion!
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion!Games, Queries, and Argumentation Frameworks: Time for a Family Reunion!
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion!
Bertram Lud?scher
?
CCS 3102 Lecture 2_ Mathematical foundations.pdf
CCS 3102 Lecture 2_ Mathematical foundations.pdfCCS 3102 Lecture 2_ Mathematical foundations.pdf
CCS 3102 Lecture 2_ Mathematical foundations.pdf
JosephKariuki46
?
Influencing Visual Judgment through Affective Priming
Influencing Visual Judgment through Affective PrimingInfluencing Visual Judgment through Affective Priming
Influencing Visual Judgment through Affective Priming
Lane Harrison
?
nips-gg
nips-ggnips-gg
nips-gg
webuploader
?
Supervised sequential pattern mining for identifying important patterns of pl...
Supervised sequential pattern mining for identifying important patterns of pl...Supervised sequential pattern mining for identifying important patterns of pl...
Supervised sequential pattern mining for identifying important patterns of pl...
Rory Bunker
?
Ubisoft
UbisoftUbisoft
Ubisoft
GIAF
?
Cooperative Game Theory
Cooperative Game TheoryCooperative Game Theory
Cooperative Game Theory
SSA KPI
?
Accelerating Metropolis Hastings with Lightweight Inference Compilation
Accelerating Metropolis Hastings with Lightweight Inference CompilationAccelerating Metropolis Hastings with Lightweight Inference Compilation
Accelerating Metropolis Hastings with Lightweight Inference Compilation
Feynman Liang
?
Probability Distribution
Probability DistributionProbability Distribution
Probability Distribution
Long Beach City College
?
A System of Estimators of the Population Mean under Two-Phase Sampling in Pre...
A System of Estimators of the Population Mean under Two-Phase Sampling in Pre...A System of Estimators of the Population Mean under Two-Phase Sampling in Pre...
A System of Estimators of the Population Mean under Two-Phase Sampling in Pre...
Premier Publishers
?
Catalan Tau Collocation for Numerical Solution of 2-Dimentional Nonlinear Par...
Catalan Tau Collocation for Numerical Solution of 2-Dimentional Nonlinear Par...Catalan Tau Collocation for Numerical Solution of 2-Dimentional Nonlinear Par...
Catalan Tau Collocation for Numerical Solution of 2-Dimentional Nonlinear Par...
IJERA Editor
?
Operations Research Situations and Games
Operations Research Situations and GamesOperations Research Situations and Games
Operations Research Situations and Games
SSA KPI
?
PCB_Lect02_Pairwise_allign (1).pdf
PCB_Lect02_Pairwise_allign (1).pdfPCB_Lect02_Pairwise_allign (1).pdf
PCB_Lect02_Pairwise_allign (1).pdf
ssusera1eccd
?
An Analytical Study of Puzzle Selection Strategies for the ESP Game
An Analytical Study of Puzzle Selection Strategies for the ESP GameAn Analytical Study of Puzzle Selection Strategies for the ESP Game
An Analytical Study of Puzzle Selection Strategies for the ESP Game
Academia Sinica
?
STT802project-writeup-Final (1)
STT802project-writeup-Final (1)STT802project-writeup-Final (1)
STT802project-writeup-Final (1)
James P. Regan II
?
Beyond Nash Equilibrium - Correlated Equilibrium and Evolutionary Equilibrium
Beyond Nash Equilibrium - Correlated Equilibrium and Evolutionary Equilibrium Beyond Nash Equilibrium - Correlated Equilibrium and Evolutionary Equilibrium
Beyond Nash Equilibrium - Correlated Equilibrium and Evolutionary Equilibrium
Jie Bao
?
IIT JAM Mathematical Statistics - MS 2022 | Sourav Sir's Classes
IIT JAM Mathematical Statistics - MS 2022 | Sourav Sir's ClassesIIT JAM Mathematical Statistics - MS 2022 | Sourav Sir's Classes
IIT JAM Mathematical Statistics - MS 2022 | Sourav Sir's Classes
SOURAV DAS
?
The Skeptic¡¯s Argumentation Game or: Well-Founded Explanations for Mere Mortals
The Skeptic¡¯s Argumentation Game or: Well-Founded Explanations for Mere MortalsThe Skeptic¡¯s Argumentation Game or: Well-Founded Explanations for Mere Mortals
The Skeptic¡¯s Argumentation Game or: Well-Founded Explanations for Mere Mortals
Bertram Lud?scher
?
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion!
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion!Games, Queries, and Argumentation Frameworks: Time for a Family Reunion!
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion!
Bertram Lud?scher
?
CCS 3102 Lecture 2_ Mathematical foundations.pdf
CCS 3102 Lecture 2_ Mathematical foundations.pdfCCS 3102 Lecture 2_ Mathematical foundations.pdf
CCS 3102 Lecture 2_ Mathematical foundations.pdf
JosephKariuki46
?
Influencing Visual Judgment through Affective Priming
Influencing Visual Judgment through Affective PrimingInfluencing Visual Judgment through Affective Priming
Influencing Visual Judgment through Affective Priming
Lane Harrison
?
Supervised sequential pattern mining for identifying important patterns of pl...
Supervised sequential pattern mining for identifying important patterns of pl...Supervised sequential pattern mining for identifying important patterns of pl...
Supervised sequential pattern mining for identifying important patterns of pl...
Rory Bunker
?
Ubisoft
UbisoftUbisoft
Ubisoft
GIAF
?
Cooperative Game Theory
Cooperative Game TheoryCooperative Game Theory
Cooperative Game Theory
SSA KPI
?
Accelerating Metropolis Hastings with Lightweight Inference Compilation
Accelerating Metropolis Hastings with Lightweight Inference CompilationAccelerating Metropolis Hastings with Lightweight Inference Compilation
Accelerating Metropolis Hastings with Lightweight Inference Compilation
Feynman Liang
?
A System of Estimators of the Population Mean under Two-Phase Sampling in Pre...
A System of Estimators of the Population Mean under Two-Phase Sampling in Pre...A System of Estimators of the Population Mean under Two-Phase Sampling in Pre...
A System of Estimators of the Population Mean under Two-Phase Sampling in Pre...
Premier Publishers
?
Catalan Tau Collocation for Numerical Solution of 2-Dimentional Nonlinear Par...
Catalan Tau Collocation for Numerical Solution of 2-Dimentional Nonlinear Par...Catalan Tau Collocation for Numerical Solution of 2-Dimentional Nonlinear Par...
Catalan Tau Collocation for Numerical Solution of 2-Dimentional Nonlinear Par...
IJERA Editor
?
Operations Research Situations and Games
Operations Research Situations and GamesOperations Research Situations and Games
Operations Research Situations and Games
SSA KPI
?
PCB_Lect02_Pairwise_allign (1).pdf
PCB_Lect02_Pairwise_allign (1).pdfPCB_Lect02_Pairwise_allign (1).pdf
PCB_Lect02_Pairwise_allign (1).pdf
ssusera1eccd
?
An Analytical Study of Puzzle Selection Strategies for the ESP Game
An Analytical Study of Puzzle Selection Strategies for the ESP GameAn Analytical Study of Puzzle Selection Strategies for the ESP Game
An Analytical Study of Puzzle Selection Strategies for the ESP Game
Academia Sinica
?
STT802project-writeup-Final (1)
STT802project-writeup-Final (1)STT802project-writeup-Final (1)
STT802project-writeup-Final (1)
James P. Regan II
?

More from Kiminao Kogiso (10)

Cyber-Security Enhancements of Networked Control Systems Using Homomorphic En...
Cyber-Security Enhancements of Networked Control Systems Using Homomorphic En...Cyber-Security Enhancements of Networked Control Systems Using Homomorphic En...
Cyber-Security Enhancements of Networked Control Systems Using Homomorphic En...
Kiminao Kogiso
?
Player's Belief Estimation for Super Human Sports
Player's Belief Estimation for Super Human SportsPlayer's Belief Estimation for Super Human Sports
Player's Belief Estimation for Super Human Sports
Kiminao Kogiso
?
Hybrid Nonlinear Model of McKibben Pneumatic Artificial Muscle Systems Incorp...
Hybrid Nonlinear Model of McKibben Pneumatic Artificial Muscle Systems Incorp...Hybrid Nonlinear Model of McKibben Pneumatic Artificial Muscle Systems Incorp...
Hybrid Nonlinear Model of McKibben Pneumatic Artificial Muscle Systems Incorp...
Kiminao Kogiso
?
Controller encryption using RSA public-key encryption scheme (Asian Control C...
Controller encryption using RSA public-key encryption scheme (Asian Control C...Controller encryption using RSA public-key encryption scheme (Asian Control C...
Controller encryption using RSA public-key encryption scheme (Asian Control C...
Kiminao Kogiso
?
Application of ElGamal Encryption Scheme to Control System for Security Enhan...
Application of ElGamal Encryption Scheme to Control System for Security Enhan...Application of ElGamal Encryption Scheme to Control System for Security Enhan...
Application of ElGamal Encryption Scheme to Control System for Security Enhan...
Kiminao Kogiso
?
Considerations on model predictive control of McKibben pneumatic artificial m...
Considerations on model predictive control of McKibben pneumatic artificial m...Considerations on model predictive control of McKibben pneumatic artificial m...
Considerations on model predictive control of McKibben pneumatic artificial m...
Kiminao Kogiso
?
Estimating Player's Belief in Bayesian Game by Feedback Control
Estimating Player's Belief in Bayesian Game by Feedback ControlEstimating Player's Belief in Bayesian Game by Feedback Control
Estimating Player's Belief in Bayesian Game by Feedback Control
Kiminao Kogiso
?
Modeling of McKibben pneumatic artificial muscle system using pressure-depend...
Modeling of McKibben pneumatic artificial muscle system using pressure-depend...Modeling of McKibben pneumatic artificial muscle system using pressure-depend...
Modeling of McKibben pneumatic artificial muscle system using pressure-depend...
Kiminao Kogiso
?
Experimental Validation of McKibben Pneumatic Artificial Muscle Model
Experimental Validation of McKibben Pneumatic Artificial Muscle ModelExperimental Validation of McKibben Pneumatic Artificial Muscle Model
Experimental Validation of McKibben Pneumatic Artificial Muscle Model
Kiminao Kogiso
?
Identification Procedure for McKibben Pneumatic Artificial Muscle Systems
Identification Procedure for McKibben Pneumatic Artificial Muscle SystemsIdentification Procedure for McKibben Pneumatic Artificial Muscle Systems
Identification Procedure for McKibben Pneumatic Artificial Muscle Systems
Kiminao Kogiso
?
Cyber-Security Enhancements of Networked Control Systems Using Homomorphic En...
Cyber-Security Enhancements of Networked Control Systems Using Homomorphic En...Cyber-Security Enhancements of Networked Control Systems Using Homomorphic En...
Cyber-Security Enhancements of Networked Control Systems Using Homomorphic En...
Kiminao Kogiso
?
Player's Belief Estimation for Super Human Sports
Player's Belief Estimation for Super Human SportsPlayer's Belief Estimation for Super Human Sports
Player's Belief Estimation for Super Human Sports
Kiminao Kogiso
?
Hybrid Nonlinear Model of McKibben Pneumatic Artificial Muscle Systems Incorp...
Hybrid Nonlinear Model of McKibben Pneumatic Artificial Muscle Systems Incorp...Hybrid Nonlinear Model of McKibben Pneumatic Artificial Muscle Systems Incorp...
Hybrid Nonlinear Model of McKibben Pneumatic Artificial Muscle Systems Incorp...
Kiminao Kogiso
?
Controller encryption using RSA public-key encryption scheme (Asian Control C...
Controller encryption using RSA public-key encryption scheme (Asian Control C...Controller encryption using RSA public-key encryption scheme (Asian Control C...
Controller encryption using RSA public-key encryption scheme (Asian Control C...
Kiminao Kogiso
?
Application of ElGamal Encryption Scheme to Control System for Security Enhan...
Application of ElGamal Encryption Scheme to Control System for Security Enhan...Application of ElGamal Encryption Scheme to Control System for Security Enhan...
Application of ElGamal Encryption Scheme to Control System for Security Enhan...
Kiminao Kogiso
?
Considerations on model predictive control of McKibben pneumatic artificial m...
Considerations on model predictive control of McKibben pneumatic artificial m...Considerations on model predictive control of McKibben pneumatic artificial m...
Considerations on model predictive control of McKibben pneumatic artificial m...
Kiminao Kogiso
?
Estimating Player's Belief in Bayesian Game by Feedback Control
Estimating Player's Belief in Bayesian Game by Feedback ControlEstimating Player's Belief in Bayesian Game by Feedback Control
Estimating Player's Belief in Bayesian Game by Feedback Control
Kiminao Kogiso
?
Modeling of McKibben pneumatic artificial muscle system using pressure-depend...
Modeling of McKibben pneumatic artificial muscle system using pressure-depend...Modeling of McKibben pneumatic artificial muscle system using pressure-depend...
Modeling of McKibben pneumatic artificial muscle system using pressure-depend...
Kiminao Kogiso
?
Experimental Validation of McKibben Pneumatic Artificial Muscle Model
Experimental Validation of McKibben Pneumatic Artificial Muscle ModelExperimental Validation of McKibben Pneumatic Artificial Muscle Model
Experimental Validation of McKibben Pneumatic Artificial Muscle Model
Kiminao Kogiso
?
Identification Procedure for McKibben Pneumatic Artificial Muscle Systems
Identification Procedure for McKibben Pneumatic Artificial Muscle SystemsIdentification Procedure for McKibben Pneumatic Artificial Muscle Systems
Identification Procedure for McKibben Pneumatic Artificial Muscle Systems
Kiminao Kogiso
?

Recently uploaded (20)

Lectureof nano 1588236675-biosensors (1).ppt
Lectureof nano 1588236675-biosensors (1).pptLectureof nano 1588236675-biosensors (1).ppt
Lectureof nano 1588236675-biosensors (1).ppt
SherifElGohary7
?
04 MAINTENANCE OF CONCRETE PAVEMENTS.ppt
04  MAINTENANCE OF CONCRETE PAVEMENTS.ppt04  MAINTENANCE OF CONCRETE PAVEMENTS.ppt
04 MAINTENANCE OF CONCRETE PAVEMENTS.ppt
sreenath seenu
?
Lessons learned when managing MySQL in the Cloud
Lessons learned when managing MySQL in the CloudLessons learned when managing MySQL in the Cloud
Lessons learned when managing MySQL in the Cloud
Igor Donchovski
?
US Patented ReGenX Generator, ReGen-X Quatum Motor EV Regenerative Accelerati...
US Patented ReGenX Generator, ReGen-X Quatum Motor EV Regenerative Accelerati...US Patented ReGenX Generator, ReGen-X Quatum Motor EV Regenerative Accelerati...
US Patented ReGenX Generator, ReGen-X Quatum Motor EV Regenerative Accelerati...
Thane Heins NOBEL PRIZE WINNING ENERGY RESEARCHER
?
15. Smart Cities Big Data, Civic Hackers, and the Quest for a New Utopia.pdf
15. Smart Cities Big Data, Civic Hackers, and the Quest for a New Utopia.pdf15. Smart Cities Big Data, Civic Hackers, and the Quest for a New Utopia.pdf
15. Smart Cities Big Data, Civic Hackers, and the Quest for a New Utopia.pdf
NgocThang9
?
TM-ASP-101-RF_Air Press manual crimping machine.pdf
TM-ASP-101-RF_Air Press manual crimping machine.pdfTM-ASP-101-RF_Air Press manual crimping machine.pdf
TM-ASP-101-RF_Air Press manual crimping machine.pdf
ChungLe60
?
google_developer_group_ramdeobaba_university_EXPLORE_PPT
google_developer_group_ramdeobaba_university_EXPLORE_PPTgoogle_developer_group_ramdeobaba_university_EXPLORE_PPT
google_developer_group_ramdeobaba_university_EXPLORE_PPT
JayeshShete1
?
UNIT 1FUNDAMENTALS OF OPERATING SYSTEMS.pptx
UNIT 1FUNDAMENTALS OF OPERATING SYSTEMS.pptxUNIT 1FUNDAMENTALS OF OPERATING SYSTEMS.pptx
UNIT 1FUNDAMENTALS OF OPERATING SYSTEMS.pptx
KesavanT10
?
CS3451-OPERATING-SYSTEM NOTES ALL123.pdf
CS3451-OPERATING-SYSTEM NOTES ALL123.pdfCS3451-OPERATING-SYSTEM NOTES ALL123.pdf
CS3451-OPERATING-SYSTEM NOTES ALL123.pdf
PonniS7
?
Env and Water Supply Engg._Dr. Hasan.pdf
Env and Water Supply Engg._Dr. Hasan.pdfEnv and Water Supply Engg._Dr. Hasan.pdf
Env and Water Supply Engg._Dr. Hasan.pdf
MahmudHasan747870
?
Engineering at Lovely Professional University (LPU).pdf
Engineering at Lovely Professional University (LPU).pdfEngineering at Lovely Professional University (LPU).pdf
Engineering at Lovely Professional University (LPU).pdf
Sona
?
Piping-and-pipeline-calculations-manual.pdf
Piping-and-pipeline-calculations-manual.pdfPiping-and-pipeline-calculations-manual.pdf
Piping-and-pipeline-calculations-manual.pdf
OMI0721
?
Frankfurt University of Applied Science urkunde
Frankfurt University of Applied Science urkundeFrankfurt University of Applied Science urkunde
Frankfurt University of Applied Science urkunde
Lisa Emerson
?
Optimization of Cumulative Energy, Exergy Consumption and Environmental Life ...
Optimization of Cumulative Energy, Exergy Consumption and Environmental Life ...Optimization of Cumulative Energy, Exergy Consumption and Environmental Life ...
Optimization of Cumulative Energy, Exergy Consumption and Environmental Life ...
J. Agricultural Machinery
?
Water Industry Process Automation & Control Monthly - March 2025.pdf
Water Industry Process Automation & Control Monthly - March 2025.pdfWater Industry Process Automation & Control Monthly - March 2025.pdf
Water Industry Process Automation & Control Monthly - March 2025.pdf
Water Industry Process Automation & Control
?
Taykon-Kalite belgeleri
Taykon-Kalite belgeleriTaykon-Kalite belgeleri
Taykon-Kalite belgeleri
TAYKON
?
How to Make an RFID Door Lock System using Arduino
How to Make an RFID Door Lock System using ArduinoHow to Make an RFID Door Lock System using Arduino
How to Make an RFID Door Lock System using Arduino
CircuitDigest
?
Syntax Directed Definitions Synthesized Attributes and Inherited Attributes
Syntax Directed Definitions  Synthesized Attributes  and  Inherited AttributesSyntax Directed Definitions  Synthesized Attributes  and  Inherited Attributes
Syntax Directed Definitions Synthesized Attributes and Inherited Attributes
GunjalSanjay
?
How to Build a Maze Solving Robot Using Arduino
How to Build a Maze Solving Robot Using ArduinoHow to Build a Maze Solving Robot Using Arduino
How to Build a Maze Solving Robot Using Arduino
CircuitDigest
?
Unit II: Design of Static Equipment Foundations
Unit II: Design of Static Equipment FoundationsUnit II: Design of Static Equipment Foundations
Unit II: Design of Static Equipment Foundations
Sanjivani College of Engineering, Kopargaon
?
Lectureof nano 1588236675-biosensors (1).ppt
Lectureof nano 1588236675-biosensors (1).pptLectureof nano 1588236675-biosensors (1).ppt
Lectureof nano 1588236675-biosensors (1).ppt
SherifElGohary7
?
04 MAINTENANCE OF CONCRETE PAVEMENTS.ppt
04  MAINTENANCE OF CONCRETE PAVEMENTS.ppt04  MAINTENANCE OF CONCRETE PAVEMENTS.ppt
04 MAINTENANCE OF CONCRETE PAVEMENTS.ppt
sreenath seenu
?
Lessons learned when managing MySQL in the Cloud
Lessons learned when managing MySQL in the CloudLessons learned when managing MySQL in the Cloud
Lessons learned when managing MySQL in the Cloud
Igor Donchovski
?
15. Smart Cities Big Data, Civic Hackers, and the Quest for a New Utopia.pdf
15. Smart Cities Big Data, Civic Hackers, and the Quest for a New Utopia.pdf15. Smart Cities Big Data, Civic Hackers, and the Quest for a New Utopia.pdf
15. Smart Cities Big Data, Civic Hackers, and the Quest for a New Utopia.pdf
NgocThang9
?
TM-ASP-101-RF_Air Press manual crimping machine.pdf
TM-ASP-101-RF_Air Press manual crimping machine.pdfTM-ASP-101-RF_Air Press manual crimping machine.pdf
TM-ASP-101-RF_Air Press manual crimping machine.pdf
ChungLe60
?
google_developer_group_ramdeobaba_university_EXPLORE_PPT
google_developer_group_ramdeobaba_university_EXPLORE_PPTgoogle_developer_group_ramdeobaba_university_EXPLORE_PPT
google_developer_group_ramdeobaba_university_EXPLORE_PPT
JayeshShete1
?
UNIT 1FUNDAMENTALS OF OPERATING SYSTEMS.pptx
UNIT 1FUNDAMENTALS OF OPERATING SYSTEMS.pptxUNIT 1FUNDAMENTALS OF OPERATING SYSTEMS.pptx
UNIT 1FUNDAMENTALS OF OPERATING SYSTEMS.pptx
KesavanT10
?
CS3451-OPERATING-SYSTEM NOTES ALL123.pdf
CS3451-OPERATING-SYSTEM NOTES ALL123.pdfCS3451-OPERATING-SYSTEM NOTES ALL123.pdf
CS3451-OPERATING-SYSTEM NOTES ALL123.pdf
PonniS7
?
Env and Water Supply Engg._Dr. Hasan.pdf
Env and Water Supply Engg._Dr. Hasan.pdfEnv and Water Supply Engg._Dr. Hasan.pdf
Env and Water Supply Engg._Dr. Hasan.pdf
MahmudHasan747870
?
Engineering at Lovely Professional University (LPU).pdf
Engineering at Lovely Professional University (LPU).pdfEngineering at Lovely Professional University (LPU).pdf
Engineering at Lovely Professional University (LPU).pdf
Sona
?
Piping-and-pipeline-calculations-manual.pdf
Piping-and-pipeline-calculations-manual.pdfPiping-and-pipeline-calculations-manual.pdf
Piping-and-pipeline-calculations-manual.pdf
OMI0721
?
Frankfurt University of Applied Science urkunde
Frankfurt University of Applied Science urkundeFrankfurt University of Applied Science urkunde
Frankfurt University of Applied Science urkunde
Lisa Emerson
?
Optimization of Cumulative Energy, Exergy Consumption and Environmental Life ...
Optimization of Cumulative Energy, Exergy Consumption and Environmental Life ...Optimization of Cumulative Energy, Exergy Consumption and Environmental Life ...
Optimization of Cumulative Energy, Exergy Consumption and Environmental Life ...
J. Agricultural Machinery
?
Taykon-Kalite belgeleri
Taykon-Kalite belgeleriTaykon-Kalite belgeleri
Taykon-Kalite belgeleri
TAYKON
?
How to Make an RFID Door Lock System using Arduino
How to Make an RFID Door Lock System using ArduinoHow to Make an RFID Door Lock System using Arduino
How to Make an RFID Door Lock System using Arduino
CircuitDigest
?
Syntax Directed Definitions Synthesized Attributes and Inherited Attributes
Syntax Directed Definitions  Synthesized Attributes  and  Inherited AttributesSyntax Directed Definitions  Synthesized Attributes  and  Inherited Attributes
Syntax Directed Definitions Synthesized Attributes and Inherited Attributes
GunjalSanjay
?
How to Build a Maze Solving Robot Using Arduino
How to Build a Maze Solving Robot Using ArduinoHow to Build a Maze Solving Robot Using Arduino
How to Build a Maze Solving Robot Using Arduino
CircuitDigest
?

Transition Models of Equilibrium Assessment in Bayesian Game

  • 1. December 18 Fri., 2015, 09:10-09:30, Regular Session: Modeling 1, Frb04.3 @ 802 Transition ?Models ?of ?Equilibrium ? Assessment ?in ?Bayesian ?Game Kiminao Kogiso University of Electro-Communications Tokyo, Japan The 54 Conference on Decision and Control Osaka International Convention Center, Osaka, Japan December 15 to 18, 2015 Supported by JSPS Grant-in-Aid for Challenging Exploratory Research 2014 to 2016
  • 2. Outline 2 Introduction ? Static ?Bayesian ?Game ? Novel ?Form ?in ?Bayesian ?Nash ?Equilibrium ? Dynamics ?in ?Equilibrium ?Assessment ? Simulation ? Conclusion
  • 3. Introduction 3 Strategic game enabling to consider uncertainties in player¡¯s decisions. player: a reasonable decision maker action: what a player chooses utility: a player¡¯s preference over the actions type: a label of player¡¯s private valuation (what the player really feels) belief: a probability distribution over the types (degree of feeling, tendency, proclivity,¡­) Static Bayesian Game[1] [1] Harsanyi, 1967. [2] Alpcan and Basar, et al., 2011, 2013. [3] Roy, et al., 2010. [4] Liu, et al., 2006. [5] Akkarajitsakul, et al., 2011. A Bayesian game used in engineering problems to analyze a Bayesian Nash equilibrium or to design a game mechanism. network security[2,3], intrusion detection[4,5,6], belief learning[7] electricity pricing[8,9], mechanism design[10] [6] Sedjelmachi, et al., 2014, 2015. [7] Nachbar, 2008. [8] Li, et al., 2011, 2014. [9] Yang, et al., 2013. [10] Tao, et al., 2015.
  • 4. Introduction 4 Insufficient tools and concepts[11] Bayesian Nash equilibrium plays key roles in game analysis & design. equilibrium analysis: for given belief, find a Bayesian Nash Equilibrium(BNE). belief learning: for given BNE, find a corresponding belief. mechanism design: for given utility, find rules to achieve a desired BNE. Objective of this talk Derive a dynamical state-space model whose state involves a BNE. derive a novel condition related to the BNE, discover a map (discrete-time system) defined by the novel condition, confirm a time response of the map. [11] Powell, 2011. Challenge: prepare tools & concepts to apply our model-based fashion to analysis and design of the game.
  • 5. Bayesian Game Player set Action set Type set Utility Strategy (mixed) Belief Static Bayesian Game: General 5 Two-player two-action Bayesian game w/ two types G(N, A, ?, u, ?, S) N := {1, 2} A := A1 ? A2 ? := ?1 ? ?2 u := (u1, u2) ? := (?1, ?2) S := (S1, S2) ai 2 Ai := {a, ?a} 8i 2 N ?i 2 ?i := {?, ??} 8i 2 N ?i 2 ?(?i) 8i 2 N Si : ?i ! ?(Ai) 8i 2 N si 2 Si(?i) 8i 2 N ?(X) : a probability distribution over a finite set X Ui(?i, ? i) := ? ui(a, a, ?i, ? i) ui(a, ?a, ?i, ? i) ui(?a, a, ?i, ? i) ui(?a, ?a, ?i, ? i) : utility matrix8i 2 N, 8? 2 ? ui : A ? ? ! < 8i 2 N i 2 N
  • 6. Static Bayesian Game: Example 6 Service of tennis 2, 2 0, 1 1, 21, 1 flat spin flat spin 0, 1 1, 2 0, 11, 2 flat spin flat spin sideline 1, 0 1, 1 2, 00, 1 flat spin flat spin 1, 3 1, 2 0, 32, 2 flat spin flat spin centerline s1(a|?) s1(?a|?) s1(?a|??) s1(a|??) s2(a|??) s2(?a|??)s2(?a|?)s2(a|?) center line ? side line ?? ??? ?a ?a ?a ?a ?a ?a ?a?aa a a a ?1(?) ?1(??) ?2(??)?2(?) a a a a type belief
  • 7. Bayesian Nash Equilibrium 7 Equilibrium assessment definitions of Bayesian Nash Equilibrium(BNE) using an ex-ante expected utility: using a best response to opponent strategy: EUi(si, s i) EUi(s0 , s i) 8s0 i 2 Si, s0 i 6= si is denoted as the Bayesian Nash equilibrium. A strategy profile , satisfying , is also a BNE.s = (si, s i) Given a prior common probability , for any , the strategy satisfyingi 2 N sp(?) si 2 BRi(s i, ?) 8i 2 N the pair of considered as key variables of the Bayesian game. Equilibrium Assessment : a pair of a belief and the corresponding BNE.(??, ?s) (?, s) equilibrium analysis[10]: find a BNE .?s9 ??, [10] Y. Shoham and K. Leyton-Brown, Multiagent Systems, Cambridge University Press, 2009.
  • 8. then the pair is an Equilibrium Assessment, where ,? := ? 1 1 ? (??, ?s) Novel Form Satisfying BNE 8 If the game satisfies the following condition (simultaneous polynomial in ): Sufficient condition to be BNE Lemma 8?i 2 ?i, 8i 2 N G ??i(?s i, ?i) (?i)p(??) = 0 ?i(?s i, ?i) := ? Ui(?i, ?)?s i(?) Ui(?i, ??)?s i(??) ? , (?) := ? 1 0 0 0 0 1 0 0 , (??) := ? 0 0 1 0 0 0 0 1 . idea: derived from KKT condition of BNE by cancelation of Lagrangian variables. point: # of the polynomials: 4, # of the variables: 6; D.O.F. in determining their values. note: a BNE (mixed strategy) holds the above equation, but some of pure strategy BNEs do not hold it. ?? all of EAs
  • 9. Discover Dynamics! 9 Map from EA to EA Idea to derive dynamics in EA all of EAs ? EA (??, ?s) satisfying the Lemma all of EAs ? EA (?? + ??, ?s + ?s) satisfying the Lemma
  • 10. Given an initial EA, if there exists such that the game satisfies the following condition w.r.t. utility matrices: , Dynamics in Equilibrium Assessment 10 Main result Theorem ? 1 1 ? Ui(?i, ?) ? 1 1 1 = 0 ? 1 1 ? Ui(?i, ??) ? 2 1 2 = 0 8?i 2 ?i8i 2 N ??(k + 1) = diag(A1, A2)??(k) ?s(k + 1) = A (ci(k))?si(k) ci(k) := ??i(?i, k + 1) ??i(?i, k) , and is a row stochastic matrix.Ai 2 <2?2 8i 2 N = [ 1 2]T 2 <2 then a nonlinear autonomous system in terms of the equilibrium assessment: transfers from an EA to another EA , where(??(k), ?s(k)) (??(k + 1), ?s(k + 1)) ci(k) ! 1 A (1) = I ci(k) := ??i(?i, k + 1) ??i(?i, k)??(k + 1) = diag(A1, A2)??(k) ?s(k + 1) = A (ci(k))?si(k) ??(k) ??(k + 1) stable linear system: time-varying system: ¡¤ ?s(k) ??(k)
  • 11. Simulation 11 Trajectory of equilibrium assessment 0.3 0.4 0.5 0.6 0.7 10 0.2 0.4 0.6 0.8 probabilityprobability step 0 1 2 3 4 5 6 7 8 9 100 1 2 3 4 5 6 7 8 9 s1( )a|¦È?? s1( )a|¦È? ? s1( )a|¦È ? ? s1( )a|¦È ?? s2( )a|¦È?? s2( )a|¦È? ? s2( )a|¦È ? ? s2( )a|¦È ?? 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 the proposed model the computation of the best respose (4) 10 expectedutilityvalue step 0 1 2 3 4 5 6 7 8 9 EU1 EU2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 10 step 0 1 2 3 4 5 6 7 8 9 valueofci()¦Èi c ( )1 ¦È? c ( )1 ¦È ? c ( )2 ¦È? c ( )2 ¦È ? 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 10 probability step 0 1 2 3 4 5 6 7 8 9 1( )¦Ç ¦È? 1( )¦Ç ¦È ? 2( )¦Ç ¦È? 2( )¦Ç ¦È ? belief strategy c expected utility
  • 12. Conclusion 12 Introduction Static Bayesian Game two-players two-actions game with two-types New Form in Bayesian Nash Equilib. polynomial conditions in equilibrium assessment Dynamics in Equilibrium Assessment discrete-time autonomous time-varying system convergence of the EA (stability) Simulation confirms states updated become EA and converge. Future works estimate player¡¯s belief for a given BNE, and realize a control-theoretic mechanism design method. 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 10 probability step 0 1 2 3 4 5 6 7 8 9 1( )¦Ç ¦È? 1( )¦Ç ¦È ? 2( )¦Ç ¦È? 2( )¦Ç ¦È ? belief 0.3 0.4 0.5 0.6 0.7 10 0.2 0.4 0.6 0.8 probabilityprobability step 0 1 2 3 4 5 6 7 8 9 100 1 2 3 4 5 6 7 8 9 s1( )a|¦È?? s1( )a|¦È? ? s1( )a|¦È ? ? s1( )a|¦È ?? s2( )a|¦È?? s2( )a|¦È? ? s2( )a|¦È ? ? s2( )a|¦È ?? strategy ??(k + 1) = diag(A1, A2)??(k) ?s(k + 1) = A (ci(k))?si(k) Dynamics in EA: