�ݺ�ߣ

Inferring parameters in a large stochastic model using only proportions of cell
death: insights from the birth-death model
Holly Ainsworth, Richard Boys & Colin Gillespie∗
Newcastle University, UK
REFERENCES
[1] Tang, M.Y., Proctor, C.J., Woulfe, J., Gray, D.A.
Experimental and Computational Analysis of
Polyglutamine-Mediated Cytotoxicity In PLoS Com-
put Biol., 2010 .
INTRODUCTION →
• Expanded polyglutamine (PolyQ) proteins
are known to be the causative agents in a
number of neurodegenerative diseases, but
they are still poorly understood.
• Aggregation of speciﬁc proteins are part of
the normal process of ageing in the brain
as well as in many age-related diseases.
• A causative link between aggregation and
disease is not universally accepted.
STOCHASTIC MODEL ↓
Stochastic kinetic model with uncertain pa-
rameters:
• 27 (chemical) species
• 70 reactions
• 40 rate constants, denoted θ
The model aims to explore the relationship
between PolyQ, p38MAPK activation, genera-
tion of reactive oxygen species (ROS), protea-
some inhibition and inclusion body formation.
DATA ←
Data are proportions (of cell death) - not quan-
titative trait measurements
Scenario 24hrs 36hrs 48hrs
GFP 15.03 14.55 26.08
H25 18.97 18.07 22.50
H103 21.68 23.44 36.44
SIMULATION ↓
Denote the probability of cell death at time t, pt(θ). Note the probability of cell death depends
on parameters θ. Given θ, the Gillespie algorithm can be used to simulate the time evolution of
a particular cell:
• The simulation gives us a binary time series for the cell, with 1 = death and 0 = no death.
• Repeating the above for n cells gives us a handle on pt(θ) via the observed proportion of
cell death ˆpt(θ), where
ˆpt(θ) ∼
1
n
Bin(n, pt(θ)).
BIRTH-DEATH MODEL →
Let x denote the number of individuals
present in the population. In chemical kinetic
notation, this system is represented as
R1 : x
λ
−→ 2x (birth)
R2 : 2x
µ
−→ x (death)
Example simulations from the model:
q qq0
5
10
15
0 4 8 12
Time
Population
0.00
0.25
0.50
0.75
1.00
0 4 8 12
Time
Probabilityofextinction
Simulator n = 10 n = 100 n = 1000 n = 10000
Comparisons with PolyQ model:
• Compare a single population governed by
a birth-death process with a cell governed
by the PolyQ model.
• A cell becoming extinct in the birth-death
process can be likened to a cell dying in
the PolyQ model.
• Proportions of extinction from the birth-
death model are comparible to propor-
tions of cell death from the PolyQ model.
An analytic expression for the probability of
extinction in the birth-death process for given
t, λ, µ and initial population level is available.
INFERENCE ↓
Work with proportions on the logit scale and assume data model:
yt = logit xt = logit pt(θ) + σ t, t = 1, . . . , T
where t ∼ N(0, 1) independently. The posterior of interest is
π(θ, σ|y) ∝ π(θ)π(σ)π(y|θ, σ).
Approaches to inference:
• Vanilla MCMC - approximate the distribution of ˆpt(θ) and account for the uncertainty using
elogit ˆpt(θ) ∼ N logit pt(θ),
1
npt(θ)[1 − pt(θ)]
approximately.
• Pseudo marginal 1 - construct a Monte Carlo estimate of the marginal likelihood.
• Pseudo marginal 2 - at each iteration of the MCMC scheme, use a (SIR) particle ﬁlter to
construct a SMC approximation to the marginal likelihood.
RESULTS
log(λ) log(µ) log(σ)
0
1
2
3
4
5
0
1
2
3
4
5
0
1
2
3
4
5
n=10n=100n=1000
−3 −2 −1 0 1 −0.5 0.0 0.5 1.0 −3 −2 −1 0 1
Parameter value
Density
Vanilla MCMC Pseudo−marginal MCMC 1 Pseudo−marginal MCMC 2
Inference using exact
probability of death
• Both pseudo-marginal schemes involve
more work. They use n×#particles runs
of the simulator at each iteration, com-
pared to n runs for the original scheme.
• The pseudo-marginal approach has the
advantage that it performs exact infer-
ence (for a particular choice of n).
• This will be useful particularly when
the asymptotic distributional result for
elogit ˆpt(θ) is poor for small n.
• The SMC approach to estimating the
marginal likelihood appears to mix better
than the MC approach.

�ݺ�ߣ

Poster for Information, probability and inference in systems biology (IPISB 2013)

More Related Content

Poster for Information, probability and inference in systems biology (IPISB 2013)