Efficiency of the cross-entropy method in Markov chain problems
Ridder, A; Tuffin, B (Amsterdam; Rennes)
Monday 21 June 2010, 11:40-12:05
Seminar Room 1, Newton Institute
Abstract
Consider the problem of estimating the probability ?(0) that a discrete-time Markov chain (X(t))1t=0
on a finite state space X, with transition probabilities (p(x, y))x,y2X , reaches a failure set F from
a reference state 0 before returning to the reference state. We assume that this is a rare event,
where the rarity parameter n might be the population size in queueing applications, or n = 1/o
in reliability applications, with o the failure rate. For simulation we apply importance sampling
by generating sample paths of the Markov chain using transition probabilities pIS(x, y). Let Y IS
n
be the resulting estimator of ?(0). In this paper we investigate the efficiency of the estimator, in
terms of bounded relative error, and logarithmically bounded relative error, defined by
limsup
n!1
p
Var[Yn]
E[Yn] < 1, and liminf
n!1
log
p
Var[Yn]
logE[Yn] 1,
respectively. Optimal but not implementable would be to use the zero-variance transition probabilities
pZV(x, y) = p(x, y)?(y)/?(x), where ?(x) is the probability of reaching the failure set before
the reference state when starting in state x. Then the importance sampling estimator has zero
variance. Suppose that we approximate ?(x) by ?APP(x), for any state x. L’Ecuyer and Tuffin
(2009) gave conditions for the approximated transition probabilities under which the associated
importance sampling estimator shows bounded relative error.
In this paper we analyse the cross-entropy method for estimating the zero-variance transition probabilities.
For that purpose we first show that the zero-variance transition probabilities satisfy
pZV(x, y) = E[1{X(T) 2 F}N(x, y)|X(0) = 0]
E
1{X(T) 2 F}
P
z2X N(x, z)|X(0) = 0
,
where T is the first entrance time in F [ {0}}, and N(x, y) is the number of times that transition
(x, y) has occured. The cross-entropy method provides a procedure for estimating the expectations
in this expression, resulting in estimators pEST(x, y) of the zero-variance transition probabilities.
Let pCE(x, y) be realised values of these estimators, based on finite sample sizes. Then we give a
sufficient condition for the associated estimator Y CE
n to show logarithmically bounded relative error.
The condition is in terms of the Kullback-Leibler distance D(PZV, PCE) between the underlying
probability measures that are associated with these changes of measure. Moreover, we propose a probabilistic condition for the estimators pEST(x, y) such that we obtain the bounded relative error
property probabilistically. This means the following, the condition gives us a probability a, such
that with probability a the estimator Y CE
n of ?(0) has bounded relative error. The proof is based
on checking the conditions of L’Ecuyer and Tuffin (2009).
Presentation
Comments
Start the discussion!