[BMI776] A note on likelihoods
Colin Dewey
cdewey at biostat.wisc.edu
Fri Mar 9 13:42:21 CST 2007
Hi all,
I've had a number of questions regarding the likelihood that is being
maximized in EM and Gibbs, so I thought I would post this clarification.
- The log likelihood of the data and a set of motif positions given
the parameters in the OOPS model is given on slide 3 of lecture 6.
In this equation, the Z_i,j are binary values.
- During the EM algorithm, you are maximizing the *expected* log
likelihood, which is also given by the formula on slide 3 of lecture
6, but with the Z_i,j now representing expected values of the binary
Z_i,j (so the Z_i,j terms will be between 0 and 1). You should
calculate the expected log likelihood after each EM iteration and
stop when it doesn't change by much (e.g., if the difference isn't
greater than 0.01).
- During Gibbs, you calculate just the log likelihood, because you
have set motif positions at each iteration (i.e., the Z_i,j are binary).
- In the log likelihood formula, the term P[X_i | Z_i,j = 1, theta]
is maybe more easily seen on slide 13 of lecture 5.
Hope that helps,
Colin
More information about the BMI776
mailing list