[Bmi776] study guide for final exam
Mark Craven
craven at biostat.wisc.edu
Tue May 2 23:27:12 CDT 2006
Hi All,
Two comments about the exam:
1) I forgot to mention this in class, but you can bring
two pages of notes to use during the final exam. Write
whatever you want on these pages.
2) Below is something of a study guide. For each
major topic, I've listed the concepts you should
understand, and the algorithms you should be able
to work in detail.
Mark
Probabilistic Sequence Analysis
concepts to know
the motif learning task
EM algorithms
Gibbs sampling
tying parameters
the MDD representation
semi-Markov models (generalized HMMs)
dynamic programming with semi-Markov models
TWINSCAN paired sequence representation
pairwise HMMs
sensitivity/recall, specificity, precision
be able to do
MEME algorithm
interpolated Markov models
back-off models
design HMMs with specified duration models
Sequence Alignment
concepts to know
the genome-alignment task
genome rearrangements
suffix trees
tries
threaded tries
maximal unique matches (MUMs)
multi-MUMs
longest increasing subsequence problem
constrained dynamic programming
recursive anchoring
overview of MUMMER/LAGAN/MLAGAN/Mauve algorithms
be able to do
show suffix trees for a given (set of) string(s)
show trie/threaded trie for strings
calculate MUMs and mult-MUMs
RNA structure modeling
concepts to know
RNA secondary structure
the secondary structure prediction task
how Nussinov can be generalized to do energy minimization
transformational grammars
probabilistic grammars
the Chomsky hierarchy
why CFGs are appropriate for RNA modeling
what the Inside, CYK and Inside/Outside algorithms do
using SCFGs for structure/sequence alignments
using SCFGs to predict novel RNA genes
be able to do
the Nussinov algorithm
show parse trees for a sequence with a given grammar
Inside algorithm
Inside/Outside algorithm
Biomedical Text Processing
concepts to know
the vector space model
the ARROWSMITH system
using EM to find themes in literature
the experiment annotation task
the named-entity recognition task
the relation-extraction task
hierarchical HMMs
the gene/protein annotation task
the CSM/CSC method
the method for augmenting PSI-BLAST with text
be able to do
Schwartz & Hearst algorithm for recognizing abbreviations
Network Models
concepts to know
Bayesian networks
dynamic Bayesian networks
representing CPDs using tables, linear Gaussian models, trees
the structure learning problem
Bayesian approach to scoring networks
the parameter learning problem
Markov blankets
the bootstrap method
permutation testing
how to elucidate causality
the general inference problem
be able to do
the Sparse Candidate algorithm
inference by enumeration
inference by variable elimination
More information about the Bmi776
mailing list