Computational Molecular Biology

Blocks, Profiles & Hidden Markov Models

Doug Brutlag

October 30, 2008

Bucher, P., Karplus, K., Moeri, N., & Hofmann, K. (1996). A flexible motif search technique based on generalized profiles. Comput Chem, 20(1), 3-23.

Bairoch, A., Bucher, P. and Hofmann, K. (1997). The PROSITE database, its status in 1997. Nucleic Acids Res, 25(1), 217-21.

Bairoch, A. and Apweiler, R. (1997). The SWISS-PROT protein sequence data bank and its supplement TrEMBL. Nucleic Acids Res, 25(1), 31-6.

Baldi, P., & Chauvin, Y. (1996). Hybrid modeling, HMM/NN architectures, and protein applications. Neural Comput, 8(7), 1541-65.

Bork, P., & Gibson, T. J. (1996). Applying Motif and Profile Searches. Methods in Enzymology, 266, 162-183.

Back to Top

Bowie, J. U., Luthy, R. and Eisenberg, D. (1991). A Method to Identify Protein Sequences That Fold Into a Known Three-Dimensional Structure. Science 253 , 164-170.

Bowie, J. U., Zhang, K., Wilmanns, M. and Eisenberg, D. (1996). Three-Dimensional Profiles for Measuring Compatability of Amino Acid Sequence with Three-Dimensional Structure. Methods in Enzymology, 266, 598-616.

Fuchs, R. (1994). Fast protein block searches. Comput Appl Biosci, 10(1), 79-80.

Gribskov, M., McLachlan, A. D. and Eisenberg, D. (1987). Profile analysis: Dectection of distantly related proteins. Proc. Natl. Acad. Sci. USA, 84, 4355-4358.

Gribskov, M., Homyak, M., Edenfield, J., & Eisenberg, D. (1988). Profile scanning for three-dimensional structural patterns in protein sequences. Comput Appl Biosci, 4, 61-6.

Gribskov, M. (1994). Profile analysis. Methods Mol Biol 25 , 247-66.

Gribskov, M. and Veretnik, S. (1996). Identification of Sequence Patterns with Profile Analysis. Methods in Enzymology, 266, 198-211.

Back to Top

Henikoff, S. (1991). Playing with blocks: some pitfalls of forcing multiple alignments. New Biol 3 (12), 1148-54.

Henikoff, S. and Henikoff, J. G. (1991). Automated assembly of protein blocks for database searching. Nucleic Acids Res 19 (23), 6565-72.

Henikoff, S., & Henikoff, J. G. (1994). Protein family classification based on searching a database of blocks. Genomics, 19(1), 97-107.

Henikoff, S. and Henikoff, J. G. (1994). Position-based Sequence Weights. J. Mol. Biol. 243 , 574-578.

Henikoff, J. G. and Henikoff, S. (1996). Blocks Database and Its Applications. Methods in Enzymology, 266, 88-104.

Henikoff, S. (1996). Scores for Sequence Searches. Current Opinion in Structural Biology, 6(3), 353-360.

Luthy, R., Bowie, J. U. and Eisenberg, D. (1992). Assessment of protein models with three-dimensional profiles. Nature 356 (6364), 83-85.

Perkins, D. N., & Attwood, T. K. (1996). XFINGER: a tool for searching and visualising protein fingerprints and patterns. Comput Appl Biosci, 12(2), 89-94.

Pietrovski, S., Henikoff, J. G., & Henikoff, S. (1996). The Blocks Database A ssytem for Protein Classification. Nucleic Acids Res., 24(1), 197-200.

Poch, O. and Delarue, M. (1996). Converting Sequence Block Alignments into Structural Insights. Methods in Enzymology, 266, 662-680.

Posfai, J., Bhagwat, A. S., Posfai, G. and Roberts, R. J. (1989). Predictive motifs derived from cytosine methyltransferases. Nucleic Acids Res 17 (7), 2421-35.

Back to Top

Saqi, M. A. S. and Sternberg, M. J. E. (1994). Identification of sequence motifs from a set of proteins with related function. Protein Engineering 7 (2), 165-171.

Saqi, M. A. and Sayle, R. (1994). PdbMotif--a tool for the automatic identification and display of motifs in protein structures. Comput Appl Biosci, 10(5), 545-6.

Smith, H. O., Annau, T. M. and Chandrasegaran, S. (1990). Finding sequence motifs in groups of functionally related proteins. Proc Natl Acad Sci U S A, 87(2), 826-30.

Staden, R. (1988). Methods to define and locate patterns of motifs in sequences. Comput Appl Biosci, 4 (1), 53-60.

Suyama, M., Matsuo, Y., & Nishikawa, K. (1997). Comparison of protein structures using 3D profile alignment. J Mol Evol, 44 Suppl 1, S163-73.

Thompson, J. D., Higgins, D. G., & Gibson, T. J. (1994). Improved sensitivity of profile searches through the use of sequence weights and gap excision. Comput Appl Biosci, 10(1), 19-29.

Tatusov, R. L., Altschul, S. F. and Koonin, E. V. (1994). Detection of conserved segments in proteins: iterative scanning of sequence databases with alignment blocks. Proc Natl Acad Sci U S A 91 (25), 12091-5.

Wallace, J. C. and Henikoff, S. (1992). PATMAT: a searching and extraction program for sequence, pattern and block queries and databases. Comput Appl Biosci 8 (3), 249-54.

Back to Lecture

Back to Syllabus