Protein structure database search and evolutionary classification

Jinn-Moon Yang*, Chi Hua Tung

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

92 Scopus citations

Abstract

As more protein structures become available and structural genomics efforts provide structural models in a genome-wide strategy, there is a growing need for fast and accurate methods for discovering homologous proteins and evolutionary classifications of newly determined structures. We have developed 3D-BLAST, in part, to address these issues. 3D-BLAST is as fast as BLAST and calculates the statistical significance (E-value) of an alignment to indicate the reliability of the prediction. Using this method, we first identified 23 states of the structural alphabet that represent pattern profiles of the backbone fragments and then used them to represent protein structure databases as structural alphabet sequence databases (SADB). Our method enhanced BLAST as a search method, using a new structural alphabet substitution matrix (SASM) to find the longest common substructures with high-scoring structured segment pairs from an SADB database. Using personal computers with Intel Pentium4 (2.8 GHz) processors, our method searched more than 10 000 protein structures in 1.3 s and achieved a good agreement with search results from detailed structure alignment methods. [3D-BLAST is available at http://3d-blast.life.nctu.edu.tw].

Original languageEnglish
Pages (from-to)3646-3659
Number of pages14
JournalNucleic acids research
Volume34
Issue number13
DOIs
StatePublished - 2006

Fingerprint

Dive into the research topics of 'Protein structure database search and evolutionary classification'. Together they form a unique fingerprint.

Cite this