Predicting protein-protein interaction sites using sequence descriptors and site propensity of neighboring amino acids

Tzu Hao Kuo, Kuo Bin Li*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

15 Scopus citations

Abstract

Information about the interface sites of Protein-Protein Interactions (PPIs) is useful for many biological research works. However, despite the advancement of experimental techniques, the identification of PPI sites still remains as a challenging task. Using a statistical learning technique, we proposed a computational tool for predicting PPI interaction sites. As an alternative to similar approaches requiring structural information, the proposed method takes all of the input from protein sequences. In addition to typical sequence features, our method takes into consideration that interaction sites are not randomly distributed over the protein sequence. We characterized this positional preference using protein complexes with known structures, proposed a numerical index to estimate the propensity and then incorporated the index into a learning system. The resulting predictor, without using structural information, yields an area under the ROC curve (AUC) of 0.675, recall of 0.597, precision of 0.311 and accuracy of 0.583 on a ten-fold cross-validation experiment. This performance is comparable to the previous approach in which structural information was used. Upon introducing the B-factor data to our predictor, we demonstrated that the AUC can be further improved to 0.750. The tool is accessible at http://bsaltools.ym.edu.tw/predppis.

Original languageEnglish
Article number1788
JournalInternational Journal Of Molecular Sciences
Volume17
Issue number11
DOIs
StatePublished - Nov 2016

Keywords

  • Intrinsically-disorder protein
  • Machine learning algorithms
  • Protein-protein interaction

Fingerprint

Dive into the research topics of 'Predicting protein-protein interaction sites using sequence descriptors and site propensity of neighboring amino acids'. Together they form a unique fingerprint.

Cite this