A New Scheme to Characterize and Identify Protein Ubiquitination Sites

Van Nui Nguyen, Kai Yao Huang, Chien Hsun Huang, K. Robert Lai, Tzong Yi Lee

Research output: Contribution to journalArticlepeer-review

22 Scopus citations

Abstract

Protein ubiquitination, involving the conjugation of ubiquitin on lysine residue, serves as an important modulator of many cellular functions in eukaryotes. Recent advancements in proteomic technology have stimulated increasing interest in identifying ubiquitination sites. However, most computational tools for predicting ubiquitination sites are focused on small-scale data. With an increasing number of experimentally verified ubiquitination sites, we were motivated to design a predictive model for identifying lysine ubiquitination sites for large-scale proteome dataset. This work assessed not only single features, such as amino acid composition (AAC), amino acid pair composition (AAPC) and evolutionary information, but also the effectiveness of incorporating two or more features into a hybrid approach to model construction. The support vector machine (SVM) was applied to generate the prediction models for ubiquitination site identification. Evaluation by five-fold cross-validation showed that the SVM models learned from the combination of hybrid features delivered a better prediction performance. Additionally, a motif discovery tool, MDDLogo, was adopted to characterize the potential substrate motifs of ubiquitination sites. The SVM models integrating the MDDLogo-identified substrate motifs could yield an average accuracy of 68.70 percent. Furthermore, the independent testing result showed that the MDDLogo-clustered SVM models could provide a promising accuracy (78.50 percent) and perform better than other prediction tools. Two cases have demonstrated the effective prediction of ubiquitination sites with corresponding substrate motifs.

Original languageEnglish
Article number7400980
Pages (from-to)393-403
Number of pages11
JournalIEEE/ACM Transactions on Computational Biology and Bioinformatics
Volume14
Issue number2
DOIs
StatePublished - 1 Mar 2017

Keywords

  • Protein ubiquitination
  • maximal dependence decomposition
  • substrate motif
  • support vector machine

Fingerprint

Dive into the research topics of 'A New Scheme to Characterize and Identify Protein Ubiquitination Sites'. Together they form a unique fingerprint.

Cite this