Incorporating significant amino acid pairs and protein domains to predict RNA splicing-related proteins with functional roles

Justin Bo Kai Hsu, Kai Yao Huang, Tzu Ya Weng, Chien Hsun Huang, Tzong Yi Lee*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Machinery of pre-mRNA splicing is carried out through the interaction of RNA sequence elements and a variety of RNA splicing-related proteins (SRPs) (e.g. spliceosome and splicing factors). Alternative splicing, which is an important post-transcriptional regulation in eukaryotes, gives rise to multiple mature mRNA isoforms, which encodes proteins with functional diversities. However, the regulation of RNA splicing is not yet fully elucidated, partly because SRPs have not yet been exhaustively identified and the experimental identification is labor-intensive. Therefore, we are motivated to design a new method for identifying SRPs with their functional roles in the regulation of RNA splicing. The experimentally verified SRPs were manually curated from research articles. According to the functional annotation of Splicing Related Gene Database, the collected SRPs were further categorized into four functional groups including small nuclear Ribonucleoprotein, Splicing Factor, Splicing Regulation Factor and Novel Spliceosome Protein. The composition of amino acid pairs indicates that there are remarkable differences among four functional groups of SRPs. Then, support vector machines (SVMs) were utilized to learn the predictive models for identifying SRPs as well as their functional roles. The cross-validation evaluation presents that the SVM models trained with significant amino acid pairs and functional domains could provide a better predictive performance. In addition, the independent testing demonstrates that the proposed method could accurately identify SRPs in mammals/plants as well as effectively distinguish between SRPs and RNA-binding proteins. This investigation provides a practical means to identifying potential SRPs and a perspective for exploring the regulation of RNA splicing.

Original languageEnglish
Pages (from-to)49-60
Number of pages12
JournalJOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN
Volume28
Issue number1
DOIs
StatePublished - Jan 2014

Keywords

  • Amino acid pair composition
  • RNA splicing
  • Spliceosome
  • Splicing-related protein
  • Support vector machine

Fingerprint

Dive into the research topics of 'Incorporating significant amino acid pairs and protein domains to predict RNA splicing-related proteins with functional roles'. Together they form a unique fingerprint.

Cite this