TSSUNet-MB – ab initio identification of σ70 promoter transcription start sites in Escherichia coli using deep multitask learning

Chung En Ni, Duy Phuong Doan, Yen Jung Chiu, Yen Hua Huang*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Motivation: Computational promoter prediction (CPP) tools designed to classify prokaryotic promoter regions usually assume that a transcription start site (TSS) is located at a predefined position within each promoter region. Such CPP tools are sensitive to any positional shifting of the TSS in a windowed region, and they are unsuitable for determining the boundaries of prokaryotic promoters. Results: TSSUNet-MB is a deep learning model developed to identify the TSSs of σ70 promoters. Mononucleotide and bendability were used to encode input sequences. TSSUNet-MB outperforms other CPP tools when assessed using the sequences obtained from the neighborhood of real promoters. TSSUNet-MB achieved a sensitivity of 0.839 and specificity of 0.768 on sliding sequences, while other CPP tool cannot maintain both sensitivities and specificities in a compatible range. Furthermore, TSSUNet-MB can precisely predict the TSS position of σ70 promoter-containing regions with a 10-base accuracy of 77.6%. By leveraging the sliding window scanning approach, we further computed the confidence score of each predicted TSS, which allows for more accurately identifying TSS locations. Our results suggest that TSSUNet-MB is a robust tool for finding σ70 promoters and identifying TSSs.

Original languageEnglish
Article number107904
JournalComputational Biology and Chemistry
Volume105
DOIs
StatePublished - Aug 2023

Keywords

  • Prokaryotic transcription start site
  • Promoter
  • σ

Fingerprint

Dive into the research topics of 'TSSUNet-MB – ab initio identification of σ70 promoter transcription start sites in Escherichia coli using deep multitask learning'. Together they form a unique fingerprint.

Cite this