The repetitive sequence database and mining putative regulatory elements in gene promoter regions

Jorng Tzong Horng*, Hsien Da Huang, Ming Hui Jin, Li Cheng Wu, Shir Ly Huang

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

15 Scopus citations


At least 43% of the human genome is occupied by repetitive elements. Moreover, around 51% of the rice genome is occupied by repetitive elements. The analysis of repetitive elements reveals that repetitive elements in our genome may have been very important in the evolutionary genomics. The first part of this study is to describe a database of repetitive elements - RSDB. The RSDB database contains repetitive elements, which are classified into the following categories: exact, tandem, and similar. The interfaces needed to query and show the results and statistical data, such as the relationship between repetitive elements and genes, cross-references of repetitive elements among different organisms, and so on, are provided. The second part of this study then attempts to mine the putative binding site for information on how combinations of the known regulatory sites and overrepresented repetitive elements in RSDB are distributed in the promoter regions of groups of functionally related genes. The overrepresented repetitive elements appearing in the associations are possible transcription factor binding sites. Our proposed approach is applied to Saccharomyces cerevisiae and the promoter regions of Yeast ORFs. The complete contents of RSDB and partial putative binding sites are available to the public at The readers may download partial query results.

Original languageEnglish
Pages (from-to)621-640
Number of pages20
JournalJournal of Computational Biology
Issue number4
StatePublished - 2002


  • Data mining
  • Database
  • DNA
  • Genes
  • Repetitive elements


Dive into the research topics of 'The repetitive sequence database and mining putative regulatory elements in gene promoter regions'. Together they form a unique fingerprint.

Cite this