Protein domain repetition is enriched in Streptococcal cell-surface proteins

I. Hsuan Lin, Ming Ta Hsu, Chuan Hsiung Chang*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

11 Scopus citations


Tandem repetition of domain in protein sequence occurs in all three domains of life. It creates protein diversity and adds functional complexity in organisms. In this work, we analyzed 52 streptococcal genomes and found 3748 proteins contained domain repeats. Proteins not harboring domain repeats are significantly enriched in cytoplasm, whereas proteins with domain repeats are significantly enriched in cytoplasmic membrane, cell wall and extracellular locations. Domain repetition occurs most frequently in S. pneumoniae and least in S. thermophilus and S. pyogenes. DUF1542 is the highest repeated domain in a single protein, followed by Rib, CW_binding_1, G5 and HemolysinCabind. 3D structures of 24 repeat-containing proteins were predicted to investigate the structural and functional effect of domain repetition. Several repeat-containing streptococcal cell surface proteins are known to be virulence-associated. Surface-associated tandem domain-containing proteins without experimental functional characterization may be potentially involved in the pathogenesis of streptococci and deserve further investigation.

Original languageEnglish
Pages (from-to)370-379
Number of pages10
Issue number6
StatePublished - Dec 2012


  • Domain repeats
  • Domain repetition
  • Protein structure modeling
  • Protein subcellular localization
  • Streptococcus
  • Virulence


Dive into the research topics of 'Protein domain repetition is enriched in Streptococcal cell-surface proteins'. Together they form a unique fingerprint.

Cite this