Abstract
We analyzed the completed human genome for recent segmental duplications (size ≥ 1 kb and sequence similarity ≥ 90%). We found that approximately 4% of the genome is covered by duplications and that the extent of segmental duplication varies from 1% to 14% among the 24 chromosomes. Intrachromosomal duplication is more frequent than interchromosomal duplication in 15 chromosomes. The duplication frequencies in pericentromeric and subtelomeric regions are greater than the genome average by approximately threefold and fourfold. We examined factors that may affect the frequency of duplication in a region. Within individual chromosomes, the duplication frequency shows little correlation with local gene density, repeat density, recombination rate, and GC content, except chromosomes 7 and Y. For the entire genome, the duplication frequency is correlated with each of the above factors. Based on known genes and Ensembl genes, the proportion of duplications containing complete genes is 3.4% and 10.7%, respectively. The proportion of duplications containing genes is higher in intrachromosomal than in interchromosomal duplications, and duplications containing genes have a higher sequence similarity and tend to be longer than duplications containing no genes. Our simulation suggests that many duplications containing genes have been selectively maintained in the genome.
Original language | English |
---|---|
Pages (from-to) | 135-141 |
Number of pages | 7 |
Journal | Molecular Biology and Evolution |
Volume | 22 |
Issue number | 1 |
DOIs | |
State | Published - 1 Jan 2005 |
Keywords
- Gene density
- Positive selection
- Recombination rate
- Repetitive elements