Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2005 Jul;77(1):78-88.
doi: 10.1086/431652. Epub 2005 May 25.

Segmental duplications and copy-number variation in the human genome

Affiliations

Segmental duplications and copy-number variation in the human genome

Andrew J Sharp et al. Am J Hum Genet. 2005 Jul.

Abstract

The human genome contains numerous blocks of highly homologous duplicated sequence. This higher-order architecture provides a substrate for recombination and recurrent chromosomal rearrangement associated with genomic disease. However, an assessment of the role of segmental duplications in normal variation has not yet been made. On the basis of the duplication architecture of the human genome, we defined a set of 130 potential rearrangement hotspots and constructed a targeted bacterial artificial chromosome (BAC) microarray (with 2,194 BACs) to assess copy-number variation in these regions by array comparative genomic hybridization. Using our segmental duplication BAC microarray, we screened a panel of 47 normal individuals, who represented populations from four continents, and we identified 119 regions of copy-number polymorphism (CNP), 73 of which were previously unreported. We observed an equal frequency of duplications and deletions, as well as a 4-fold enrichment of CNPs within hotspot regions, compared with control BACs (P < .000001), which suggests that segmental duplications are a major catalyst of large-scale variation in the human genome. Importantly, segmental duplications themselves were also significantly enriched >4-fold within regions of CNP. Almost without exception, CNPs were not confined to a single population, suggesting that these either are recurrent events, having occurred independently in multiple founders, or were present in early human populations. Our study demonstrates that segmental duplications define hotspots of chromosomal rearrangement, likely acting as mediators of normal variation as well as genomic disease, and it suggests that the consideration of genomic architecture can significantly improve the ascertainment of large-scale rearrangements. Our specialized segmental duplication BAC microarray and associated database of structural polymorphisms will provide an important resource for the future characterization of human genomic disorders.

PubMed Disclaimer

Figures

Figure  1
Figure 1
Targeted design strategy for choosing BACs for array CGH on the basis of genomic architecture. We hypothesized that the presence of large, highly homologous segmental duplications predisposes certain regions of the genome to microdeletion and/or microduplication by nonallelic homologous recombination (Bailey et al. 2002). Termed “rearrangement hotspots,” these are defined by the presence of flanking intrachromosomal duplications >10 kb in length with >95% similarity and separated by 50 kb to 10 Mb of intervening sequence. For each region, we selected BACs that were contained entirely within each rearrangement hotspot, BACs that overlapped the segmental duplications, and BACs in the peripheral unique sequence for microarray manufacture. The SD microarray comprised a total of 2,194 BACs.
Figure  2
Figure 2
Observations of minor alleles in CNP regions. The 47 samples were categorized as American Indian, sub-Saharan African (Biaka and Mbuti), white (Czech and Druze), and Asian (Chinese and Japanese), and the 119 CNPs identified were plotted (see table A2 in appendix A [online only]). In cases in which the reference sample represented the rare allele (i.e., the majority of individuals showed a change in fluorescence intensity ratio), we classified the CNPs as minor-allele variants. Some previously identified CNPs are indicated by numerals above the bars: 1 = CHRNA7/CHRFAM7A at 15q13.3 (Riley et al. 2002); 2 = IGVH/SLC6A8/CDM pseudogene cluster at 16p11.2 (Barber et al. 1999); 3 = CCL3-L1/CCL4-L1 at 17q12 (Townson et al. 2002); 4 = IGHG1 gene cluster at 14q32.33 (Sasso et al. 1995); 5 = IGL gene cluster at 22q11.22 (van der Burg et al. 2002); 6 = NF1/IGVH/GABRA5 pseudogene amplification at 15q11.2-q13 (Fantes et al. 2002); and 7 = β-defensin gene cluster at 8p23.1 (Hollox et al. 2003).
Figure  3
Figure 3
CNPs detected by array CGH, and verification by FISH. A, Genomic profile of GM10473A (a Biaka female). After quality filtering, 1,967 BAC clones were ordered sequentially from 1pter to Yqter on the basis of physical location in the July 2003 genome assembly (X-axis). Each data point represents the mean log2 ratio of test:reference intensity of the three replicate spots of each BAC from a single hybridization experiment (Y-axis). Each hybridization was repeated with reverse labeling (dye swap), and clones that yielded log2 ratios deviating >2 SDs from the mean of all autosomal clones in both experiments were classified as variant. The reference DNA is male; thus, clones located on the sex chromosomes show log2 ratios consistent with female:male hybridization. Three previously reported CNPs are circled: the β-defensin gene cluster at 8p23.1 (Hollox et al. 2003) (solid circle), the IGHG1 gene cluster at 14q32.33 (Sasso et al. 1995) (dashed circle), and the IGVH/SLC6A8/CDM pseudogene cluster at 16p11.2 (Barber et al. 1999) (dotted circle). A novel variant locus, RP11-136P13 (chromosome 10:81097351–81263857 [green circle]), yielded log2 ratios of 0.44 and 0.48 in replicate hybridizations. B, FISH confirmation of RP11-136P13, which is composed entirely of segmental-duplication sequence and therefore shows dual signals when used as a FISH probe (green). However, the presence of an additional signal in interphase nuclei shows the polymorphic nature of this locus in GM10473A. A control probe (red) confirms that the cells are diploid. This duplication was also observed in two other subjects. C, Array CGH profile of chromosome 8 in GM10493 (a Biaka). Clones are ordered by physical distance from 8pter in kb (X-axis), with error bars showing the SD of the log2 ratios from the three replicate spots. Two adjacent clones, RP11-159F11 and RP11-46M15 (red circle), yielded log2 ratios of 0.41, indicating the presence of a duplication. A BAC that yielded a log2 ratio of −0.81 in this experiment (dashed circle) was not confirmed in the replicate dye-swapped hybridization; thus, it was classified as nonvariant. D, Use of RP11-159F11 as a FISH probe, showing increased signal intensity on a chromosome 8 homologue in metaphase cells, which resolves to dual signals in interphase cells, confirming the presence of a duplication of this region. An overlapping but nonidentical duplication of 8p23.2 was also observed in GM17051.
Figure  4
Figure 4
CNPs detected on chromosome 15. For each of the 47 individuals studied, relative duplications (green) and deletions (red) are represented, with invariant BACs shown in gray. The different sizes of the deletions and duplications in the three patients with known copy-number changes in the 15q11-q13 region (Locke et al. 2004) are clearly visible. Note that, as expected, none of the 47 normal individuals (94 chromosomes) in our population survey showed variation over these regions associated with disease. Also shown are the locations of segmental duplications >10 kb in size with >95% identity (pink) and of rearrangement hotspots represented on the array (blue), which illustrates the targeted nature of our array. Each tick mark represents 5 Mb, with gaps in the sequence assembly represented by gray bars.
Figure  5
Figure 5
Genomewide map of sites of copy-number variation detected by array CGH. The location of CNPs are indicated by colored bars on each chromosome. Relative deletions identified in a single individual are shown in orange, and deletions observed in multiple individuals are shown in red. Relative duplications identified in a single individual are shown in green, and duplications observed in multiple individuals are shown in blue. CNPs that showed both relative deletion and relative duplication in different individuals are shown in purple (not to scale). Also shown are the locations of segmental duplications >10 kb in size with >95% identity (pink bars above each chromosome). Each tick mark represents 20 Mb, with gaps in the sequence assembly represented by gray bars.

Similar articles

Cited by

References

Web Resources

    1. Chromosome Anomaly Register, http://www.som.soton.ac.uk/research/geneticsdiv/anomaly%20register/
    1. Structural Variation Database, http://paralogy.gs.washington.edu/structuralvariation
    1. UCSC Human Genome Browser, http://genome.ucsc.edu/

References

    1. Amos-Landgraf JM, Ji Y, Gottlieb W, Depinet T, Wandstrat AE, Cassidy SB, Driscoll DJ, Rogan PK, Schwartz S, Nicholls RD (1999) Chromosome breakage in the Prader-Willi and Angelman syndromes involves recombination between large, transcribed repeats at proximal and distal breakpoints. Am J Hum Genet 65:370–386 - PMC - PubMed
    1. Armengol L, Pujana MA, Cheung J, Scherer SW, Estivill X (2003) Enrichment of segmental duplications in regions of breaks of synteny between the human and mouse genomes suggest their involvement in evolutionary rearrangements. Hum Mol Genet 12:2201–220810.1093/hmg/ddg223 - DOI - PubMed
    1. Bailey JA, Baertsch R, Kent WJ, Haussler D, Eichler EE (2004) Hotspots of mammalian chromosomal evolution. Genome Biol 5:R2310.1186/gb-2004-5-4-r23 - DOI - PMC - PubMed
    1. Bailey JA, Gu Z, Clark RA, Reinert K, Samonte RV, Schwartz S, Adams MD, Myers EW, Li PW, Eichler EE (2002) Recent segmental duplications in the human genome. Science 297:1003–100710.1126/science.1072047 - DOI - PubMed
    1. Barber JC, Reed CJ, Dahoun SP, Joyce CA (1999) Amplification of a pseudogene cassette underlies euchromatic variation of 16p at the cytogenetic level. Hum Genet 104:211–21810.1007/s004390050938 - DOI - PubMed

Publication types

LinkOut - more resources

-