Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2017 Dec 15;45(22):12611-12624.
doi: 10.1093/nar/gkx1074.

Conserved non-coding elements: developmental gene regulation meets genome organization

Affiliations
Review

Conserved non-coding elements: developmental gene regulation meets genome organization

Dimitris Polychronopoulos et al. Nucleic Acids Res. .

Abstract

Comparative genomics has revealed a class of non-protein-coding genomic sequences that display an extraordinary degree of conservation between two or more organisms, regularly exceeding that found within protein-coding exons. These elements, collectively referred to as conserved non-coding elements (CNEs), are non-randomly distributed across chromosomes and tend to cluster in the vicinity of genes with regulatory roles in multicellular development and differentiation. CNEs are organized into functional ensembles called genomic regulatory blocks-dense clusters of elements that collectively coordinate the expression of shared target genes, and whose span in many cases coincides with topologically associated domains. CNEs display sequence properties that set them apart from other sequences under constraint, and have recently been proposed as useful markers for the reconstruction of the evolutionary history of organisms. Disruption of several of these elements is known to contribute to diseases linked with development, and cancer. The emergence, evolutionary dynamics and functions of CNEs still remain poorly understood, and new approaches are required to enable comprehensive CNE identification and characterization. Here, we review current knowledge and identify challenges that need to be tackled to resolve the impasse in understanding extreme non-coding conservation.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
The phenomenon of extreme non-coding conservation. A conserved CNE (Human–Tetraodon CNE, on the left) shown here is more conserved than a protein-coding sequence (HIST1H4D, on the right). The multiple sequence alignment of 46 vertebrate species and the corresponding phyloP scores illustrate the evolutionary conservation of the CNE and protein-coding sequence. PhyloP scores range from negative to positive scores (red to blue) and indicate positive and negative selective pressure respectively. The 46-way alignment was downloaded from the UCSC genome browser and spans ∼600 million years of evolution since the last common ancestor of humans and lampreys.
Figure 2.
Figure 2.
(A) The GRB model. The regulatory input for one or more target genes (red) is provided by long-range interactions (dashed lines) between CNEs (green) and the target gene’s promoter. Bystander genes (gray) often contain CNEs in their introns but remain unresponsive to CNE-mediated regulation. For more details, see text, (B) CNE clustering across chromosome 15 and at the MEIS2 locus (shown zoomed in below the whole chromosome track). The human MEIS2 GRB (brown) is a 3.3 Mb region defined by an array of conserved non-coding elements. MEIS2 (red) encodes a transcription factor involved in lens development through regulation of PAX6. Regardless of species used in the pairwise comparison against human, CNEs (black) clearly mark the boundaries of this GRB. The boundaries of the topologically associated domain (TAD) covering MEIS2 (TAD in blue; H1-ESC TAD calls are generated using HMM_calls) and the GRB spanning this locus are highly concordant.
Figure 3.
Figure 3.
Sequence heatmaps showing dinucleotide content within and outside vertebrate CNEs. Plots are generated using heatmaps package (https://bioconductor.org/packages/release/bioc/html/heatmaps.html). CNEs which show sequence identity >98% for >50 nt between human and chicken are identified using CNEr (https://bioconductor.org/packages/release/bioc/html/CNEr.html). Sequences are ordered from shortest to longest on the Y-axis (aligned on the center) and X-axis shows distance in nucleotides from the center of each CNE.
Figure 4.
Figure 4.
Methodology describing how CNEs are utilized as markers for constructing phylogenies (adopted and modified with permission from Brant Faircloth and John McCormack). Panels (A-G) describe the steps for constructing phylogenies starting from CNE sequences to generating species trees.

Similar articles

Cited by

References

    1. Yaffe D., Nudel U., Mayer Y., Neuman S.. Highly conserved sequences in the 3’ untranslated region of mRNAs coding for homologous proteins in distantly related species. Nucleic Acids Res. 1985; 13:3723–3737. - PMC - PubMed
    1. Lemaire C., Heilig R., Mandel J.L.. The chicken dystrophin cDNA: striking conservation of the C-terminal coding and 3′ untranslated regions between man and chicken. EMBO J. 1988; 7:4157–4162. - PMC - PubMed
    1. Hraba-Renevey S., Kress M.. Expression of a mouse replacement histone H3.3 gene with a highly conserved 3′ noncoding region during SV40- and polyoma-induced Go to S-phase transition. Nucleic Acids Res. 1989; 17:2449–2461. - PMC - PubMed
    1. Kajimoto Y., Rotwein P.. Structure of the chicken insulin-like growth factor I gene reveals conserved promoter elements. J. Biol. Chem. 1991; 266:9724–9731. - PubMed
    1. Rouault J.P., Samarut C., Duret L., Tessa C., Samarut J., Magaud J.P.. Sequence analysis reveals that the BTG1 anti-proliferative gene is conserved throughout evolution in its coding and 3′ non-coding regions. Gene. 1993; 129:303–306. - PubMed
-