Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2005 Feb;169(2):1157-64.
doi: 10.1534/genetics.104.037051. Epub 2005 Jan 16.

Rapid subfunctionalization accompanied by prolonged and substantial neofunctionalization in duplicate gene evolution

Affiliations

Rapid subfunctionalization accompanied by prolonged and substantial neofunctionalization in duplicate gene evolution

Xionglei He et al. Genetics. 2005 Feb.

Abstract

Gene duplication is the primary source of new genes. Duplicate genes that are stably preserved in genomes usually have divergent functions. The general rules governing the functional divergence, however, are not well understood and are controversial. The neofunctionalization (NF) hypothesis asserts that after duplication one daughter gene retains the ancestral function while the other acquires new functions. In contrast, the subfunctionalization (SF) hypothesis argues that duplicate genes experience degenerate mutations that reduce their joint levels and patterns of activity to that of the single ancestral gene. We here show that neither NF nor SF alone adequately explains the genome-wide patterns of yeast protein interaction and human gene expression for duplicate genes. Instead, our analysis reveals rapid SF, accompanied by prolonged and substantial NF in a large proportion of duplicate genes, suggesting a new model termed subneofunctionalization (SNF). Our results demonstrate that enormous numbers of new functions have originated via gene duplication.

PubMed Disclaimer

Figures

F<sc>igure</sc> 1.—
Figure 1.—
Evolutionary models of functional divergence between duplicate genes. The three neofunctionalization (NF) models differ in the number of ancestral functions retained by the gene that acquires novel functions. The newly proposed subneofunctionalization (SNF) model is a combination of NF and subfunctionalization (SF). Duplicate genes are depicted by open circles and different gene functions are shown by solid squares. Dotted lines link genes with their functions. In this article, we analyze functions of duplicate genes by their protein interaction partners and expression sites.
F<sc>igure</sc> 2.—
Figure 2.—
Number of protein interaction partners of yeast duplicate and singleton genes. (a) Mean number of partners (t) for duplicate pairs with different dS. The error bar shows one standard error of the mean. The dashed line shows the average number of partners per singleton (A) and the dotted line shows the average number per random pair of singletons (T). There are 17, 15, 70, and 229 duplicate pairs in the four bins, respectively. (b) Frequency distributions of the number of interaction partners for singletons (open bars), singleton pairs (shaded bars), and duplicates (solid bars). (c) Frequency distributions of the number of shared partners between singleton pairs (open bars), duplicates with dS < 20 (shaded bars), and duplicates with dS > 20 (solid bars). (d) Difference in the number of partners between duplicates. The mean max(a1, a2), min(a1, a2), and δ = |a1a2| for duplicates are shown in open, shaded, and solid bars, respectively, with their corresponding mean values from random singleton pairs shown by the dotted, dashed, and solid lines, respectively. For any given pair of duplicates, a1 and a2 are the numbers of partners that they each have.
F<sc>igure</sc> 3.—
Figure 3.—
Number of expression sites of human duplicate and singleton genes. (a) Mean number of expression sites (t, squares) and mean number of shared expression sites (s, circles) for duplicate pairs with different dS. The 1230 duplicate gene pairs are ranked by their dS. Each square (or circle) represents the median dS and mean t (or mean s) for 100 gene pairs, with the exception of the last square (or circle), which is derived from 630 gene pairs. The error bar shows one standard error of the mean. The dashed line shows the average number of expression sites per singleton (A) and the dotted line shows the average number per random pair of singletons (T). The solid line shows the average number of shared expression sites per random pair of singletons. (b) Difference in the number of expression sites between duplicates. The mean δ = |a1a2| and min(a1, a2) for duplicates are shown in squares and circles, respectively, with their corresponding mean values from singleton pairs shown by the dotted and dashed lines, respectively. For any given pair of duplicates, a1 and a2 are the numbers of expression sites that they each have.

Similar articles

Cited by

References

    1. Davis, J. C., and D. A. Petrov, 2004. Preferential duplication of conserved proteins in eukaryotic genomes. PLoS Biol. 2: 318–326. - PMC - PubMed
    1. Force, A., M. Lynch, F. B. Pickett, A. Amores, Y. L. Yan et al., 1999. Preservation of duplicate genes by complementary, degenerative mutations. Genetics 151: 1531–1545. - PMC - PubMed
    1. Fraser, H. B., A. E. Hirsh, L. M. Steinmetz, C. Scharfe and M. W. Feldman, 2002. Evolutionary rate in the protein interaction network. Science 296: 750–752. - PubMed
    1. Gu, Z., D. Nicolae, H. H.-S. Lu and W. H. Li, 2002. Rapid divergence in expression between duplicate genes inferred from microarray data. Trends Genet. 18: 609–613. - PubMed
    1. Hahn, M. W., G. C. Conant and A. Wagner, 2004. Molecular evolution in large genetic networks: Does connectivity equal constraint? J. Mol. Evol. 58: 203–211. - PubMed

Publication types

-