Inference of population splits and mixtures from genome-wide allele frequency data
- PMID: 23166502
- PMCID: PMC3499260
- DOI: 10.1371/journal.pgen.1002967
Inference of population splits and mixtures from genome-wide allele frequency data
Abstract
Many aspects of the historical relationships between populations in a species are reflected in genetic data. Inferring these relationships from genetic data, however, remains a challenging task. In this paper, we present a statistical model for inferring the patterns of population splits and mixtures in multiple populations. In our model, the sampled populations in a species are related to their common ancestor through a graph of ancestral populations. Using genome-wide allele frequency data and a Gaussian approximation to genetic drift, we infer the structure of this graph. We applied this method to a set of 55 human populations and a set of 82 dog breeds and wild canids. In both species, we show that a simple bifurcating tree does not fully describe the data; in contrast, we infer many migration events. While some of the migration events that we find have been detected previously, many have not. For example, in the human data, we infer that Cambodians trace approximately 16% of their ancestry to a population ancestral to other extant East Asian populations. In the dog data, we infer that both the boxer and basenji trace a considerable fraction of their ancestry (9% and 25%, respectively) to wolves subsequent to domestication and that East Asian toy breeds (the Shih Tzu and the Pekingese) result from admixture between modern toy breeds and "ancient" Asian breeds. Software implementing the model described here, called TreeMix, is available at http://treemix.googlecode.com.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures
![Figure 1](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/3499260/bin/pgen.1002967.g001.gif)
![formula image](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/3499260/bin/pgen.1002967.e046.jpg)
![formula image](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/3499260/bin/pgen.1002967.e047.jpg)
![formula image](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/3499260/bin/pgen.1002967.e048.jpg)
![formula image](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/3499260/bin/pgen.1002967.e049.jpg)
![Figure 2](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/3499260/bin/pgen.1002967.g002.gif)
![Figure 3](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/3499260/bin/pgen.1002967.g003.gif)
![formula image](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/3499260/bin/pgen.1002967.e177.jpg)
![formula image](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/3499260/bin/pgen.1002967.e178.jpg)
![formula image](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/3499260/bin/pgen.1002967.e179.jpg)
![formula image](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/3499260/bin/pgen.1002967.e180.jpg)
![Figure 4](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/3499260/bin/pgen.1002967.g004.gif)
![formula image](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/3499260/bin/pgen.1002967.e187.jpg)
![Figure 5](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/3499260/bin/pgen.1002967.g005.gif)
![formula image](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/3499260/bin/pgen.1002967.e197.jpg)
![formula image](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/3499260/bin/pgen.1002967.e198.jpg)
![formula image](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/3499260/bin/pgen.1002967.e199.jpg)
![formula image](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/3499260/bin/pgen.1002967.e200.jpg)
![Figure 6](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/3499260/bin/pgen.1002967.g006.gif)
![formula image](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/3499260/bin/pgen.1002967.e201.jpg)
Similar articles
-
The genome-wide relationships of the critically endangered Quadricorna sheep in the Mediterranean region.PLoS One. 2023 Oct 18;18(10):e0291814. doi: 10.1371/journal.pone.0291814. eCollection 2023. PLoS One. 2023. PMID: 37851594 Free PMC article.
-
The ancestral origin of the critically endangered Quadricorna sheep as revealed by genome-wide analysis.PLoS One. 2022 Oct 26;17(10):e0275989. doi: 10.1371/journal.pone.0275989. eCollection 2022. PLoS One. 2022. Retraction in: PLoS One. 2022 Dec 7;17(12):e0279019. doi: 10.1371/journal.pone.0279019. PMID: 36288337 Free PMC article. Retracted.
-
Admixture and Ancestry Inference from Ancient and Modern Samples through Measures of Population Genetic Drift.Hum Biol. 2017 Jan;89(1):21-46. doi: 10.13110/humanbiology.89.1.02. Hum Biol. 2017. PMID: 29285965 Review.
-
Evolutionary genomics of dog domestication.Mamm Genome. 2012 Feb;23(1-2):3-18. doi: 10.1007/s00335-011-9386-7. Epub 2012 Jan 22. Mamm Genome. 2012. PMID: 22270221 Review.
-
The IGF1 small dog haplotype is derived from Middle Eastern grey wolves.BMC Biol. 2010 Feb 24;8:16. doi: 10.1186/1741-7007-8-16. BMC Biol. 2010. PMID: 20181231 Free PMC article.
Cited by
-
Chromosomal inversions from an initial ecotypic divergence drive a gradual repeated radiation of Galápagos beetles.Sci Adv. 2024 May 31;10(22):eadk7906. doi: 10.1126/sciadv.adk7906. Epub 2024 May 31. Sci Adv. 2024. PMID: 38820159 Free PMC article.
-
Adaptive divergence, historical population dynamics, and simulation of suitable distributions for Picea Meyeri and P. Mongolica at the whole-genome level.BMC Plant Biol. 2024 May 30;24(1):479. doi: 10.1186/s12870-024-05166-6. BMC Plant Biol. 2024. PMID: 38816690 Free PMC article.
-
Population genetics and phylogeographic history of the insular lizard Podarcis lilfordi (Gunther, 1874) from the Balearic Islands based on genome-wide polymorphic data.Ecol Evol. 2024 May 23;14(5):e11407. doi: 10.1002/ece3.11407. eCollection 2024 May. Ecol Evol. 2024. PMID: 38799398 Free PMC article.
-
Deep genome skimming reveals the hybrid origin of Pseudosasa gracilis (Poaceae: Bambusoideae).Plant Divers. 2023 Jun 7;46(3):344-352. doi: 10.1016/j.pld.2023.06.001. eCollection 2024 May. Plant Divers. 2023. PMID: 38798728 Free PMC article.
-
Solving the 250-year-old mystery of the origin and global spread of the German cockroach, Blattella germanica.Proc Natl Acad Sci U S A. 2024 May 28;121(22):e2401185121. doi: 10.1073/pnas.2401185121. Epub 2024 May 20. Proc Natl Acad Sci U S A. 2024. PMID: 38768340 Free PMC article.
References
-
- Felsenstein J (1982) How can we infer geography and history from gene frequencies? J Theor Biol 96: 9–20. - PubMed
-
- Cann RL, Stoneking M, Wilson AC (1987) Mitochondrial DNA and human evolution. Nature 325: 31–6. - PubMed
-
- Nei M, Roychoudhury AK (1993) Evolutionary relationships of human populations on a global scale. Mol Biol Evol 10: 927–43. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources