Genealogical Working Distributions for Bayesian Model Testing with Phylogenetic Uncertainty
- PMID: 26526428
- PMCID: PMC5009437
- DOI: 10.1093/sysbio/syv083
Abstract
Marginal likelihood estimates to compare models using Bayes factors frequently accompany Bayesian phylogenetic inference. Approaches to estimate marginal likelihoods have garnered increased attention over the past decade. In particular, the introduction of path sampling (PS) and stepping-stone sampling (SS) into Bayesian phylogenetics has tremendously improved the accuracy of model selection. These sampling techniques are now used to evaluate complex evolutionary and population genetic models on empirical data sets, but considerable computational demands hamper their widespread adoption. Further, when very diffuse but proper priors are specified for model parameters, numerical issues complicate the exploration of the priors, a necessary step in marginal likelihood estimation using PS or SS. To avoid such instabilities, generalized SS (GSS) has recently been proposed, introducing the concept of "working distributions" to facilitate, or shorten, the integration process that underlies marginal likelihood estimation. However, the need to fix the tree topology currently limits GSS in a coalescent-based framework. Here, we extend GSS by relaxing the fixed underlying tree topology assumption. To this end, we introduce a "working" distribution on the space of genealogies, which enables estimating marginal likelihoods while accommodating phylogenetic uncertainty. We propose two different "working" distributions that help GSS to outperform PS and SS in terms of accuracy when comparing demographic and evolutionary models applied to synthetic data and real-world examples. Further, we show that the use of very diffuse priors can lead to a considerable overestimation in marginal likelihood when using PS and SS, while still retrieving the correct marginal likelihood using both GSS approaches. The methods used in this article are available in BEAST, a powerful user-friendly software package to perform Bayesian evolutionary analyses.
Keywords: Bayes factor; Bayesian inference; MCMC; Working distribution; coalescent model; marginal likelihood; phylogenetics.
© The Author(s) 2015. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
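To make the role of a working distribution concrete, the following is a minimal sketch on a toy conjugate model, not the genealogical method of the article and not BEAST code. It assumes data y_i ~ Normal(mu, 1) with prior mu ~ Normal(0, tau^2), so that the marginal likelihood is known in closed form and every distribution along the path can be sampled exactly instead of by MCMC. All function and variable names (`gss_log_ml`, `exact_log_ml`, `m_q`, `s_q`) are hypothetical. Setting the working distribution equal to the prior gives ordinary SS; a working distribution fitted to posterior draws gives a GSS-style estimate.

```python
import numpy as np

def log_norm_pdf(x, mean, sd):
    """Log density of Normal(mean, sd) evaluated at x."""
    return -0.5 * ((x - mean) / sd) ** 2 - np.log(sd) - 0.5 * np.log(2.0 * np.pi)

def log_mean_exp(a):
    """Numerically stable log of the mean of exp(a)."""
    m = np.max(a)
    return m + np.log(np.mean(np.exp(a - m)))

def exact_log_ml(y, tau):
    """Closed-form log marginal likelihood: y ~ Normal(0, I + tau^2 * 1 1^T)."""
    n = y.size
    quad = np.sum(y ** 2) - tau ** 2 * np.sum(y) ** 2 / (1.0 + n * tau ** 2)
    return -0.5 * (n * np.log(2.0 * np.pi) + np.log(1.0 + n * tau ** 2) + quad)

def gss_log_ml(y, tau, m_q, s_q, betas, n_draws, rng):
    """Stepping-stone estimate along the path [L(mu) p(mu)]^beta * q(mu)^(1 - beta),
    where q = Normal(m_q, s_q) is the working distribution (assumed normalized).
    With q equal to the prior this reduces to ordinary stepping-stone sampling."""
    n, s = y.size, np.sum(y)
    total = 0.0
    for b_prev, b_next in zip(betas[:-1], betas[1:]):
        # In this toy model the path distribution at b_prev is itself normal,
        # so it is sampled exactly (a real analysis would use MCMC here).
        prec = b_prev * (n + 1.0 / tau ** 2) + (1.0 - b_prev) / s_q ** 2
        mean = (b_prev * s + (1.0 - b_prev) * m_q / s_q ** 2) / prec
        mu = rng.normal(mean, 1.0 / np.sqrt(prec), size=n_draws)
        log_lik = log_norm_pdf(y[None, :], mu[:, None], 1.0).sum(axis=1)
        log_ratio = log_lik + log_norm_pdf(mu, 0.0, tau) - log_norm_pdf(mu, m_q, s_q)
        # One "stone": log of the average importance weight between neighbours.
        total += log_mean_exp((b_next - b_prev) * log_ratio)
    return total

rng = np.random.default_rng(1)
y = rng.normal(1.0, 1.0, size=20)   # synthetic data, true mean 1
tau = 1.0

# Ordinary SS: working distribution = prior, powers skewed towards the prior end.
betas_ss = np.linspace(0.0, 1.0, 33) ** (1.0 / 0.3)
ss = gss_log_ml(y, tau, 0.0, tau, betas_ss, 2000, rng)

# GSS-style: working distribution matched to the moments of posterior draws.
post_prec = y.size + 1.0 / tau ** 2
post = rng.normal(np.sum(y) / post_prec, 1.0 / np.sqrt(post_prec), size=2000)
gss = gss_log_ml(y, tau, post.mean(), post.std(), np.linspace(0.0, 1.0, 9), 2000, rng)

print("exact:", exact_log_ml(y, tau), "SS:", ss, "GSS:", gss)
```

In this sketch the GSS path starts from a distribution that already resembles the posterior, so the importance ratios vary little and few path steps are needed; that is the sense in which a working distribution can shorten the integration. The toy model says nothing about tree space, where constructing a suitable working distribution over genealogies is the contribution of the article.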
Figures
![Figure 1.](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/5009437/bin/syv083f1.gif)
![Figure 2.](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/5009437/bin/syv083f2.gif)
![Figure 3.](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/5009437/bin/syv083f3.gif)
![Figure 4.](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/5009437/bin/syv083f4.gif)
![Figure 5.](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/5009437/bin/syv083f5.gif)
![Figure 6.](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/5009437/bin/syv083f6.gif)