Statistical significance for genomewide studies

doi:10.1073/pnas.1530509100

. 2003 Aug 5;100(16):9440-5.

doi: 10.1073/pnas.1530509100. Epub 2003 Jul 25.

Statistical significance for genomewide studies

John D Storey¹, Robert Tibshirani

Affiliations

PMID: 12883005
PMCID: PMC170937
DOI: 10.1073/pnas.1530509100

Statistical significance for genomewide studies

John D Storey et al. Proc Natl Acad Sci U S A. 2003.

. 2003 Aug 5;100(16):9440-5.

doi: 10.1073/pnas.1530509100. Epub 2003 Jul 25.

Authors

John D Storey¹, Robert Tibshirani

Affiliation

¹ Department of Biostatistics, University of Washington, Seattle, WA 98195, USA. jstorey@u.washington.edu

PMID: 12883005
PMCID: PMC170937
DOI: 10.1073/pnas.1530509100

Abstract

With the increase in genomewide experiments and the sequencing of multiple genomes, the analysis of large data sets has become commonplace in biology. It is often the case that thousands of features in a genomewide data set are tested against some null hypothesis, where a number of features are expected to be significant. Here we propose an approach to measuring statistical significance in these genomewide studies based on the concept of the false discovery rate. This approach offers a sensible balance between the number of true and false positives that is automatically calibrated and easily interpreted. In doing so, a measure of statistical significance called the q value is associated with each tested feature. The q value is similar to the well known p value, except it is a measure of significance in terms of the false discovery rate rather than the false positive rate. Our approach avoids a flood of false positive results, while offering a more liberal criterion than what has been used in genome scans for linkage.

PubMed Disclaimer

Figures

**Fig. 1.**
A density histogram of the 3,170 p values from the Hedenfalk *et al.* (14) data. The dashed line is the density histogram we would expect if all genes were null (not differentially expressed). The dotted line is at the height of our estimate of the proportion of null p values.

**Fig. 2.**
Results from the Hedenfalk *et al.* (14) data. (a) The q values of the genes versus their respective t statistics. (b) The q values versus their respective p values. (c) The number of genes occurring on the list up through each q value versus the respective q value. (d) The expected number of false positive genes versus the total number of significant genes given by the q values.

**Fig. 3.**
The versus λ for the data of Hedenfalk *et al.* (14). The solid line is a natural cubic spline fit to these points to estimate .

formula image — **Fig. 3.**
The versus λ for the data of Hedenfalk *et al.* (14). The solid line is a natural cubic spline fit to these points to estimate .

See this image and copyright information in PMC

Cited by

Peripheral Immune Cells Contribute to the Pathogenesis of Alzheimer's Disease.
Zhang H, Cao F, Zhou Y, Wu B, Li C. Zhang H, et al. Mol Neurobiol. 2024 Jun 6. doi: 10.1007/s12035-024-04266-6. Online ahead of print. Mol Neurobiol. 2024. PMID: 38842674
Deep learning-based pathway-centric approach to characterize recurrent hepatocellular carcinoma after liver transplantation.
To J, Ghosh S, Zhao X, Pasini E, Fischer S, Sapisochin G, Ghanekar A, Jaeckel E, Bhat M. To J, et al. Hum Genomics. 2024 Jun 5;18(1):58. doi: 10.1186/s40246-024-00624-6. Hum Genomics. 2024. PMID: 38840185 Free PMC article.
Exploring the causal association between genetically determined circulating metabolome and hemorrhagic stroke.
Wang Y, Shen Y, Li Q, Xu H, Gao A, Li K, Rong Y, Gao S, Liang H, Zhang X. Wang Y, et al. Front Nutr. 2024 May 15;11:1376889. doi: 10.3389/fnut.2024.1376889. eCollection 2024. Front Nutr. 2024. PMID: 38812939 Free PMC article.
Causal effect between gut microbiota and metabolic syndrome in European population: a bidirectional mendelian randomization study.
Yan J, Wang Z, Bao G, Xue C, Zheng W, Fu R, Zhang M, Ding J, Yang F, Sun B. Yan J, et al. Cell Biosci. 2024 May 28;14(1):67. doi: 10.1186/s13578-024-01232-6. Cell Biosci. 2024. PMID: 38807189 Free PMC article.
A brain-enriched circular RNA controls excitatory neurotransmission and restricts sensitivity to aversive stimuli.
Giusti SA, Pino NS, Pannunzio C, Ogando MB, Armando NG, Garrett L, Zimprich A, Becker L, Gimeno ML, Lukin J, Merino FL, Pardi MB, Pedroncini O, Di Mauro GC, Durner VG, Fuchs H, de Angelis MH, Patop IL, Turck CW, Deussing JM, Vogt Weisenhorn DM, Jahn O, Kadener S, Hölter SM, Brose N, Giesert F, Wurst W, Marin-Burgin A, Refojo D. Giusti SA, et al. Sci Adv. 2024 May 24;10(21):eadj8769. doi: 10.1126/sciadv.adj8769. Epub 2024 May 24. Sci Adv. 2024. PMID: 38787942 Free PMC article.

See all "Cited by" articles

References

1. Morton, N. E. (1955) Am. J. Hum. Gen. 7, 277–318. - PMC - PubMed
1. Lander, E. S. & Kruglyak, L. (1995) Nat. Genet. 11, 241–247. - PubMed
1. Storey, J. D. (2003) Ann. Stat., in press.
1. Storey, J. D. (2002) J. R. Stat. Soc. B 64, 479–498.
1. Benjamini, Y. & Hochberg, Y. (1995) J. R. Stat. Soc. B 85, 289–300.

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Other Literature Sources
- H1 Connect
- The Lens - Patent Citations

[1] Morton, N. E. (1955) Am. J. Hum. Gen. 7, 277–318. - PMC - PubMed

[2] Morton, N. E. (1955) Am. J. Hum. Gen. 7, 277–318. - PMC - PubMed

[3] Lander, E. S. & Kruglyak, L. (1995) Nat. Genet. 11, 241–247. - PubMed

[4] Lander, E. S. & Kruglyak, L. (1995) Nat. Genet. 11, 241–247. - PubMed

[5] Storey, J. D. (2003) Ann. Stat., in press.

[6] Storey, J. D. (2003) Ann. Stat., in press.

[7] Storey, J. D. (2002) J. R. Stat. Soc. B 64, 479–498.

[8] Storey, J. D. (2002) J. R. Stat. Soc. B 64, 479–498.

[9] Benjamini, Y. & Hochberg, Y. (1995) J. R. Stat. Soc. B 85, 289–300.

[10] Benjamini, Y. & Hochberg, Y. (1995) J. R. Stat. Soc. B 85, 289–300.

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Statistical significance for genomewide studies

Affiliation

Statistical significance for genomewide studies

Authors

Affiliation

Abstract

Figures

Similar articles

Cited by

References

MeSH terms

LinkOut - more resources

Full Text Sources

Other Literature Sources

Abstract

Figures

Similar articles

Cited by

References

MeSH terms

Related information

LinkOut - more resources

Full Text Sources

Other Literature Sources