Moderated statistical tests for assessing differences in tag abundance
- PMID: 17881408
- DOI: 10.1093/bioinformatics/btm453
Abstract
Motivation: Digital gene expression (DGE) technologies measure gene expression by counting sequence tags. They are sensitive technologies for measuring gene expression on a genomic scale, without the need for prior knowledge of the genome sequence. As the cost of sequencing DNA decreases, the number of DGE datasets is expected to grow dramatically. Various tests of differential expression have been proposed for replicated DGE data using binomial, Poisson, negative binomial or pseudo-likelihood (PL) models for the counts, but none of these are usable when the number of replicates is very small.
Results: We develop tests using the negative binomial distribution to model overdispersion relative to the Poisson, and use conditional weighted likelihood to moderate the level of overdispersion across genes. Not only is our strategy applicable even with the smallest number of libraries, but it also proves to be more powerful than previous strategies when more libraries are available. The methodology is equally applicable to other counting technologies, such as proteomic spectral counts.
Availability: An R package can be accessed from http://bioinf.wehi.edu.au/resources/
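Illustration: the abstract does not state how the negative binomial is parameterized. The following is a minimal sketch, assuming the common mean-dispersion form in which a count with mean mu and dispersion phi has variance mu + phi * mu^2, so that phi = 0 recovers the Poisson. The function names `nb_logpmf` and `nb_moments` are hypothetical and are not taken from the R package; the numbers are made up for illustration.

```python
import math


def nb_logpmf(y, mu, phi):
    """Log-probability of count y under a negative binomial with mean mu
    and dispersion phi (assumed parameterization: var = mu + phi * mu**2)."""
    if phi <= 0:
        # Poisson limit: no overdispersion.
        return y * math.log(mu) - mu - math.lgamma(y + 1)
    r = 1.0 / phi  # "size" parameter of the negative binomial
    return (
        math.lgamma(y + r) - math.lgamma(r) - math.lgamma(y + 1)
        + r * math.log(r / (r + mu))
        + y * math.log(mu / (r + mu))
    )


def nb_moments(mu, phi, y_max=5000):
    """Numerically sum the pmf over 0..y_max to get the mean and variance."""
    probs = [math.exp(nb_logpmf(y, mu, phi)) for y in range(y_max + 1)]
    mean = sum(y * p for y, p in enumerate(probs))
    var = sum((y - mean) ** 2 * p for y, p in enumerate(probs))
    return mean, var


# Same mean, with and without overdispersion.
poisson_mean, poisson_var = nb_moments(mu=10.0, phi=0.0)
nb_mean, nb_var = nb_moments(mu=10.0, phi=0.2)
print(f"Poisson: mean={poisson_mean:.3f} var={poisson_var:.3f}")
print(f"NB:      mean={nb_mean:.3f} var={nb_var:.3f}")
```

Under this assumed form, the dispersion phi is the quantity that the abstract describes as being moderated across genes; the sketch shows only how phi inflates the variance relative to the Poisson, not the conditional weighted likelihood estimation itself.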