Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2019 Nov 27;20(6):2044-2054.
doi: 10.1093/bib/bby067.

Interpretation of differential gene expression results of RNA-seq data: review and integration

Affiliations
Review

Interpretation of differential gene expression results of RNA-seq data: review and integration

Adam McDermaid et al. Brief Bioinform. .

Abstract

Differential gene expression (DGE) analysis is one of the most common applications of RNA-sequencing (RNA-seq) data. This process allows for the elucidation of differentially expressed genes across two or more conditions and is widely used in many applications of RNA-seq data analysis. Interpretation of the DGE results can be nonintuitive and time consuming due to the variety of formats based on the tool of choice and the numerous pieces of information provided in these results files. Here we reviewed DGE results analysis from a functional point of view for various visualizations. We also provide an R/Bioconductor package, Visualization of Differential Gene Expression Results using R, which generates information-rich visualizations for the interpretation of DGE results from three widely used tools, Cuffdiff, DESeq2 and edgeR. The implemented functions are also tested on five real-world data sets, consisting of one human, one Malus domestica and three Vitis riparia data sets.

Keywords: R/Bioconductor package; bioinformatics tools; differential gene expression analysis; differentially expressed genes; visualization and interpretation.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Treatment distributions visualization generated by the ViDGER package using a DESeq2 data set.
Figure 2
Figure 2
Scatter plot of normalized read counts generated by the ViDGER package using a DESeq2 data set.
Figure 3
Figure 3
Heatmap of DEG counts by comparison generated by the ViDGER package using a DESeq2 data set with an adjusted P-value cutoff of 0.05 for classification as differentially expressed.
Figure 4
Figure 4
MA plot displaying the log fold-change compared with mean expression generated by the ViDGER package using a DESeq2 data set, with default log fold-change thresholds of −1 and 1.
Figure 5
Figure 5
Volcano plot generated by the ViDGER package using a DESeq2 data set, with default log fold-change thresholds of −1 and 1 and an adjusted P-value threshold of 0.05.
Figure 6
Figure 6
Four-way plot generated by the ViDGER package using a DESeq2 data set, with default log fold-change thresholds of −1 and 1.
Figure 7
Figure 7
(A) Boxplot generation of RNA-seq data using vsBoxplot; (B) scatter plot generation using vsScatterPlot; (C) differential gene expression matrix using vsDEGMatrix; (D) MA plot generation using vsMAPlot; (E) volcano plot generation using vsVolcano; (F) four-way plot generation using vsFourWay. Arrow and text color refer to visualizations generated using Cuffdiff data (black), DESeq2 data (blue) and edgeR data (red).
Figure 8
Figure 8
Matrix of all pairwise scatter plots showing normalized expression values generated by the ViDGER package using a DESeq2 data set. In addition to the pairwise scatter plots, density plots are provided along the diagonal and pairwise correlation values are provided in the opposite half of the matrix.
Figure 9
Figure 9
Matrix of all pairwise MA plots showing log fold-change compared with mean expression value generated by the ViDGER package using a DESeq2 data set, with default log fold-change thresholds of −1 and 1.
Figure 10
Figure 10
Matrix of all pairwise Volcano plots showing log fold-change versus adjusted P-value generated by the ViDGER package using a DESeq2 data set, with default log fold-change thresholds of −1 and 1 and an adjusted P-value threshold of 0.05.

Similar articles

Cited by

References

    1. Goodwin S, McPherson JD, McCombie WR. Coming of age: ten years of next-generation sequencing technologies. Nat Rev Genet 2016;17:333–51. - PMC - PubMed
    1. Marioni JC, Mason CE, Mane SM, et al. . RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. Genome Res 2008;18:1509–17. - PMC - PubMed
    1. Miller JA, Menon V, Goldy J, et al. . Improving reliability and absolute quantification of human brain microarray data by filtering and scaling probes using RNA-Seq. BMC Genomics 2014;15:154. - PMC - PubMed
    1. Nagalakshmi U, Wang Z, Waern K, et al. . The transcriptional landscape of the yeast genome defined by RNA sequencing. Science 2008;320:1344–9. - PMC - PubMed
    1. Van Dijk EL, Auger H, Jaszczyszyn Y, et al. . Ten years of next-generation sequencing technology. Trends Genet 2014;30:418–26. - PubMed

Publication types

MeSH terms

-