featureCounts: an efficient general purpose program for assigning sequence reads to genomic features
- PMID: 24227677
- DOI: 10.1093/bioinformatics/btt656
featureCounts: an efficient general purpose program for assigning sequence reads to genomic features
Abstract
Motivation: Next-generation sequencing technologies generate millions of short sequence reads, which are usually aligned to a reference genome. In many applications, the key information required for downstream analysis is the number of reads mapping to each genomic feature, for example to each exon or each gene. The process of counting reads is called read summarization. Read summarization is required for a great variety of genomic analyses but has so far received relatively little attention in the literature.
Results: We present featureCounts, a read summarization program suitable for counting reads generated from either RNA or genomic DNA sequencing experiments. featureCounts implements highly efficient chromosome hashing and feature blocking techniques. It is considerably faster than existing methods (by an order of magnitude for gene-level summarization) and requires far less computer memory. It works with either single or paired-end reads and provides a wide range of options appropriate for different sequencing applications.
Availability and implementation: featureCounts is available under GNU General Public License as part of the Subread (http://subread.sourceforge.net) or Rsubread (http://www.bioconductor.org) software packages.
Similar articles
-
Fast and memory efficient approach for mapping NGS reads to a reference genome.J Bioinform Comput Biol. 2019 Apr;17(2):1950008. doi: 10.1142/S0219720019500082. J Bioinform Comput Biol. 2019. PMID: 31057068
-
The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads.Nucleic Acids Res. 2019 May 7;47(8):e47. doi: 10.1093/nar/gkz114. Nucleic Acids Res. 2019. PMID: 30783653 Free PMC article.
-
SCALCE: boosting sequence compression algorithms using locally consistent encoding.Bioinformatics. 2012 Dec 1;28(23):3051-7. doi: 10.1093/bioinformatics/bts593. Epub 2012 Oct 9. Bioinformatics. 2012. PMID: 23047557 Free PMC article.
-
Comparative analysis of algorithms for next-generation sequencing read alignment.Bioinformatics. 2011 Oct 15;27(20):2790-6. doi: 10.1093/bioinformatics/btr477. Epub 2011 Aug 19. Bioinformatics. 2011. PMID: 21856737
-
Girafe--an R/Bioconductor package for functional exploration of aligned next-generation sequencing reads.Bioinformatics. 2010 Nov 15;26(22):2902-3. doi: 10.1093/bioinformatics/btq531. Epub 2010 Sep 21. Bioinformatics. 2010. PMID: 20861030 Free PMC article.
Cited by
-
A disease-associated gene desert directs macrophage inflammation through ETS2.Nature. 2024 Jun;630(8016):447-456. doi: 10.1038/s41586-024-07501-1. Epub 2024 Jun 5. Nature. 2024. PMID: 38839969 Free PMC article.
-
Targeting dependency on a paralog pair of CBP/p300 against de-repression of KREMEN2 in SMARCB1-deficient cancers.Nat Commun. 2024 Jun 5;15(1):4770. doi: 10.1038/s41467-024-49063-w. Nat Commun. 2024. PMID: 38839769 Free PMC article.
-
A candidate sex determination locus in amphibians which evolved by structural variation between X- and Y-chromosomes.Nat Commun. 2024 Jun 5;15(1):4781. doi: 10.1038/s41467-024-49025-2. Nat Commun. 2024. PMID: 38839766 Free PMC article.
-
Mini-heterochromatin domains constrain the cis-regulatory impact of SVA transposons in human brain development and disease.Nat Struct Mol Biol. 2024 Jun 4. doi: 10.1038/s41594-024-01320-8. Online ahead of print. Nat Struct Mol Biol. 2024. PMID: 38834915
-
Identification of a longevity gene through evolutionary rate covariation of insect mito-nuclear genomes.Nat Aging. 2024 Jun 4. doi: 10.1038/s43587-024-00641-z. Online ahead of print. Nat Aging. 2024. PMID: 38834883
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources