HISAT: a fast spliced aligner with low memory requirements
- PMID: 25751142
- PMCID: PMC4655817
- DOI: 10.1038/nmeth.3317
HISAT: a fast spliced aligner with low memory requirements
Abstract
HISAT (hierarchical indexing for spliced alignment of transcripts) is a highly efficient system for aligning reads from RNA sequencing experiments. HISAT uses an indexing scheme based on the Burrows-Wheeler transform and the Ferragina-Manzini (FM) index, employing two types of indexes for alignment: a whole-genome FM index to anchor each alignment and numerous local FM indexes for very rapid extensions of these alignments. HISAT's hierarchical index for the human genome contains 48,000 local FM indexes, each representing a genomic region of ∼64,000 bp. Tests on real and simulated data sets showed that HISAT is the fastest system currently available, with equal or better accuracy than any other method. Despite its large number of indexes, HISAT requires only 4.3 gigabytes of memory. HISAT supports genomes of any size, including those larger than 4 billion bases.
Figures
![Figure 1](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/4655817/bin/nihms736708f1.gif)
![Figure 2](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/4655817/bin/nihms736708f2.gif)
![Figure 3](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/4655817/bin/nihms736708f3.gif)
![Figure 4](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/4655817/bin/nihms736708f4.gif)
Similar articles
-
Rapid and accurate alignment of nucleotide conversion sequencing reads with HISAT-3N.Genome Res. 2021 Jul;31(7):1290-1295. doi: 10.1101/gr.275193.120. Epub 2021 Jun 8. Genome Res. 2021. PMID: 34103331 Free PMC article.
-
Mapping RNA-seq reads to transcriptomes efficiently based on learning to hash method.Comput Biol Med. 2020 Jan;116:103539. doi: 10.1016/j.compbiomed.2019.103539. Epub 2019 Nov 13. Comput Biol Med. 2020. PMID: 31765913 Review.
-
Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype.Nat Biotechnol. 2019 Aug;37(8):907-915. doi: 10.1038/s41587-019-0201-4. Epub 2019 Aug 2. Nat Biotechnol. 2019. PMID: 31375807 Free PMC article.
-
CUSHAW: a CUDA compatible short read aligner to large genomes based on the Burrows-Wheeler transform.Bioinformatics. 2012 Jul 15;28(14):1830-7. doi: 10.1093/bioinformatics/bts276. Epub 2012 May 9. Bioinformatics. 2012. PMID: 22576173
-
YOABS: yet other aligner of biological sequences--an efficient linearly scaling nucleotide aligner.Bioinformatics. 2012 Apr 15;28(8):1070-7. doi: 10.1093/bioinformatics/bts102. Epub 2012 Mar 7. Bioinformatics. 2012. PMID: 22402614
Cited by
-
CmERF1 acts as a positive regulator of fruits and leaves growth in melon (Cucumis melo L.).Plant Mol Biol. 2024 Jun 6;114(3):70. doi: 10.1007/s11103-024-01468-3. Plant Mol Biol. 2024. PMID: 38842600
-
Multiomics analysis of platelet-rich plasma promoting biological performance of mesenchymal stem cells.BMC Genomics. 2024 Jun 5;25(1):564. doi: 10.1186/s12864-024-10329-8. BMC Genomics. 2024. PMID: 38840037 Free PMC article.
-
Senescent glia link mitochondrial dysfunction and lipid accumulation.Nature. 2024 Jun;630(8016):475-483. doi: 10.1038/s41586-024-07516-8. Epub 2024 Jun 5. Nature. 2024. PMID: 38839958 Free PMC article.
-
A single-cell transcriptome atlas of human euploid and aneuploid blastocysts.Nat Genet. 2024 Jun 5. doi: 10.1038/s41588-024-01788-6. Online ahead of print. Nat Genet. 2024. PMID: 38839885
-
Chromosome-level genome assembly of the snakefly Mongoloraphidia duomilia (Raphidioptera: Raphidiidae).Sci Data. 2024 Jun 4;11(1):579. doi: 10.1038/s41597-024-03439-1. Sci Data. 2024. PMID: 38834590 Free PMC article.
References
-
- Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat. Methods. 2008;5:621–628. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources