Identification of cell types from single-cell transcriptomes using a novel clustering method
- PMID: 25805722
- PMCID: PMC6280782
- DOI: 10.1093/bioinformatics/btv088
Identification of cell types from single-cell transcriptomes using a novel clustering method
Abstract
Motivation: The recent advance of single-cell technologies has brought new insights into complex biological phenomena. In particular, genome-wide single-cell measurements such as transcriptome sequencing enable the characterization of cellular composition as well as functional variation in homogenic cell populations. An important step in the single-cell transcriptome analysis is to group cells that belong to the same cell types based on gene expression patterns. The corresponding computational problem is to cluster a noisy high dimensional dataset with substantially fewer objects (cells) than the number of variables (genes).
Results: In this article, we describe a novel algorithm named shared nearest neighbor (SNN)-Cliq that clusters single-cell transcriptomes. SNN-Cliq utilizes the concept of shared nearest neighbor that shows advantages in handling high-dimensional data. When evaluated on a variety of synthetic and real experimental datasets, SNN-Cliq outperformed the state-of-the-art methods tested. More importantly, the clustering results of SNN-Cliq reflect the cell types or origins with high accuracy.
Availability and implementation: The algorithm is implemented in MATLAB and Python. The source code can be downloaded at http://bioinfo.uncc.edu/SNNCliq.
© The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Figures
Similar articles
-
Single cell clustering based on cell-pair differentiability correlation and variance analysis.Bioinformatics. 2018 Nov 1;34(21):3684-3694. doi: 10.1093/bioinformatics/bty390. Bioinformatics. 2018. PMID: 29771290
-
Identifying Cell Subpopulations and Their Genetic Drivers from Single-Cell RNA-Seq Data Using a Biclustering Approach.J Comput Biol. 2017 Jul;24(7):663-674. doi: 10.1089/cmb.2017.0049. J Comput Biol. 2017. PMID: 28657835 Free PMC article.
-
Single-Cell Clustering Based on Shared Nearest Neighbor and Graph Partitioning.Interdiscip Sci. 2020 Jun;12(2):117-130. doi: 10.1007/s12539-019-00357-4. Epub 2020 Feb 22. Interdiscip Sci. 2020. PMID: 32086753
-
Challenges in unsupervised clustering of single-cell RNA-seq data.Nat Rev Genet. 2019 May;20(5):273-282. doi: 10.1038/s41576-018-0088-9. Nat Rev Genet. 2019. PMID: 30617341 Review.
-
Computational approaches for high-throughput single-cell data analysis.FEBS J. 2019 Apr;286(8):1451-1467. doi: 10.1111/febs.14613. Epub 2018 Aug 30. FEBS J. 2019. PMID: 30058136 Review.
Cited by
-
Scalable and unbiased sequence-informed embedding of single-cell ATAC-seq data with CellSpace.Nat Methods. 2024 Jun;21(6):1014-1022. doi: 10.1038/s41592-024-02274-x. Epub 2024 May 9. Nat Methods. 2024. PMID: 38724693 Free PMC article.
-
scEWE: high-order element-wise weighted ensemble clustering for heterogeneity analysis of single-cell RNA-sequencing data.Brief Bioinform. 2024 Mar 27;25(3):bbae203. doi: 10.1093/bib/bbae203. Brief Bioinform. 2024. PMID: 38701413 Free PMC article.
-
Analysis and Visualization of Single-Cell Sequencing Data with Scanpy and MetaCell: A Tutorial.Methods Mol Biol. 2024;2757:383-445. doi: 10.1007/978-1-0716-3642-8_17. Methods Mol Biol. 2024. PMID: 38668977
-
Reconstruction of macroglia and adult neurogenesis evolution through cross-species single-cell transcriptomic analyses.Nat Commun. 2024 Apr 17;15(1):3306. doi: 10.1038/s41467-024-47484-1. Nat Commun. 2024. PMID: 38632253 Free PMC article.
-
Identification of Interpretable Clusters and Associated Signatures in Breast Cancer Single-Cell Data: A Topic Modeling Approach.Cancers (Basel). 2024 Mar 29;16(7):1350. doi: 10.3390/cancers16071350. Cancers (Basel). 2024. PMID: 38611028 Free PMC article.
References
-
- Beyer K., et al. (1999) When is “nearest neighbor” meaningful? In: Beeri C., Buneman P. (eds.) ICDT ’99 Proceedings of the 7th International Conference on Database Theory. p. 217–235. Springer-Verlag London, UK.
-
- Brennecke P., et al. (2013) Accounting for technical noise in single-cell RNA-seq experiments. Nat. methods, 10, 1093–1095. - PubMed
-
- Carey V., et al. (2011) RBGL: an interface to the BOOST graph library, R package version 1.40.1.
-
- Deng Q., et al. (2014) Single-cell RNA-seq reveals dynamic, random monoallelic gene expression in mammalian cells. Science, 343, 193–196. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources