NetGenes: A Database of Essential Genes Predicted Using Features From Interaction Networks
- PMID: 34630517
- PMCID: PMC8495214
- DOI: 10.3389/fgene.2021.722198
NetGenes: A Database of Essential Genes Predicted Using Features From Interaction Networks
Abstract
Essential gene prediction models built so far are heavily reliant on sequence-based features, and the scope of network-based features has been narrow. Previous work from our group demonstrated the importance of using network-based features for predicting essential genes with high accuracy. Here, we apply our approach for the prediction of essential genes to organisms from the STRING database and host the results in a standalone website. Our database, NetGenes, contains essential gene predictions for 2,700+ bacteria predicted using features derived from STRING protein-protein functional association networks. Housing a total of over 2.1 million genes, NetGenes offers various features like essentiality scores, annotations, and feature vectors for each gene. NetGenes database is available from https://rbc-dsai-iitm.github.io/NetGenes/.
Keywords: database; essential genes; interaction network; machine learning; networks.
Copyright © 2021 Senthamizhan, Ravindran and Raman.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures
![Figure 1](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/8495214/bin/fgene-12-722198-g0001.gif)
![Figure 2](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/8495214/bin/fgene-12-722198-g0002.gif)
Similar articles
-
Essential gene prediction using limited gene essentiality information-An integrative semi-supervised machine learning strategy.PLoS One. 2020 Nov 30;15(11):e0242943. doi: 10.1371/journal.pone.0242943. eCollection 2020. PLoS One. 2020. PMID: 33253254 Free PMC article.
-
An Experimental Approach to Genome Annotation: This report is based on a colloquium sponsored by the American Academy of Microbiology held July 19-20, 2004, in Washington, DC.Washington (DC): American Society for Microbiology; 2004. Washington (DC): American Society for Microbiology; 2004. PMID: 33001599 Free Books & Documents. Review.
-
Network Embedding the Protein-Protein Interaction Network for Human Essential Genes Identification.Genes (Basel). 2020 Jan 31;11(2):153. doi: 10.3390/genes11020153. Genes (Basel). 2020. PMID: 32023848 Free PMC article.
-
An Evaluation of Machine Learning Approaches for the Prediction of Essential Genes in Eukaryotes Using Protein Sequence-Derived Features.Comput Struct Biotechnol J. 2019 Jun 8;17:785-796. doi: 10.1016/j.csbj.2019.05.008. eCollection 2019. Comput Struct Biotechnol J. 2019. PMID: 31312416 Free PMC article.
-
Network-based features enable prediction of essential genes across diverse organisms.PLoS One. 2018 Dec 13;13(12):e0208722. doi: 10.1371/journal.pone.0208722. eCollection 2018. PLoS One. 2018. PMID: 30543651 Free PMC article.
Cited by
-
Identification of discriminant features from stationary pattern of nucleotide bases and their application to essential gene classification.Front Genet. 2023 Apr 20;14:1154120. doi: 10.3389/fgene.2023.1154120. eCollection 2023. Front Genet. 2023. PMID: 37152988 Free PMC article.
-
Bacterial genome reductions: Tools, applications, and challenges.Front Genome Ed. 2022 Aug 31;4:957289. doi: 10.3389/fgeed.2022.957289. eCollection 2022. Front Genome Ed. 2022. PMID: 36120530 Free PMC article. Review.
References
-
- Henderson K., Gallagher B., Li L., Akoglu L., Eliassi-Rad T., Tong H., et al. . (2011). It's who you know: graph mining using recursive structural features, in Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '11 (New York, NY: Association for Computing Machinery; ), 663–671.
-
- L'Heureux A., Grolinger K., Elyamany H. F., Capretz M. A. M. (2017). Machine learning with big data: challenges and approaches. IEEE Access 5, 7776–7797. 10.1109/ACCESS.2017.2696365 - DOI
LinkOut - more resources
Full Text Sources