Machine learning approaches for biomarker discovery to predict large-artery atherosclerosis
- PMID: 37704672
- PMCID: PMC10499778
- DOI: 10.1038/s41598-023-42338-0
Machine learning approaches for biomarker discovery to predict large-artery atherosclerosis
Abstract
Large-artery atherosclerosis (LAA) is a leading cause of cerebrovascular disease. However, LAA diagnosis is costly and needs professional identification. Many metabolites have been identified as biomarkers of specific traits. However, there are inconsistent findings regarding suitable biomarkers for the prediction of LAA. In this study, we propose a new method integrates multiple machine learning algorithms and feature selection method to handle multidimensional data. Among the six machine learning models, logistic regression (LR) model exhibited the best prediction performance. The value of area under the receiver operating characteristic curve (AUC) was 0.92 when 62 features were incorporated in the external validation set for the LR model. In this model, LAA could be well predicted by clinical risk factors including body mass index, smoking, and medications for controlling diabetes, hypertension, and hyperlipidemia as well as metabolites involved in aminoacyl-tRNA biosynthesis and lipid metabolism. In addition, we found that 27 features were present among the five adopted models that could provide good results. If these 27 features were used in the LR model, an AUC value of 0.93 could be achieved. Our study has demonstrated the effectiveness of combining machine learning algorithms with recursive feature elimination and cross-validation methods for biomarker identification. Moreover, we have shown that using shared features can yield more reliable correlations than either model, which can be valuable for future identification of LAA.
© 2023. Springer Nature Limited.
Conflict of interest statement
The authors declare no competing interests.
Figures
![Figure 1](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/10499778/bin/41598_2023_42338_Fig1_HTML.gif)
![Figure 2](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/10499778/bin/41598_2023_42338_Fig2_HTML.gif)
![Figure 3](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/10499778/bin/41598_2023_42338_Fig3_HTML.gif)
![Figure 4](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/10499778/bin/41598_2023_42338_Fig4_HTML.gif)
![Figure 5](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/10499778/bin/41598_2023_42338_Fig5_HTML.gif)
![Figure 6](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/10499778/bin/41598_2023_42338_Fig6_HTML.gif)
![Figure 7](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/10499778/bin/41598_2023_42338_Fig7_HTML.gif)
Similar articles
-
The prediction of in-hospital mortality in chronic kidney disease patients with coronary artery disease using machine learning models.Eur J Med Res. 2023 Jan 18;28(1):33. doi: 10.1186/s40001-023-00995-x. Eur J Med Res. 2023. PMID: 36653875 Free PMC article.
-
Prediction of atherosclerosis using machine learning based on operations research.Math Biosci Eng. 2022 Mar 14;19(5):4892-4910. doi: 10.3934/mbe.2022229. Math Biosci Eng. 2022. PMID: 35430846
-
Machine learning-based prediction of postpartum hemorrhage after vaginal delivery: combining bleeding high risk factors and uterine contraction curve.Arch Gynecol Obstet. 2022 Oct;306(4):1015-1025. doi: 10.1007/s00404-021-06377-0. Epub 2022 Feb 16. Arch Gynecol Obstet. 2022. PMID: 35171347
-
Diagnostic Performance of 2D and 3D T2WI-Based Radiomics Features With Machine Learning Algorithms to Distinguish Solid Solitary Pulmonary Lesion.Front Oncol. 2021 Nov 18;11:683587. doi: 10.3389/fonc.2021.683587. eCollection 2021. Front Oncol. 2021. PMID: 34868905 Free PMC article.
-
Preventing dataset shift from breaking machine-learning biomarkers.Gigascience. 2021 Sep 28;10(9):giab055. doi: 10.1093/gigascience/giab055. Gigascience. 2021. PMID: 34585237 Free PMC article. Review.
Cited by
-
Unlocking potential biomarkers bridging coronary atherosclerosis and pyrimidine metabolism-associated genes through an integrated bioinformatics and machine learning approach.BMC Cardiovasc Disord. 2024 Mar 7;24(1):148. doi: 10.1186/s12872-024-03819-w. BMC Cardiovasc Disord. 2024. PMID: 38454353 Free PMC article.
-
Mechanism exploration and biomarker identification of glycemic deterioration in patients with diseases of the exocrine pancreas.Sci Rep. 2024 Feb 22;14(1):4374. doi: 10.1038/s41598-024-52956-x. Sci Rep. 2024. PMID: 38388766 Free PMC article.
References
-
- Young, J. L., U. Libby P Fau-Schönbeck, & U. Schönbeck. Cytokines in the pathogenesis of atherosclerosis. 2002(0340–6245 (Print)). - PubMed
-
- Chapman, M. J. From pathophysiology to targeted therapy for atherothrombosis: a role for the combination of statin and aspirin in secondary prevention. 2007(0163–7258 (Print)). - PubMed
-
- Stoll, G., & Bendszus, M. Inflammation and atherosclerosis: Novel insights into plaque formation and destabilization. 2006(1524–4628 (Electronic)). - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Medical