Predicting Risk of Stroke From Lab Tests Using Machine Learning Algorithms: Development and Evaluation of Prediction Models
- PMID: 34860663
- PMCID: PMC8686476
- DOI: 10.2196/23440
Predicting Risk of Stroke From Lab Tests Using Machine Learning Algorithms: Development and Evaluation of Prediction Models
Abstract
Background: Stroke, a cerebrovascular disease, is one of the major causes of death. It causes significant health and financial burdens for both patients and health care systems. One of the important risk factors for stroke is health-related behavior, which is becoming an increasingly important focus of prevention. Many machine learning models have been built to predict the risk of stroke or to automatically diagnose stroke, using predictors such as lifestyle factors or radiological imaging. However, there have been no models built using data from lab tests.
Objective: The aim of this study was to apply computational methods using machine learning techniques to predict stroke from lab test data.
Methods: We used the National Health and Nutrition Examination Survey data sets with three different data selection methods (ie, without data resampling, with data imputation, and with data resampling) to develop predictive models. We used four machine learning classifiers and six performance measures to evaluate the performance of the models.
Results: We found that accurate and sensitive machine learning models can be created to predict stroke from lab test data. Our results show that the data resampling approach performed the best compared to the other two data selection techniques. Prediction with the random forest algorithm, which was the best algorithm tested, achieved an accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and area under the curve of 0.96, 0.97, 0.96, 0.75, 0.99, and 0.97, respectively, when all of the attributes were used.
Conclusions: The predictive model, built using data from lab tests, was easy to use and had high accuracy. In future studies, we aim to use data that reflect different types of stroke and to explore the data to build a prediction model for each type.
Keywords: lab tests; machine learning technology; predictive analytics; stroke.
©Eman M Alanazi, Aalaa Abdou, Jake Luo. Originally published in JMIR Formative Research (https://formative.jmir.org), 02.12.2021.
Conflict of interest statement
Conflicts of Interest: None declared.
Figures
![Figure 1](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/8686476/bin/formative_v5i12e23440_fig1.gif)
![Figure 2](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/8686476/bin/formative_v5i12e23440_fig2.gif)
![Figure 3](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/8686476/bin/formative_v5i12e23440_fig3.gif)
![Figure 4](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/8686476/bin/formative_v5i12e23440_fig4.gif)
Similar articles
-
Machine Learning and the Conundrum of Stroke Risk Prediction.Arrhythm Electrophysiol Rev. 2023 Apr 12;12:e07. doi: 10.15420/aer.2022.34. eCollection 2023. Arrhythm Electrophysiol Rev. 2023. PMID: 37427297 Free PMC article. Review.
-
A data-driven approach to predicting diabetes and cardiovascular disease with machine learning.BMC Med Inform Decis Mak. 2019 Nov 6;19(1):211. doi: 10.1186/s12911-019-0918-5. BMC Med Inform Decis Mak. 2019. PMID: 31694707 Free PMC article.
-
Predicting post-stroke pneumonia using deep neural network approaches.Int J Med Inform. 2019 Dec;132:103986. doi: 10.1016/j.ijmedinf.2019.103986. Epub 2019 Oct 1. Int J Med Inform. 2019. PMID: 31629312
-
Exploiting Machine Learning Algorithms and Methods for the Prediction of Agitated Delirium After Cardiac Surgery: Models Development and Validation Study.JMIR Med Inform. 2019 Oct 23;7(4):e14993. doi: 10.2196/14993. JMIR Med Inform. 2019. PMID: 31558433 Free PMC article.
-
Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes.Artif Intell Med. 2019 Jul;98:109-134. doi: 10.1016/j.artmed.2019.07.007. Epub 2019 Jul 26. Artif Intell Med. 2019. PMID: 31383477 Review.
Cited by
-
Predictive modelling and identification of key risk factors for stroke using machine learning.Sci Rep. 2024 May 20;14(1):11498. doi: 10.1038/s41598-024-61665-4. Sci Rep. 2024. PMID: 38769427 Free PMC article.
-
Machine learning-based prognostication of mortality in stroke patients.Heliyon. 2024 Apr 3;10(7):e28869. doi: 10.1016/j.heliyon.2024.e28869. eCollection 2024 Apr 15. Heliyon. 2024. PMID: 38601648 Free PMC article.
-
Risk stratification of atrial fibrillation and stroke using single nucleotide polymorphism and circulating biomarkers.PLoS One. 2023 Oct 12;18(10):e0292118. doi: 10.1371/journal.pone.0292118. eCollection 2023. PLoS One. 2023. PMID: 37824462 Free PMC article.
-
Machine learning approaches for biomarker discovery to predict large-artery atherosclerosis.Sci Rep. 2023 Sep 13;13(1):15139. doi: 10.1038/s41598-023-42338-0. Sci Rep. 2023. PMID: 37704672 Free PMC article.
-
The use of artificial intelligence for delivery of essential health services across WHO regions: a scoping review.Front Public Health. 2023 Jul 4;11:1102185. doi: 10.3389/fpubh.2023.1102185. eCollection 2023. Front Public Health. 2023. PMID: 37469694 Free PMC article. Review.
References
-
- Sacco RL, Kasner SE, Broderick JP, Caplan LR, Connors JJB, Culebras A, Elkind MSV, George MG, Hamdan AD, Higashida RT, Hoh BL, Janis LS, Kase CS, Kleindorfer DO, Lee JM, Moseley ME, Peterson ED, Turan TN, Valderrama AL, Vinters HV, American Heart Association Stroke Council‚ Council on Cardiovascular Surgery and Anesthesia. Council on Cardiovascular Radiology and Intervention. Council on Cardiovascular and Stroke Nursing. Council on Epidemiology and Prevention. Council on Peripheral Vascular Disease. Council on Nutrition‚ Physical Activity and Metabolism An updated definition of stroke for the 21st century: A statement for healthcare professionals from the American Heart Association/American Stroke Association. Stroke. 2013 Jul;44(7):2064–2089. doi: 10.1161/STR.0b013e318296aeca.STR.0b013e318296aeca - DOI - PubMed
-
- Benjamin EJ, Muntner P, Alonso A, Bittencourt MS, Callaway CW, Carson AP, Chamberlain AM, Chang AR, Cheng S, Das SR, Delling FN, Djousse L, Elkind MSV, Ferguson JF, Fornage M, Jordan LC, Khan SS, Kissela BM, Knutson KL, Kwan TW, Lackland DT, Lewis TT, Lichtman JH, Longenecker CT, Loop MShane, Lutsey PL, Martin SS, Matsushita K, Moran AE, Mussolino ME, O'Flaherty M, Pandey A, Perak AM, Rosamond WD, Roth GA, Sampson UKA, Satou GM, Schroeder EB, Shah SH, Spartano NL, Stokes A, Tirschwell DL, Tsao CW, Turakhia MP, VanWagner LB, Wilkins JT, Wong SS, Virani SS, American Heart Association Council on Epidemiology and Prevention Statistics Committee and Stroke Statistics Subcommittee Heart Disease and Stroke Statistics-2019 Update: A report from the American Heart Association. Circulation. 2019 Mar 05;139(10):e56–e528. doi: 10.1161/CIR.0000000000000659. https://www.ahajournals.org/doi/abs/10.1161/CIR.0000000000000659?url_ver... - DOI - DOI - PubMed
-
- European Stroke Initiative Executive Committee. EUSI Writing Committee. Olsen TS, Langhorne P, Diener HC, Hennerici M, Ferro J, Sivenius J, Wahlgren NG, Bath P. European Stroke Initiative Recommendations for Stroke Management – Update 2003. Cerebrovasc Dis. 2003;16(4):311–337. doi: 10.1159/000072554. https://www.karger.com?DOI=10.1159/000072554 - DOI - PubMed
-
- Arnett D, Blumenthal R, Albert MA, Buroker AB, Goldberger ZD, Hahn EJ, Himmelfarb CD, Khera A, Lloyd-Jones D, McEvoy JW, Michos ED, Miedema MD, Muñoz D, Smith SC, Virani SS, Williams KA, Yeboah J, Ziaeian B. 2019 ACC/AHA Guideline on the Primary Prevention of Cardiovascular Disease: A report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines. J Am Coll Cardiol. 2019 Sep 10;74(10):e177–e232. doi: 10.1016/j.jacc.2019.03.010.S0735-1097(19)33877-X - DOI - PMC - PubMed
LinkOut - more resources
Full Text Sources
Miscellaneous