ConceptWAS: A high-throughput method for early identification of COVID-19 presenting symptoms and characteristics from clinical notes
- PMID: 33774203
- PMCID: PMC7992296
- DOI: 10.1016/j.jbi.2021.103748
ConceptWAS: A high-throughput method for early identification of COVID-19 presenting symptoms and characteristics from clinical notes
Abstract
Objective: Identifying symptoms and characteristics highly specific to coronavirus disease 2019 (COVID-19) would improve the clinical and public health response to this pandemic challenge. Here, we describe a high-throughput approach - Concept-Wide Association Study (ConceptWAS) - that systematically scans a disease's clinical manifestations from clinical notes. We used this method to identify symptoms specific to COVID-19 early in the course of the pandemic.
Methods: We created a natural language processing pipeline to extract concepts from clinical notes in a local ER corresponding to the PCR testing date for patients who had a COVID-19 test and evaluated these concepts as predictors for developing COVID-19. We identified predictors from Firth's logistic regression adjusted by age, gender, and race. We also performed ConceptWAS using cumulative data every two weeks to identify the timeline for recognition of early COVID-19-specific symptoms.
Results: We processed 87,753 notes from 19,692 patients subjected to COVID-19 PCR testing between March 8, 2020, and May 27, 2020 (1,483 COVID-19-positive). We found 68 concepts significantly associated with a positive COVID-19 test. We identified symptoms associated with increasing risk of COVID-19, including "anosmia" (odds ratio [OR] = 4.97, 95% confidence interval [CI] = 3.21-7.50), "fever" (OR = 1.43, 95% CI = 1.28-1.59), "cough with fever" (OR = 2.29, 95% CI = 1.75-2.96), and "ageusia" (OR = 5.18, 95% CI = 3.02-8.58). Using ConceptWAS, we were able to detect loss of smell and loss of taste three weeks prior to their inclusion as symptoms of the disease by the Centers for Disease Control and Prevention (CDC).
Conclusion: ConceptWAS, a high-throughput approach for exploring specific symptoms and characteristics of a disease like COVID-19, offers a promise for enabling EHR-powered early disease manifestations identification.
Keywords: COVID-19; EHR; Natural language processing.
Copyright © 2021. Published by Elsevier Inc.
Conflict of interest statement
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Figures
![Fig. 1](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/7992296/bin/gr1_lrg.gif)
![Fig. 2](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/7992296/bin/gr2_lrg.gif)
![Fig. 3](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/7992296/bin/gr3_lrg.gif)
![Fig. A1](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/7992296/bin/gr4_lrg.gif)
![Fig. B1](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/7992296/bin/gr5_lrg.gif)
![Fig. C1](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/7992296/bin/gr6_lrg.gif)
![Fig. D1](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/7992296/bin/gr7_lrg.gif)
Update of
-
ConceptWAS: a high-throughput method for early identification of COVID-19 presenting symptoms.medRxiv [Preprint]. 2020 Nov 10:2020.11.06.20227165. doi: 10.1101/2020.11.06.20227165. medRxiv. 2020. Update in: J Biomed Inform. 2021 May;117:103748. doi: 10.1016/j.jbi.2021.103748. PMID: 33200151 Free PMC article. Updated. Preprint.
Similar articles
-
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3. Cochrane Database Syst Rev. 2022. PMID: 35593186 Free PMC article. Review.
-
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.Cochrane Database Syst Rev. 2021 Feb 23;2(2):CD013665. doi: 10.1002/14651858.CD013665.pub2. Cochrane Database Syst Rev. 2021. Update in: Cochrane Database Syst Rev. 2022 May 20;5:CD013665. doi: 10.1002/14651858.CD013665.pub3. PMID: 33620086 Free PMC article. Updated.
-
Symptom Profiles and Progression in Hospitalized and Nonhospitalized Patients with Coronavirus Disease, Colorado, USA, 2020.Emerg Infect Dis. 2021 Feb;27(2):385-395. doi: 10.3201/eid2702.203729. Emerg Infect Dis. 2021. PMID: 33496225 Free PMC article. Review.
-
ConceptWAS: a high-throughput method for early identification of COVID-19 presenting symptoms.medRxiv [Preprint]. 2020 Nov 10:2020.11.06.20227165. doi: 10.1101/2020.11.06.20227165. medRxiv. 2020. Update in: J Biomed Inform. 2021 May;117:103748. doi: 10.1016/j.jbi.2021.103748. PMID: 33200151 Free PMC article. Updated. Preprint.
-
Symptoms and symptom clusters associated with SARS-CoV-2 infection in community-based populations: Results from a statewide epidemiological study.medRxiv [Preprint]. 2020 Oct 22:2020.10.11.20210922. doi: 10.1101/2020.10.11.20210922. medRxiv. 2020. Update in: PLoS One. 2021 Mar 24;16(3):e0241875. doi: 10.1371/journal.pone.0241875. PMID: 33106813 Free PMC article. Updated. Preprint.
Cited by
-
Improving reporting standards for phenotyping algorithm in biomedical research: 5 fundamental dimensions.J Am Med Inform Assoc. 2024 Apr 3;31(4):1036-1041. doi: 10.1093/jamia/ocae005. J Am Med Inform Assoc. 2024. PMID: 38269642
-
Architectural Design of a Blockchain-Enabled, Federated Learning Platform for Algorithmic Fairness in Predictive Health Care: Design Science Study.J Med Internet Res. 2023 Oct 30;25:e46547. doi: 10.2196/46547. J Med Internet Res. 2023. PMID: 37902833 Free PMC article.
-
Medical Experts' Agreement on Risk Assessment Based on All Possible Combinations of the COVID-19 Predictors-A Novel Approach for Public Health Screening and Surveillance.Int J Environ Res Public Health. 2022 Dec 10;19(24):16601. doi: 10.3390/ijerph192416601. Int J Environ Res Public Health. 2022. PMID: 36554487 Free PMC article.
-
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3. Cochrane Database Syst Rev. 2022. PMID: 35593186 Free PMC article. Review.
-
Novel informatics approaches to COVID-19 Research: From methods to applications.J Biomed Inform. 2022 May;129:104028. doi: 10.1016/j.jbi.2022.104028. Epub 2022 Feb 16. J Biomed Inform. 2022. PMID: 35181495 Free PMC article. No abstract available.
References
-
- WHO Coronavirus Disease (COVID-19) Dashboard, (n.d.). https://covid19.who.int/ (accessed May 26, 2020).
-
- Guan W., Ni Z., Hu Y., Liang W., Ou C., He J., Liu L., Shan H., Lei C., Hui D.S.C., Du B., Li L., Zeng G., Yuen K.-Y., Chen R., Tang C., Wang T., Chen P., Xiang J., Li S., Wang J., Liang Z., Peng Y., Wei L., Liu Y., Hu Y., Peng P., Wang J., Liu J., Chen Z., Li G., Zheng Z., Qiu S., Luo J., Ye C., Zhu S., Zhong N. Clinical Characteristics of Coronavirus Disease 2019 in China. N. Engl. J. Med. 2020 doi: 10.1056/NEJMoa2002032. - DOI - PMC - PubMed
-
- Makaronidis J., Mok J., Balogun N., Magee C.G., Omar R.Z., Carnemolla A., Batterham R.L. Seroprevalence of SARS-CoV-2 antibodies in people with an acute loss in their sense of smell and/or taste in a community-based population in London, UK: An observational cohort study. PLoS Med. 2020;17 doi: 10.1371/journal.pmed.1003358. - DOI - PMC - PubMed
-
- A. Fritz, M. Brice-Saddler, M. Judkis, CDC confirms six coronavirus symptoms showing up in patients over and over, Washington Post. (n.d.). https://www.washingtonpost.com/health/2020/04/27/six-new-coronavirus-sym... (accessed September 25, 2020).
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical