Skip to main content
Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
AMIA Jt Summits Transl Sci Proc. 2024; 2024: 584–592.
Published online 2024 May 31.
PMCID: PMC11141861
PMID: 38827098

Designing a Consumer-centric Care Management Program by Prioritizing Interventions Using Deep Learning Causal Inference

Tianhao Li, MS, 1 Haoyun Feng, PhD, 2 Vikram Bandugula, MS, MBA, 2 and Ying Ding, PhD 1

Abstract

Care management is a team-based and patient-centered approach to reduce health risks and improve outcomes for managed populations. Post Discharge Management (PDM) is an important care management program at Elevance Health, which is aimed at reducing 30-day readmission risk for recently discharged patients. The current PDM program suffers from low engagement. When assigning interventions to patients, case managers choose the interventions to be conducted in each call only based on their limited personal experiences. In this work, we use deep learning causal inference to analyze the impact of interventions conducted on the first call on consumer engagement in the PDM program, which provides a reliable reference for case managers to select interventions to promote consumer engagement. With three experiments cross-validating the results, our results show that consumers will engage more in the program if the case manager conducts interventions that require more nurse-patient interactions on the first call. On the other hand, conducting less interactive and more technical interventions on the first call leads to relatively poor consumer engagement. These findings correspond to the clinical sense of experienced nurses and are consistent with previous findings in patient engagement in hospital settings.

Introduction

Care management is a set of healthcare activities designed to help patients and their support systems effectively manage medical conditions and related psychosocial problems1. The aim of care management includes improving patients’ health status, enhancing coordination of care, and reducing the need for expensive medical services2. Care management programs for high-risk consumers in health are covered by government health insurance, including Medicare and Medicaid, as well as most commercial health insurance. These programs provide consumers access to care management teams, which connect consumers with providers and community resources, and also provide consumers with social determinants of health (SDoH) barrier assistance and self-management knowledge education. Consumer engagement is one of the most critical factors that reflect the success of care management programs3. Better engagement also improves consumer outcomes and experiences of care4. However, low engagement rates are found in phone-based care management programs5. Therefore, low consumer engagement is an urgent problem to be solved to improve the quality of care management programs.

Previous work found multiple factors that could influence consumer engagement in care management programs, including patient-related factors, health professional-related factors, organization-related factors, and lay community-related factors6. While these factors are mainly inspired by sociological and psychological perspectives, few works studied how to improve consumer engagement from a data perspective. In this work, we explore improving consumer engagement from a data perspective by analyzing historical program data in the following care management program.

Post Discharge Management (PDM) is an important care management program in Elevance Health, a health insurance provider. The program is aimed at preventing 30-day readmission for those people who have recently been discharged from hospitals, which corresponds to the Hospital Readmissions Reduction Program (HRRP) by Centers for Medicare & Medicaid Services (CMS). The program lasts 4 weeks. For each enrolled member, case managers make outbound calls once every week to conduct interventions to ensure medication adherence and compliance with follow-up appointments, as well as to refer members to other programs and community resources. From Elevance Health internal report, although the PDM program has reduced potential 30-day readmission by a large amount, the program still suffers from low member engagement.

The current PDM program does not provide case managers with instructions on the interventions to be conducted on each call. Case managers will choose the interventions based on their experience to optimize member engagement. Intuitively, we hypothesize that conducting some interventions on the first call will lead to better engagement with members. To achieve this goal, we use historical user interaction data to analyze member’s engagement by Average Treatment Effect (ATE) with different interventions conducted on the first call. Considering each member’s health condition is different, causal inference methods are used to analyze the data. If the hypothesis is verified, it not only provides a reliable reference to case managers to decide which interventions should be conducted on the first call but also improves the program’s effectiveness by enhancing member engagement.

Causal inference is now widely used in healthcare with the emergence of large amounts of observational data from medical insurance companies and widespread adoption of electronic health records7. While the large amounts of data provide probability to calculate and analyze ATE for the tasks that cannot be performed by Randomized Controlled Trials (RCT), the effects of interventions may be confounded by variables that affect both treatment assignment as well as treatment outcome8. To correctly calculate the ATE using observational data, lots of causal inference methods have been developed in the past decades. Bayesian Additive Regression Trees (BART)9,10 is a prevailing machine learning model for causal inference and has demonstrated superior performance in a causal inference competition11. However, BART runs relatively slow in using Markov Chain Monte Carlo (MCMC)12, which makes it hard to process industry-scale data. With the development of artificial intelligence in recent years, a lot of deep learning methods are also proposed for causal inference13,14,15. Balancing Covariates Automatically Using Supervision (BCAUS)8 is a deep learning model for causal inference that uses neural networks to remove covariate imbalance by propensity modeling. It is much faster than BART and shows comparable performance with BART on a benchmark dataset8, which provides possibilities to analyze industry-scale data.

In this work, we use BCAUS as a causal inference model to prioritize the interventions that should be conducted on the first call to optimize member engagement in the PDM program. Our data is from the Elevance Health database, which contains covariates, interventions, and outcomes of each member enrolled in the PDM program. Experiments from two settings of BCAUS and one setting of BART demonstrate the robustness and correctness of our research, which identifies that interactive interventions, specifically Medication Reconciliation and Follow up Appointment, if conducted on the first call, will lead to better member engagement than other interventions. In summary, this work has the following contributions: 1) We explore improving consumer engagement from a data approach using causal inference in care management. 2) Our prioritization of interventions provides a reliable reference for case managers to optimize member engagement in the PDM program. 3) Implementation of BCAUS in our work verifies the feasibility of using deep learning for causal inference in large-scale healthcare data.

Methods

The Post Discharge Management (PDM) program is a 4-week care management program in Elevance Health. Members recently discharged from hospitals are selected for the program to prevent 30-day readmission. Each week, case managers will call the enrolled member and choose one or multiple interventions from all seven interventions to conduct. If a member is engaged with the program and closes all of the seven interventions in four weeks, the member is considered to successfully graduate from the program.

The seven interventions are Medication Reconciliation, Problem List, Problem Root Cause, Follow up Appointment, Appointment Scheduled, Health Record, and Referral. Medication reconciliation is an intervention in which the case manager review and compare patient’s current medications with medication history, to identify potential prescription errors; Problem list is an intervention for case manager to note down list of problems reported by provider; In problem root cause intervention, the case manager dig into patient’s medical record and discuss with patient to identify root cause of the problems; Follow up appointment is for the case manager to review all follow up doctor appointments that are required; Appointment scheduled is an intervention to help patient scheduling doctor appointments; Health record is an intervention in which the case manager review and update patient’s health record; Referral is an intervention in which the case manager provider patient with doctor referral. In this work, we use graduation as the metric to indicate a member’s engagement.

To find whether conducting some interventions on the first call will lead to better member engagement, we use causal inference models to calculate the ATE of members’ graduation for each intervention or each intervention pair. If the covariates are correctly balanced, the intervention with a higher ATE will lead to better outcomes in the total population if conducted on the first call.

Causal Inference Model Balancing Covariates Automatically Using Supervision (BCAUS)8 model for causal inference is applied in this work. It is a deep learning model that predicts the intervention and balances the covariates at the same time by using a joint loss penalizing both the incorrect prediction of the assigned treatment and the imbalance of covariates calculated by inverse probability of treatment weighting (IPTW). Figure 1 is the workflow of the BCAUS model. The model uses all of the covariates as input and predicts the intervention assignment. Covariates with IPTW are used to calculate the covariate bias between treatment group and control group. The model is trained by minimizing the binary cross-entropy (BCE) loss of intervention prediction and mean squared error (MSE) loss of the covariate bias. After training, propensity score together with outcome is used to calculate the ATE16,17. To verify the performance of BCAUS model on balancing the covariates, we calculate the standardized difference of covariates between treatment group and control group with and without IPTW. We use 0.1 as a commonly used threshold for ensuring the removal of covariate bias between the two groups16,17. We also include another commonly used machine learning model for causal inference, Bayesian Additive Regression Trees (BART)9, to cross-validate the performance of BCAUS model. Instead of using propensity score, BART is a model using counterfactual results to calculate ATE. The two models with different methods to calculate ATE will cross-validate their performance if both models produce similar results.

An external file that holds a picture, illustration, etc.
Object name is 2338f1.jpg

The workflow of BCAUS model. BCE loss for minimizing intervention assignment error and MSE loss for minimizing covariate bias are used together to optimize the model.

Experiment Setting In this work, we use two experiment settings to get the ranking of interventions to be conducted on the first call for better member engagement. In the first setting, we use the cases who had a specific intervention on the first call as the treatment group and the cases who did not have the same intervention on the first call as the control group, regardless of whether they had other interventions or not on the first call. Instead, we add the information of whether they had other interventions or not into the covariates, which are supposed to be balanced by propensity modeling. The ranking of the interventions is based on the ATE calculated for each intervention. Interventions with higher ATE are better for member engagement in the total population if conducted on the first call. In the second setting, we calculate the ATE between intervention pairs. For every two interventions, we select the cases that had only one specific intervention on the first call as the treatment group and the cases that only had another intervention as the control group. By calculating the ATE of the two groups, we can get a relative ranking of these two interventions. After all the intervention pairs are calculated, we will get a 7 × 7 matrix, in which each value measures the ATE of one pair of two interventions, with one intervention representing the row of the value and another intervention representing the column of the value. By comparing these ATE values, we can get the whole ranking of the 7 interventions. To distinguish the two settings, we call the first setting Single Intervention and the second setting Intervention Pair.

Results

Data The PDM program data is from the Elevance Health database. The data has been subjected to the company’s deidentification processes to ensure the removal or modification of personal identifiers, thus minimizing the risk of re-identification and ensuring compliance with privacy regulations and standards. Data with the start date from 01/01/2021 to 05/05/2022 is used in this work, with 27,300 cases in total. For each case, interventions conducted on the first call and graduation status are extracted from the database with related members’ data. Member’s data including member’s health conditions, medications, health service usages, UM frequencies, risk scores, and comorbidities are used as covariates in the causal inference models with 75 covariates in total. The number of cases who had a specific intervention in the first week ranges from 8,465 to 16,385. Table 1 lists the number of cases that had or did not have a specific intervention for each intervention. The percentages of cases that are conducted with a specific intervention on the first call among all the cases are from 30.9% to 60.0%. These numbers indicate that case managers select the interventions on the first call neither completely randomly nor similarly among all the case managers, which also suggests the necessity of providing a reference for case managers to choose interventions to conduct. Using this data, we conduct experiments of both settings using BCAUS, and an extra experiment using BART to cross-validate the results.

Table 1:

Statistics of cases who had or did not have interventions.

Intervention nameNumber of cases with interventionNumber of cases without intervention
Medication Reconciliation8,79518,505
Problem List12,72814,572
Problem Root Cause16,38510,915
Follow up Appointment9,86617,434
Appointment Scheduled10,21417,176
Health Record11,98515,315
Referral8,42518,875

BCAUS Single Intervention In this setting, we use the cases who had one specific intervention as the treatment group and cases who did not have that intervention as the control group. We add the other interventions to the covariates and train the BCAUS model. ATE is calculated to measure how the specific intervention influences member engagement in the total population. Using Elevance Health internal server, it takes less than a minute to train a BCAUS model, which is the most time-consuming part of getting the ATE from BCAUS. Table 2 shows the result of ATE for each intervention using BCAUS. In the table, Medication Reconciliation has the highest ATE among all the interventions. Based on the results, all the interventions are recommended in the following order.

Table 2:

ATE for each intervention in the Single Intervention setting calculated by BCAUS.

Intervention nameAverage Treatment Effect
Medication Reconciliation0.196
Problem List-0.003
Problem Root Cause-0.035
Follow up Appointment0.094
Appointment Scheduled0.077
Health Record0.016
Referral0.051

Medication Reconciliation > Follow up Appointment > Appointment Scheduled > Referral > Health Record > Problem List > Problem Root Cause

To learn the performance of how BCAUS balances covariates in this setting, we calculate the standardized difference of covariates of raw data and covariates balanced with IPTW from BCAUS separately. Figure 2 is the distribution of standardized differences of all member’s data covariates from raw data and BCAUS IPTW. X axis is the standardized difference. Y axis is the index of covariates. Whiskers are at 5 and 95 percentiles. Covariates with standardized differences smaller than 0.1 are treated as balanced covariates. In this setting, all the covariates are balanced after BCAUS IPTW, while lots of covariates are not balanced in raw data. Therefore, BCAUS has a high performance in balancing the covariates, which also suggests that our result of intervention prioritization is reliable.

An external file that holds a picture, illustration, etc.
Object name is 2338f2.jpg

Distributions of standardized difference of all the member’s data covariates from raw data (left) and from BCAUS IPTW (right). All the covariates are balanced after BCAUS IPTW.

BCAUS Intervention Pair In this setting, for every two interventions, we select the cases that only had one intervention as treatment group and that only had the other intervention as control group. For each intervention pair, we train a BCAUS model and calculate the ATE. ATE for each intervention pair represents the impact on member engagement if conducting the intervention of the treatment group on the first call instead of the intervention of the control group. Table 3 shows the result of ATE for each intervention pair using BCAUS. The indices of row and column from 1 to 7 represent Medication Reconciliation, Problem List, Problem Root Cause, Follow up Appointment, Appointment Scheduled, Health Record, and Referral. Columns represent interventions of the treatment group and rows represent interventions of the control group. Based on the results, all the interventions are recommended in the following order.

Table 3:

ATE for each intervention pair in the Intervention Pair setting calculated by BCAUS.

1234567
1-0.383-0.319-0.140-0.222-0.355-0.130
20.381-0.0690.1950.158-0.0470.062
30.3130.0710.2460.1550.0270.148
40.150-0.195-0.246-0.00-0.172-0.023
50.220-0.157-0.1550.017-0.124-0.015
60.3510.043-0.0250.1770.1230.074
70.131-0.063-0.1480.0270.012-0.078

Medication Reconciliation > Follow up Appointment > Appointment Scheduled > Referral > Problem List > Health Record > Problem Root Cause

Compared with the result in Single Intervention setting by BCAUS, only the two adjacent interventions Health Record and Problem List are reversed. This minor inconsistency may be because this setting only uses a subgroup of the data and the ATE of the two interventions in the Single Intervention setting is very close. The two results using BCAUS demonstrate the robustness of our method to prioritize interventions in the PDM program.

BART Experiment In the experiment using BART, we use covariates together with interventions to predict the outcome. Specifically, we use 1 to represent that the case manager conducted a specific intervention on the first call, and 0 to represent that the case manager did not conduct that intervention on the first call. All the 7 numbers representing the intervention information are added to input with member’s data covariates. To calculate the ATE for each intervention, we flip that intervention from 1 to 0 or from 0 to 1 and get the counterfactual outcome from the BART model. ATE is calculated using the factual and counterfactual outcome. Using Elevance Health internal server, it takes about 12 minutes on average to do model inference for one counterfactual outcome, which is the most time-consuming part of getting the ATE from BART. Table 4 shows the result of ATE for each intervention in BART. Based on the results, all the interventions are recommended in the following order.

Table 4:

ATE for each intervention calculated by BART.

Intervention nameAverage Treatment Effect
Medication Reconciliation0.141
Problem List0.020
Problem Root Cause-0.027
Follow up Appointment0.069
Appointment Scheduled0.025
Health Record0.042
Referral0.025

Medication Reconciliation > Follow up Appointment > Health Record > Appointment Scheduled = Referral > Problem List > Problem Root Cause

Compared with the result in Single Intervention setting by BCAUS, Health Record moves up two places, while the others remain the same. Considering BART and BCAUS use two different methods, counterfactual inference and IPTW, to calculate ATE, the results of the two intervention prioritizations are close enough to cross-validate each other’s correctness.

Discussion

This study explores the impact of interventions conducted on the first call on consumer engagement in a care management program from a data perspective. We use a deep learning causal inference model BCAUS to analyze how the interventions conducted on the first call influence member engagement, specifically member’s graduation, in the PDM program, and cross-validate the results using another prevailing machine learning model BART. Our results show that the two interventions that require interaction between member and case manager, Medication Reconciliation and Follow up Appointment, continuously get the highest ATE, which indicates that member’s engagement will be improved in the total population if these two interventions are conducted on the first call. On the other hand, Problem Root Cause, which is a relative technical intervention with a structured process18, continuously gets the lowest ATE, which suggests that case managers may consider conducting this intervention later from the perspective of improving member engagement. However, Problem Root Cause has the highest percentage 60.0% to be conducted on the first call from statistics of the data in Table 1. Medication Reconciliation and Follow up Appointment have relatively low percentages, 32.2% and 36.1%, separately. The inconsistency of our prioritization and current program situation emphasizes the importance of this work in providing a reference of intervention conduction for improving member engagement.

Our results come from two experiments using BCAUS and one experiment using BART. In the two BCAUS experiments, there is only one minor difference, where two adjacent interventions are reversed in the two results. Considering one experiment using the whole data and the other using the subgroup of the data, and the ATE of the two reversed interventions are relatively close, these two experiments demonstrate the robustness of BCAUS method. BART experiment cross-validates the results of BCAUS. Only one intervention’s order from BART is different from BCAUS with two places advanced. Since BCAUS and BART use two different causal methods to calculate the ATE, all three experiments demonstrate the correctness of our results. Moreover, BCAUS is an order of magnitude faster than BART in Elevance Health internal server. In this way, our work not only provides a reliable reference for case managers to improve members’ engagement and promote consumer health, but also demonstrates the ability of BCAUS as a deep learning causal inference model to fast and accurately process large-scale healthcare data.

Experienced nurses in Elevance Health found our results intuitive. Medication Reconciliation is an effective intervention in readmission prevention19. From the nurses’ perspective, it is an educational topic and a very engaging intervention. The conversation would start with ”Do you understand your medication?” Many members were sick and just got out of the hospital, so they did not get a chance to grab the understanding. By helping members understand their medications and concerns on the first call, case managers will achieve better engagement with members throughout the entire program period. On the other hand, Problem Root Cause is an intervention where case managers go through the member’s records and summarize the root cause, which does not require the member’s participation. Case managers conducting this intervention miss the opportunity to connect with members, which may lead to relatively poor engagement. Our results are also consistent with previous studies on patient engagement in hospital settings, which show that interactive interplay between the patient and nurse helps promote patient engagement20. However, documentation work from nurses does not necessarily reflect patient engagement21. Nurses can be task-oriented in some activities, gaining information from patients without sufficient patient engagement20. These studies may explain our findings in care management. Interactive interventions such as Medication Reconciliation promote consumer engagement, while documentation work such as Problem Root Cause contributes less to improving consumer engagement.

Conclusion

In this work, we explore the impact of interventions conducted on the first call on consumer engagement in a care management program of Elevance Health. Our results show that more interactive interventions such as Medication Reconciliation and Follow up Appointment help promote consumer engagement, while less interactive and more technical interventions such as Problem Root Cause may not contribute to consumer engagement. Our experiments demonstrate the robustness of the deep learning causal inference model we use and the correctness of our results and suggest the feasibility of using deep learning causal inference to process large-scale healthcare data. Our intervention prioritization provides a reliable reference for case managers to select the interventions to be conducted on the first call to improve member engagement. However, our pipeline has limitations. For each intervention, a BCAUS model needs to be trained to get the ATE for prioritizing. When the number of interventions is large, it will be difficult to calculate ATEs for all the interventions. For example, if we want to prioritize intervention combinations in the same setting, there will be 49 intervention combinations containing 2 interventions and 343 intervention combinations containing 3 interventions. Calculating all ATEs will be difficult and prioritizing intervention combinations with this large amount of numbers will be inaccurate. In future work, we intend to implement the proposed pipeline in other care management programs in Elevance Health to promote consumer health. We are also building efficient deep learning causal inference pipelines for complex healthcare programs dealing with more interventions to promote outcomes and benefit consumers.

Acknowledgments

The authors acknowledge Davis Klaila, who is a Ph.D. in Behavior Science, and Sharon Stritzel, who used to be a case manager and now a care management product lead in Elevance Health, for providing input in this work.

Figures & Table

References

1. Bodenheimer T, Berry-Millett R. Care management of patients with complex health care needs. Policy. 2009;1(6) [PubMed] [Google Scholar]
2. Mechanic R. Mechanic care management improve the value of US health care. The Health Industry Forum. 2004.
3. Clavel N, Paquette J, Dumez V, Del Grande C, Ghadiri DP, Pomey MP, et al. Patient engagement in care: A scoping review of recently validated tools assessing patients’ and healthcare professionals’ preferences and experience. Health Expectations. 2021;24(6):1924–35. [PMC free article] [PubMed] [Google Scholar]
4. Higgins T, Larson E, Schnall R. Unraveling the meaning of patient engagement: a concept analysis. Patient Education and Counseling. 2017;100(1):30–6. [PubMed] [Google Scholar]
5. Annis AM, Holtrop JS, Tao M, Chang HC, Luo Z. Comparison of provider and plan-based targeting strategies for disease management. 2015. [PubMed]
6. Graffigna G, Barello S. Spotlight on the Patient Health Engagement model (PHE model): a psychosocial theory to understand people’s meaningful engagement in their own health care. Patient preference and adherence. 2018. pp. 1261–71. [PMC free article] [PubMed]
7. Shi J, Norgeot B. Learning Causal Effects From Observational Data in Healthcare: A Review and Summary. Frontiers in Medicine. 2022:2027. [PMC free article] [PubMed] [Google Scholar]
8. Belthangady C, Stedden W, Norgeot B. Minimizing bias in massive multi-arm observational studies with BCAUS: balancing covariates automatically using supervision. BMC Medical Research Methodology. 2021;21(1):1–10. [PMC free article] [PubMed] [Google Scholar]
9. Chipman HA, George EI, McCulloch RE. BART: Bayesian additive regression trees. 2010.
10. Hill JL. Bayesian nonparametric modeling for causal inference. Journal of Computational and Graphical Statistics. 2011;20(1):217–40. [Google Scholar]
11. Dorie V, Hill J, Shalit U, Scott M, Cervone D. Automated versus do-it-yourself methods for causal inference: Lessons learned from a data analysis competition. 2019.
12. Ronen O, Saarinen T, Tan YS, Duncan J, Yu B. A Mixing Time Lower Bound for a Simplified Version of BART. arXiv preprint arXiv:221009352. 2022 [Google Scholar]
13. Johansson F, Shalit U, Sontag D. Learning representations for counterfactual inference. In: International conference on machine learning. PMLR. 2016:3020–9. [Google Scholar]
14. Shalit U, Johansson FD, Sontag D. Estimating individual treatment effect: generalization bounds and algorithms. In: International Conference on Machine Learning. PMLR. 2017:3076–85. [Google Scholar]
15. Louizos C, Shalit U, Mooij JM, Sontag D, Zemel R, Welling M. Causal effect inference with deep latent-variable models. Advances in neural information processing systems. 2017;30 [Google Scholar]
16. Austin PC, Stuart EA. Moving towards best practice when using inverse probability of treatment weighting (IPTW) using the propensity score to estimate causal treatment effects in observational studies. Statistics in medicine. 2015;34(28):3661–79. [PMC free article] [PubMed] [Google Scholar]
17. Austin PC. Balance diagnostics for comparing the distribution of baseline covariates between treatment groups in propensity-score matched samples. Statistics in medicine. 2009;28(25):3083–107. [PMC free article] [PubMed] [Google Scholar]
18. Charles R, Hood B, Derosier JM, Gosbee JW, Li Y, Caird MS, et al. How to perform a root cause analysis for workup and future prevention of medical errors: a review. Patient safety in surgery. 2016;10(1):1–5. [PMC free article] [PubMed] [Google Scholar]
19. Zemaitis CT, Morris G, Cabie M, Abdelghany O, Lee L. Reducing readmission at an academic medical center: results of a pharmacy-facilitated discharge counseling and medication reconciliation program. Hospital Pharmacy. 2016;51(6):468–73. [PMC free article] [PubMed] [Google Scholar]
20. Tobiano G, Jerofke-Owen T, Marshall AP. Promoting patient engagement: a scoping review of actions that align with the interactive care model. Scandinavian Journal of Caring Sciences. 2021;35(3):722–41. [PubMed] [Google Scholar]
21. K#x00E4;rkk#x00E4;inen O, Bondas T, Eriksson K. Documentation of individualized patient care: A qualitative metasynthesis. Nursing ethics. 2005;12(2):123–32. [PubMed] [Google Scholar]

Articles from AMIA Summits on Translational Science Proceedings are provided here courtesy of American Medical Informatics Association

-