Figures
Abstract
Estimated glomerular filtration rate (eGFR) decline is associated with negative health outcomes, but the use of decision tree algorithms to predict eGFR decline is underreported. Among patients hospitalized during the first year of the COVID-19 pandemic, it remains unclear which individuals are at the greatest risk of eGFR decline after discharge. We conducted a retrospective cohort study on patients hospitalized at Stony Brook University Hospital in 2020 who were followed for 36 months post discharge. Random Forest (RF) identified the top ten features associated with fast eGFR decline. Logistic regression (LR) and Classification and Regression Trees (CART) were then employed to uncover the relative importance of these top features and identify the highest risk patients. In the cohort of 1,747 hospital survivors, 61.6% experienced fast eGFR decline, which was associated with younger age, higher baseline eGFR, and acute kidney injury (AKI). Multivariate LR analysis showed that older age was associated with lower odds of fast eGFR decline whereas length of hospitalization and vasopressor use with greater odds. CART analysis identified length of hospitalization as the most important factor and that patients with AKI and hospitalization of 27 days or more were at highest risk. After grouping by ICU and COVID-19 status and propensity score matching for demographics, these risk factors of fast eGFR decline remained consistent. CART analysis can help identify patient subgroups with the highest risk of post-discharge eGFR decline. Clinicians should consider the length of hospitalization in post-discharge monitoring of kidney function.
Citation: Wang W, Zhu W, Hajagos J, Fochtmann L, Koraishy FM (2025) Classification and Regression Trees analysis identifies patients at high risk for kidney function decline following hospitalization. PLoS ONE 20(1): e0317558. https://doi.org/10.1371/journal.pone.0317558
Editor: Keiko Hosohata, Osaka University of Pharmaceutical Sciences, JAPAN
Received: November 6, 2024; Accepted: December 23, 2024; Published: January 31, 2025
Copyright: © 2025 Wang et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Fast decline in kidney function, as measured by estimated glomerular filtration rate (eGFR), is a hallmark of the development and progression of chronic kidney disease (CKD), and associated with adverse health outcomes [1–4]. Inadequate kidney recovers after acute kidney injury (AKI), a common diagnosis in hospitalized patients, is strongly associated with the risk of CKD [5]. Among hospitalized patients, post-discharge eGFR decline is seen in patients with or without AKI and associated with increased risk of death [6–8].
Machine learning (ML) models have been used to predict the risk of kidney disease [9–11]. In studies of patients with CKD, logistic regression (LR) and random forest (RF) algorithms have been used to identify patients at risk of fast eGFR decline [12, 13]. Although predictive models of eGFR decline identify important clinical factors, the relative importance of these factors and the combination of factors that identify highest and lowest risk patient subgroups is less extensively studied. Classification and Regression Trees (CART) is a non-parametric and non-linear supervised ML algorithm that identifies the various factors associated with clinical outcomes in an open-box decision tree approach. The feature importance of each node (clinical variable) is determined by its hierarchy in the decision tree. The resulting cluster and hierarchy of nodes can identify the patient subgroups at lowest and highest risk of the clinical outcome. CART has been used in the study of kidney diseases including AKI [14–17], urinary obstruction [18], and dialysis patients [19]. Although decision tree algorithms have also been used to predict CKD [20–26], their use in the prediction of eGFR decline has rarely been reported [27, 28]. The only study that clearly identified the use of CART to predict eGFR decline, was limited to a small, select group of patients and did not investigate post-hospitalization eGFR decline [28].
We recently reported the characteristics of Coronavirus disease 2019 (COVID-19) associated AKI in hospitalized patients in a large national cohort of 336,473 patients with COVID-19, out of whom 129,176 (38%) patients had AKI [29]. In a previous study at our medical center, we reported that among hospitalized patients, the prevalence of AKI was 41.3% in those with COVID-19, while only 24.2% in those without COVID [30]. Although COVID-19 associated AKI was recently reported to be associated with a lower risk of kidney outcomes compared to AKI due to other causes [31], the study of post-discharge kidney function in those without AKI is lacking. There is also lack of data on the use of ML models to predict factors associated with eGFR decline in patients hospitalized with COVID-19.
We hypothesized that using CART, we will be able to determine the relative importance of clinical risk factors and identify patients at the highest risk of kidney function decline after hospitalization during the first wave of COVID-19 pandemic. In this study, we employed RF to identify the most significant factors associated with post-hospitalization eGFR decline in patients admitted at Stony Brook University Hospital (SBUH), and then used LR and CART to determine the relative importance of these factors and the patient sub-groups at the highest and lowest risk.
Materials and method
Study design and participants
We conducted a retrospective cohort study on patients hospitalized at Stony Brook University Hospital (SBUH) from March 6th, 2020, to December 31st, 2020, during the first year of the COVID-19 pandemic. SBUH is the largest tertiary care centers in Long Island, NY that treated one of the highest numbers of COVID-19 patients in the United States in 2020. All patients who were discharged alive from the hospital were followed until February 16th, 2023, for assessment of kidney function (using outpatient eGFR values). We excluded patients who were ≤18 years of age, pregnant, or who had end stage kidney disease (ESKD), including chronic dialysis or kidney transplant. The study was approved by the SBU Institutional Review Board (IRB # 2020–00239).
Consent was not obtained (waived) since the data were deidentified and analyzed anonymously.
Privacy protection and data security
The data were accessed for research purposes on 01/06/2023 and 01/12/2023. Initially, the authors involved in the formal analysis had access to Protected Health Information (PHI) when it was extracted from the SBUH electronic health records (EHR). These files were stored on a secure, HIPAA-compliant server. PHI was promptly removed, and all subsequent analyses were conducted on de-identified patient data. All other authors had access only to de-identified data and summary results during research meetings.
Outcome (eGFR decline) definition
Hospital survivors were followed for 36 months post discharge from index hospitalization. For each patient, a baseline eGFR was identified during or before the index hospitalization. The change in eGFR was estimated using this baseline eGFR and the most recent eGFR for all patients with an outpatient eGFR value 90 or more days after discharge from the index hospitalization. The target outcome of “fast eGFR decline” was defined as ≥5 ml/min/1.73 m2 per year [32]. The ‘control group’ was patients without fast eGFR decline during follow-up.
Data collection and definition of other variables
Information pertaining to data collection and the definition of variables is detailed in the S1 Methods section.
Univariate analyses
If more than 5% of values for a variable were missing, the variable was removed from the analysis. In each subset and propensity matched dataset, univariate LR was used to identify potential risk factors for the target outcome (fast versus not fast eGFR decline). For continuous variables, we summarized the mean value and standard deviation for each patient group (ICU admitted versus not admitted, COVID-19 positive versus COVID-19 negative, and with fast eGFR decline versus without fast eGFR decline). For binary variables (true or false), we determined the number and corresponding proportion in each patient group. P-value < 0.05 was considered as the cutoff for statistical significance.
Summary of the multivariate and machine learning analyses
We generated multivariate LR models and CART decision trees to predict whether hospital survivors will develop fast eGFR decline during the follow-up period. The potential risk factors were selected based on RF analyses. By default, in each RF model, the number of decision trees was set to 500 and the number of variables used as a potential candidate split variable was 3. We chose the union of the top 10 variables based on the mean decrease accuracy (MDA) and the mean decrease in Gini (MDG) coefficient to fit LR and CART models.
In multivariate LR models, variables with significant odds ratio were considered as influential factors. In CART decision trees analyses, the minimum number of observations allowed in a terminal node was set to be 2% of the sample size N. The maximum depth of the decision tree was set to be 3. Variables that appeared as the split condition for the nodes in the decision tree were considered influential factors. We also calculated the overall prediction accuracy, sensitivity and specificity of each LR model and CART decision tree to compare the model performances.
The details of LR, RF and CART methods are in the S1 Methods section.
Propensity score matching (PSM) analysis
PSM is a quasi-experimental method used to establish an artificial control group by matching characteristics of each study unit with a control unit to better estimate the impact of a predictor. In our study, PSM was conducted using nearest neighbor matching in a 1:1 ratio without replacement. Propensity scores were estimated through logistic regression based on the important demographic covariates: sex, race, ethnicity and age. This method ensures that each treated unit (ICU admission or COVID-19 positive) is matched to a control unit (non-ICU admission or COVID-19 negative respectively) with the closest propensity score, thereby minimizing group differences and enabling direct comparisons.
To assess balance, we calculated absolute standardized mean differences (SMDs) for both the unadjusted (pre-matching) and adjusted (post-matching) datasets. The absolute mean difference plots demonstrate that all SMDs post-matching decreased to below 0.10, which is a commonly used threshold for acceptable balance, as previously reported [33]. This substantial reduction in SMDs indicates a significant improvement in balance after matching, ensuring the comparability of the matched groups (S13 and S14 Figs).
Results
1. Comparison of patients with and without fast eGFR decline
Of the cohort of 1,747 hospital survivors, 61.6% were noted to have a fast eGFR decline during a mean follow-up of 214.33 (±109.28) days (Table 1). Patients in the fast eGFR decline sub-group had a mean eGFR decline of 35.04 ml/min/1.73 m2 per year (standard deviation [SD] 34.44), whereas those in the control group (without fast eGFR decline) had a mean rise in eGFR of 11.84 ml/min/1.73 m2 per year (SD 21.00). Patients with fast eGFR decline were more likely to be younger, have a higher baseline eGFR, and had greater severity of hospital illness as reflected by a greater length of hospital stay (LOHS) and increased likelihood of requiring intensive care unit (ICU) admission, mechanical ventilation (MV), and vasopressors (Table 1). Patients with fast eGFR decline were 1.4 times more likely to have moderate/severe AKI (stages 2 and 3 ‐ AKI-2/3) during hospitalization (11.9 vs 8.4%).
Patients with fast eGFR decline were more likely to have a higher baseline eGFR (90.51 ± 29.52 mL/min/1.73m2) compared to those without fast eGFR decline (78.91 ± 29.12 mL/min/1.73m2). On further categorization by baseline eGFR, those with baseline eGFR >120 mL/min/1.73m2 were significantly more likely to be fast eGFR decliners, while those with baseline eGFR <60 mL/min/1.73m2 were more likely to be in the group without fast eGFR decline (Table 1).
1a. Machine learning analysis.
In RF analysis, LOHS was among the top three variables associated with fast versus not fast eGFR decline in both the MDA and MDG plots (S1 Fig). The other top variables included vasopressor use, age, COPD and BMI. Also, among the top 3 variables were AKI-2/3 in the COVID-19 negative subset (S2 Fig) and Hispanic ethnicity, MV and MV days in the COVID-19 positive subset (S3 Fig). The top ten variables from RF analysis were used for LR and CART decision-tree analysis.
In multivariate LR analysis of the whole cohort, longer LOHS, and vasopressor use were significantly associated with greater odds of fast (vs. not fast) eGFR decline whereas older age was associated with lower odds (Table 2). Greater odds of fast (vs. not fast) eGFR decline were significantly associated with longer LOHS and vasopressor use in the COVID-19 negative subset (S1 Table) and with diabetes mellitus (DM) in the COVID-19 positive subgroup (S2 Table).
In CART decision tree analysis of the full cohort, LOHS was the most important factor followed by vasopressor use and AKI-2/3 diagnosis (Fig 1). Those with AKI and hospital stay ≥ 27 days had the highest likelihood of fast (vs. not fast) eGFR decline, whereas the lowest risk was found in two subgroups (those without vasopressor use and body mass index [BMI] ≥ 22 and those with vasopressor use, but with a hospital stay < 6 days) (Fig 1). In the COVID-19 negative subgroup, LOHS was again the most important risk factor followed by vasopressor use (Fig 2). Patients with hospital stay ≥ 6 days, with vasopressor use and of female sex had the highest likelihood of fast eGFR decline, whereas those with hospital stay < 2 days and with coronary artery disease (CAD) had the lowest risk (Fig 2). In the COVID-19 positive subgroup, LOHS was also the most important risk factor followed by cancer diagnosis and age (Fig 3). Patients with hospital stay ≥ 15 days, age < 65 and BMI < 27 had the highest likelihood of fast eGFR decline, whereas those with hospital stay < 15 days with cancer and without hypertension (HTN) had the lowest risk (Fig 3).
The number of observations in a terminal node was set as at least 2% of the sample size. The percentage mentioned in the terminal node is the % of patients of the starting cohort of the analyses. In each terminal node, the risk of fast eGFR decline (vs. not fast) ranges from 0.00 (lowest) to 1.00 (highest). The color of the terminal node represents the risk associated with the tree attached to each node, with the intensity of green color indicating a stronger risk, while intensity of blue color representing a lower risk. The maximum depth of the decision tree was set to be 3.
The number of observations in a terminal node was set as at least 2% of the sample size. The percentage mentioned in the terminal node is the % of patients of the starting cohort of the analyses. In each terminal node, the risk of fast eGFR decline (vs. not fast) ranges from 0.00 (lowest) to 1.00 (highest). The color of the terminal node represents the risk associated with the tree attached to each node, with the intensity of green color indicating a stronger risk, while intensity of blue color representing a lower risk. The maximum depth of the decision tree was set to be 3.
The number of observations in a terminal node was set as at least 2% of the sample size. The percentage mentioned in the terminal node is the % of patients of the starting cohort of the analyses. In each terminal node, the risk of fast eGFR decline (vs. not fast) ranges from 0.00 (lowest) to 1.00 (highest). The color of the terminal node represents the risk associated with the tree attached to each node, with the intensity of green color indicating a stronger risk, while intensity of blue color representing a lower risk. The maximum depth of the decision tree was set to be 3.
2. Comparison of patients with and without ICU admission after PSM for demographics
Patients admitted to the ICU were more likely to require vasopressors or have a greater LOHS or diagnosis of sepsis (S3 Table). Patients with ICU admission were 3.7 times more likely to have AKI-2/3 during hospitalization and had a greater proportion of patients with fast eGFR decline (72.9 vs 53.9%) during follow-up compared those not admitted to the ICU.
2a. Machine learning analysis.
In the PSM matched cohort based on ICU status, LOHS was again among the top three variables associated with fast (vs. not fast) eGFR decline in RF analysis in both the MDA and MDG plots (S4 Fig). The other top variables were MV days, vasopressor use, age, and BMI. The top three variables also included ICU admission among the COVID-19 negative subgroup (S5 Fig) and MV among the COVID-19 positive subgroup (S6 Fig).
In multivariate LR analysis, longer LOHS and ICU admission were significantly associated with greater odds of fast (vs. not fast) eGFR decline whereas baseline CKD was associated with lower odds (Table 3). In the COVID-19 negative subset (S4 Table), older age and male sex were significantly associated with lower odds, whereas in the COVID-19 positive subgroup (S5 Table), DM was associated with greater odds of fast eGFR decline.
In CART analysis, ICU admission was the most important factor followed by LOHS and baseline CKD (Fig 4). The subgroups with ICU admission and either LOHS ≥ 32 days and White race, or LOHS< 32 days and age < 32 years had the highest likelihood of fast eGFR decline. The lowest risk occurred in those without an ICU admission who had CKD at baseline and a BMI<24 (Fig 4). In the COVID-19 negative subgroup, age was the most important factor followed by ICU admission and LOHS (S7 Fig). Patients with age < 70 years and hospital stay ≥ 44 days had the highest likelihood of fast eGFR decline, whereas those with age ≥ 70 years but < 81 years and no ICU admission had the lowest risk (S7 Fig). In the COVID-19 positive subgroup, LOHS was again the most important factor followed by DM diagnosis and BMI (S8 Fig). Patients with hospital stay ≥ 24 days, BMI < 35 and age ≥ 37 years had the highest likelihood of fast eGFR decline, whereas those with DM diagnosis but hospital stay < 3 days had the lowest risk (S8 Fig).
The number of observations in a terminal node was set as at least 2% of the sample size. The percentage mentioned in the terminal node is the % of patients of the starting cohort of the analyses. In each terminal node, the risk of fast eGFR decline (vs. not fast) ranges from 0.00 (lowest) to 1.00 (highest). The color of the terminal node represents the risk associated with the tree attached to each node, with the intensity of green color indicating a stronger risk, while intensity of blue color representing a lower risk. The maximum depth of the decision tree was set to be 3.
3. Comparison of patients with and without COVID-19 diagnosis after PSM for demographics
In the overall cohort, compared to those without COVID-19, patients admitted with COVID-19 diagnosis were more likely to have a greater mean decline in eGFR during follow-up, but the proportion of patients with fast eGFR decline was not significantly different (S6 Table). There was no significant difference in baseline age, sex or eGFR between the COVID positive and negative groups.
In the sub-group of patients who were propensity matched for demographics, compared to those without COVID-19, patients admitted with COVID-19 diagnosis were less likely to require vasopressors or have baseline HTN, CAD, and cancer; but were more likely to have an ICU admission, sepsis, ARDS, MV, and a greater LOHS (S7 Table). Patients with COVID-19 were 1.9 times more likely to have AKI-2/3 during hospitalization and a greater mean decline in eGFR during follow-up, but the proportion of patients with fast eGFR decline was not significantly different.
3a. Machine learning analysis.
In the PSM matched cohort based on COVID status, LOHS was again among the top 3 variables associated with fast (vs not fast) eGFR decline in RF analysis in both the MDA and MDG plots (S9 Fig). The other top variables were MV days, White race, age and BMI. The top 3 variables also included vasopressor use and Hispanic ethnicity among the COVID-19 negative subgroup (S10 Fig) and MV and Hispanic ethnicity among COVID-19 positive subset (S11 Fig).
In multivariate LR analysis, DM was significantly associated with greater odds of fast (vs. not fast) eGFR decline whereas older age was associated with lower odds (Table 4). In the COVID-19 negative subset, there were no statistically significant variables in multivariate LR (S8 Table). The findings of the COVID-19 positive subgroup are noted in S2 Table.
In CART analysis, LOHS was the most important factor followed by BMI and age (Fig 5). Those with hospital stay ≥ 8 days and age < 46 but ≥36 years had the highest likelihood of fast (vs. not fast) eGFR decline, whereas those with hospital stay < 8 days, BMI ≥37 but no DM diagnosis had the lowest risk (Fig 5). In the COVID-19 negative subgroup, LOHS was again the most important factor followed by BMI (S12 Fig). Patients with hospital stay ≥ 8 but < 12 days had the highest likelihood of fast) eGFR decline, whereas those with hospital stay < 2 days and BMI ≥ 31 had the lowest risk. The findings of the COVID-19 positive subgroup are noted in Fig 3.
The number of observations in a terminal node was set as at least 2% of the sample size. The percentage mentioned in the terminal node is the % of patients of the starting cohort of the analyses. In each terminal node, the risk of fast eGFR decline (vs. not fast) ranges from 0.00 (lowest) to 1.00 (highest). The color of the terminal node represents the risk associated with the tree attached to each node, with the intensity of green color indicating a stronger risk, while intensity of blue color representing a lower risk. The maximum depth of the decision tree was set to be 3.
4. Comparison of LR and CART methods
Both CART and LR methods showed similar accuracy and predictive power (S9 Table) in each dataset (primary and PSM cohorts). Both methods showed high specificity but low sensitivity for outcome prediction. The sensitivity and accuracy were comparatively higher in patients diagnosed with COVID-19 compared to those without.
Discussion
In this study of 1,747 hospital survivors with and without COVID-19, we identified age, baseline eGFR, greater severity of hospital illness, and moderate/severe AKI as the key hospital factors associated with fast kidney function decline after hospitalization. Among the factors associated with severity of hospital illness, length of hospital stay (LOHS) was the most important factor followed by vasopressor use in both LR and CART analysis. Using CART, we were able to identify the patient sub-groups with the highest and lowest risk of post-hospitalization fast eGFR decline. After stratification of the cohort based on ICU and COVID-19 status, other risk factors such as admission to the ICU and baseline CKD, DM, and BMI were identified. In all analyses, LOHS emerged as a highly significant factor. To our knowledge this is the first study to report the use of CART for identifying the patient sub-groups associated with the highest risk of fast post-hospitalization eGFR decline among patients hospitalized during the first year of the pandemic.
Severity of acute illness, patient characteristics, and complications during the hospitalization are associated with longer LOHS [34–37]. Longer LOHS has been associated with severity of kidney disease in the hospital [38, 39], and adverse post-discharge health outcomes [40, 41]. Similar findings have also been reported during the COVID-19 pandemic [42–44]. In a study of patients with COVID-19 who had AKI in the hospital, post-discharge eGFR decline was associated with longer hospitalization [45]. In this study, we report that length of stay during hospitalization is significantly associated with post-discharge eGFR decline in both LR and CART analyses in one of the largest patient cohorts to-date from the pandemic era. This finding has significant implications for the outpatient kidney monitoring and risk stratification of patients who were recently hospitalized.
Although the severity of illness in the hospital, including the diagnosis of AKI, is well known to be associated with worse long-term kidney outcomes, in our study we report other less recognized associations with fast post-discharge eGFR decline. For example, we found patients with fast eGFR decline were more likely to have a higher baseline eGFR. Baseline eGFR is known to be associated with eGFR change after AKI [46], and with faster decline in eGFR over time [6, 47, 48]. Pathologic glomerular hyperfiltration resulting in higher eGFR values might be a contributing factor [49]. While did not find a difference in CKD diagnosis (Table 1) in the eGFR decline groups, it’s not clear why patients with low baseline eGFR might have slower eGFR decline, although the number of patients with eGFR < 60 mL/min/1.73m2 in our study was low (only 21% of the cohort). This interesting association of baseline eGFR with eGFR decline certainly needs further exploration in research studies.
Another interesting finding in our study was that younger age was associated with higher risk of fast post-discharge eGFR decline. This association has been previously reported [50] and might be related to inadequacy of current eGFR estimates in older individuals [51] and possibly to different pathophysiological mechanisms of CKD progression [52]. Both findings have important clinical implications in renal monitoring of patients after hospital discharge.
As an advancement over previous models using Cox proportional hazards [53], newer ML-based models including gradient boosting, regression splines and random forest have been developed to predict kidney disease [54–60]. Decision tree analysis can identify the patient subgroups at lowest and highest risk of the eGFR decline through resulting cluster and hierarchy of decision nodes. CART has been used in the study of kidney diseases [14–19], however, the use of CART to predict eGFR decline has rarely been reported. A recent study used CART to identify factors associated with eGFR decline but was limited by a small sample and restricted to patients with partial nephrectomy [28]. To our knowledge, our study is the first to use CART to analyze factors associated with post-discharge eGFR decline in patients with and without COVID-19. Using CART analysis, we found that LOHS was the most important factor followed by vasopressor use and AKI-2/3 diagnosis. Those with AKI and hospital stay ≥ 27 days had the highest likelihood of fast (vs. not fast) eGFR decline, whereas the lowest risk was found in two subgroups: those with no vasopressor use and BMI≥ 22 and those with vasopressor use, but with a hospital stay < 6 days. These findings highlight the importance of evaluating recent hospitalization data of patients while monitoring kidney function in outpatient clinics.
Our study also highlights the importance of utilizing a non-parametric supervised learning algorithm like CART to identify high and low risk patient sub-groups, rather than relying solely on a traditional approach like LR which only identifies individual clinical factors. The application of CART models in clinical practice, particularly in ICU settings, offers several advantages [61, 62] CART’s ability to generate simple and interpretable decision trees makes it a practical tool for risk stratification and decision-making in high-risk environments. For ICU patients, who often experience a higher frequency of AKI due to severity illness and other nephrotoxic insults and consequently have a high risk of post-discharge eGFR decline and CKD progression. CART can help identify high-risk individuals early, allowing for targeted interventions such as nephroprotective measures or closer monitoring. Furthermore, CART’s flexibility in handling complex interactions between variables, such as baseline renal function, comorbidities, laboratory parameters and treatment modalities, is particularly relevant for ICU populations. Integrating CART models into electronic health record systems could facilitate real-time risk assessments, improving patient outcomes [63, 64]. However, successful implementation requires rigorous validation of the model in diverse ICU settings and the availability of high-quality, real-time data. This should be the focus of future validation and implementation studies.
During the pandemic period, multiple factors including COVID-19 infection were found to be associated with increased odds of rapid kidney function decline [65]. However, there have been only a few studies that report the use of ML to predict eGFR decline in hospital survivors from the first year of the pandemic when the treatment of this disease was evolving, and mass public vaccinations had not yet started [66]. Vaid et al. used several ML models for predicting dialysis requirement and death in patients hospitalized with COVID-19 and found an XGBoost model without imputation to have the highest accuracy [67]. We had previously reported the use of XGBoost to predict recovery after AKI in patients with and without COVID-19 [30]. The use of ML models, especially decision tree analysis, to predict post-hospitalization eGFR decline in COVID-19 survivors has been rarely reported. In a small study of 37 critically ill patients with COVID-19 during the first year of the pandemic, One Rule and decision trees methods were used to classify patients for risk of CKD and mortality [68]. However, this study was limited in design, sample size and focused only on in-hospital outcomes. In our study, among patients with COVID-19, baseline DM was significantly associated with greater odds of fast eGFR decline whereas older age was associated with lower odds in multivariate LR analysis. In CART analysis, LOHS was the most important factor followed by BMI and age. This data provides valuable insights into high-risk patients admitted with COVID-19 who require closer kidney monitoring after discharge.
Our study shows comparable accuracy between LR and CART as previously reported [69, 70]. Besides CART, other decision trees used in the prediction of kidney disease have had variable accuracy compared to other ML techniques [23–27]. CART’s strengths lie in its simplicity and ease of interpretation due to its binary tree structure, unlike methods with multi-way splits. Its tree pruning technique, which grows a large tree and prunes it to optimal size, effectively prevents overfitting. As a non-parametric approach that is applicable to both classification and regression tasks, CART does not assume data distribution, making it particularly effective for modeling nonlinear relationships that parametric methods like LR may not handle well.
In our study, we used RF for selecting the top ten most significant features associated with fast eGFR decline. We then undertook LR and CART analysis to study the relative importance of these features. Previous studies have shown that the employment of ML for feature selection before decision tree analyses increases accuracy [26, 71].
Our study had several limitations. Due to our study requirements of at least two outpatient eGFR values more than 90 days after hospital discharge, only a third of the hospital survivors during the pandemic had data available to evaluate post-discharge eGFR decline. We did not have accurate urine output data and AKI in the hospital was diagnosed by the serum creatinine criteria only. A significant proportion of hospitalized patients did not have pre-hospitalization baseline eGFR available, and in those cases, the lowest serum creatinine during hospitalization was used to estimate baseline eGFR. Since most patients in our cohort had normal baseline kidney function (mean baseline eGFR of 86.06 ±29.90 ml/min/1.73m2 and mean final eGFR of 77.38 ±30.11 ml/min/1.73m2), we were not able to study other outcomes associated with CKD progression such as incident CKD, > 40% eGFR decline or incident ESKD. The lack of data on proteinuria, retrospective design, and use of data from a single center are additional limitations.
In conclusion, we report that use of combined ML techniques can provide a comprehensive understanding of the key factors associated with kidney outcomes. CART analysis can help identify the subgroups of hospitalized patients with the highest and lowest risk of post-discharge eGFR decline. In this study, CART identified length of hospitalization as the most important factor. Based on these findings, we advise clinicians to consider the length of hospitalization in their post-discharge monitoring of kidney function. We anticipate that identification of high-risk hospitalized patients through CART can significantly improve the post-discharge clinical management. Further studies in other healthcare systems are required to validate our findings.
Supporting information
S1 Fig. Random Forest for fast eGFR decline in the original dataset (N = 1747).
https://doi.org/10.1371/journal.pone.0317558.s001
(DOCX)
S2 Fig. Random Forest for fast eGFR decline in the COVID negative subset (N = 1487).
https://doi.org/10.1371/journal.pone.0317558.s002
(DOCX)
S3 Fig. Random Forest for fast eGFR decline in the COVID positive subset (N = 260).
https://doi.org/10.1371/journal.pone.0317558.s003
(DOCX)
S4 Fig. Random Forest for fast eGFR decline in the ICU matched subset (N = 612).
https://doi.org/10.1371/journal.pone.0317558.s004
(DOCX)
S5 Fig. Random Forest for fast eGFR decline in the ICU matched COVID negative subset (N = 510).
https://doi.org/10.1371/journal.pone.0317558.s005
(DOCX)
S6 Fig. Random Forest for fast eGFR decline in the ICU matched COVID positive subset (N = 102).
https://doi.org/10.1371/journal.pone.0317558.s006
(DOCX)
S7 Fig. CART decision tree for fast eGFR decline in the COVID-19 negative subgroup of the PSM matched ICU subset (N = 510).
https://doi.org/10.1371/journal.pone.0317558.s007
(DOCX)
S8 Fig. CART decision tree for fast eGFR decline in the COVID-19 positive subgroup of the PSM matched ICU subset (N = 102).
https://doi.org/10.1371/journal.pone.0317558.s008
(DOCX)
S9 Fig. Random Forest for fast eGFR decline in the COVID matched subset (N = 520).
https://doi.org/10.1371/journal.pone.0317558.s009
(DOCX)
S10 Fig. Random Forest for fast eGFR decline in the COVID matched COVID negative subset (N = 260).
https://doi.org/10.1371/journal.pone.0317558.s010
(DOCX)
S11 Fig. Random Forest for fast eGFR decline in the COVID matched COVID positive subset (N = 260).
https://doi.org/10.1371/journal.pone.0317558.s011
(DOCX)
S12 Fig. CART decision tree for fast eGFR decline in the COVID negative subgroup of the PSM matched COVID-19 subset (N = 260).
https://doi.org/10.1371/journal.pone.0317558.s012
(DOCX)
S13 Fig. Standardized Mean Differences (SMDs) Before and After Matching for ICU Admission Groups.
https://doi.org/10.1371/journal.pone.0317558.s013
(DOCX)
S14 Fig. Standardized Mean Differences (SMDs) Before and After Matching for COVID-19 Diagnosis Groups.
https://doi.org/10.1371/journal.pone.0317558.s014
(DOCX)
S1 Table. Logistic regression for fast eGFR decline in the COVID negative subset (N = 1487).
https://doi.org/10.1371/journal.pone.0317558.s015
(DOCX)
S2 Table. Logistic regression for fast eGFR decline in the COVID positive subset (N = 260).
https://doi.org/10.1371/journal.pone.0317558.s016
(DOCX)
S3 Table. Univariate analysis of the followed-up patients admitted and not admitted to the ICU after PSM on 4 demographic variables.
https://doi.org/10.1371/journal.pone.0317558.s017
(DOCX)
S4 Table. Logistic regression for fast eGFR decline in the COVID negative subgroup of the PSM matched ICU subset (N = 510).
https://doi.org/10.1371/journal.pone.0317558.s018
(DOCX)
S5 Table. Logistic regression for fast eGFR decline in the COVID positive subgroup of the PSM matched ICU subset (N = 102).
https://doi.org/10.1371/journal.pone.0317558.s019
(DOCX)
S6 Table. Univariate analysis of the followed-up patients with and without COVID-19 (N = 1,747).
https://doi.org/10.1371/journal.pone.0317558.s020
(DOCX)
S7 Table. Univariate analysis of the followed-up patients with and without COVID-19 after PSM on 4 demographic variables.
https://doi.org/10.1371/journal.pone.0317558.s021
(DOCX)
S8 Table. Logistic regression for fast eGFR decline in the COVID-negative subgroup of the PSM matched COVID-19 subset (N = 260).
https://doi.org/10.1371/journal.pone.0317558.s022
(DOCX)
S9 Table. Comparisons between CART decision tree and logistic regression models.
https://doi.org/10.1371/journal.pone.0317558.s023
(DOCX)
References
- 1. Kim HJ, Kim DW, Rhee H, Song SH, Park SK, Kim SW, et al. Rapid decline in kidney function is associated with rapid deterioration of health-related quality of life in chronic kidney disease. Sci Rep. 2023;13(1):1786. pmid:36720969
- 2. Ali I, Chinnadurai R, Ibrahim ST, Kalra PA. Adverse outcomes associated with rapid linear and non-linear patterns of chronic kidney disease progression. BMC Nephrol. 2021;22(1):82. pmid:33676423
- 3. Rifkin DE, Shlipak MG, Katz R, Fried LF, Siscovick D, Chonchol M, et al. Rapid kidney function decline and mortality risk in older adults. Arch Intern Med. 2008;168(20):2212–8. pmid:19001197
- 4. Hussain J, Grubic N, Akbari A, Canney M, Elliott MJ, Ravani P, et al. Associations between modest reductions in kidney function and adverse outcomes in young adults: retrospective, population based cohort study. BMJ. 2023;381:e075062. pmid:37353230
- 5. Kurzhagen JT, Dellepiane S, Cantaluppi V, Rabb H. AKI: an increasingly recognized risk factor for CKD development and progression. J Nephrol. 2020;33(6):1171–87. pmid:32651850
- 6. Sawhney S, Marks A, Fluck N, Levin A, McLernon D, Prescott G, et al. Post-discharge kidney function is associated with subsequent ten-year renal progression risk among survivors of acute kidney injury. Kidney Int. 2017;92(2):440–52.
- 7. Haines RW, Powell-Tuck J, Leonard H, Crichton S, Ostermann M. Long-term kidney function of patients discharged from hospital after an intensive care admission: observational cohort study. Sci Rep. 2021;11(1):9928. pmid:33976354
- 8. Lai CF, Wu VC, Huang TM, Yeh YC, Wang KC, Han YY, et al. Kidney function decline after a non-dialysis-requiring acute kidney injury is associated with higher long-term mortality in critically ill survivors. Crit Care. 2012;16(4):R123. pmid:22789111
- 9. Saha I, Gourisaria MK, Harshvardhan GM, editors. Classification System for Prediction of Chronic Kidney Disease Using Data Mining Techniques. Advances in Data and Information Sciences; 2022 2022//; Singapore: Springer Singapore.
- 10. Poonia RC, Gupta MK, Abunadi I, Albraikan AA, Al-Wesabi FN, Hamza MA, et al. Intelligent Diagnostic Prediction and Classification Models for Detection of Kidney Disease. Healthcare. 2022;10(2):371.
- 11. Islam MA, Majumder MZH, Hussein MA. Chronic kidney disease prediction based on machine learning algorithms. J Pathol Inform. 2023;14:100189. pmid:36714452
- 12. Inaguma D, Hayashi H, Yanagiya R, Koseki A, Iwamori T, Kudo M, et al. Development of a machine learning-based prediction model for extremely rapid decline in estimated glomerular filtration rate in patients with chronic kidney disease: a retrospective cohort study using a large data set from a hospital in Japan. BMJ Open. 2022;12(6):e058833. pmid:35680264
- 13. Inaguma D, Kitagawa A, Yanagiya R, Koseki A, Iwamori T, Kudo M, et al. Increasing tendency of urine protein is a risk factor for rapid eGFR decline in patients with CKD: A machine learning-based prediction model by using a big database. PLoS One. 2020;15(9):e0239262. pmid:32941535
- 14. Wang R, Zhang J, He M, Xu J. Classification and Regression Tree Predictive Model for Acute Kidney Injury in Traumatic Brain Injury Patients. Ther Clin Risk Manag. 2024;20:139–49. pmid:38410117
- 15. Chi R, Liang M, Zou Q, Li C, Zhou H, Jian Z. [Construction and validation of a decision tree based on biomarkers for predicting severe acute kidney injury in critically ill patients]. Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2020;32(6):721–5. pmid:32684220
- 16. Schneider DF, Dobrowolsky A, Shakir IA, Sinacore JM, Mosier MJ, Gamelli RL. Predicting acute kidney injury among burn patients in the 21st century: a classification and regression tree analysis. J Burn Care Res. 2012;33(2):242–51. pmid:22370901
- 17. Gao W, Wang J, Zhou L, Luo Q, Lao Y, Lyu H, et al. Prediction of acute kidney injury in ICU with gradient boosting decision tree algorithms. Computers in Biology and Medicine. 2022;140:105097. pmid:34864304
- 18. Binongo JN, Taylor A, Hill AN, Schmotzer B, Halkar R, Folks R, et al. Use of classification and regression trees in diuresis renography. Acad Radiol. 2007;14(3):306–11. pmid:17307663
- 19. Lin S-J, Liu C-C, Tsai DMT, Shih Y-H, Lin C-L, Hsu Y-C. Prediction Models Using Decision Tree and Logistic Regression Method for Predicting Hospital Revisits in Peritoneal Dialysis Patients. Diagnostics. 2024;14(6):620. pmid:38535040
- 20. Debal DA, Sitote TM. Chronic kidney disease prediction using machine learning techniques. Journal of Big Data. 2022;9(1):109.
- 21. Tekale S, Shingavi P, Wandhekar S, Chatorikar A. Prediction of chronic kidney disease using machine learning algorithm. International Journal of Advanced Research in Computer and Communication Engineering. 2018;7(10):92–6.
- 22. Khan SH. Predictive models for chronic renal disease using decision trees, naïve bayes and case-based methods [Student thesis]2010.
- 23. Ilyas H, Ali S, Ponum M, Hasan O, Mahmood MT, Iftikhar M, et al. Chronic kidney disease diagnosis using decision tree algorithms. BMC Nephrol. 2021;22(1):273. pmid:34372817
- 24. Pasadana IA, Hartama D, Zarlis M, Sianipar AS, Munandar A, Baeha S, et al. Chronic Kidney Disease Prediction by Using Different Decision Tree Techniques. Journal of Physics: Conference Series. 2019;1255(1):012024.
- 25. Chiu Y-L, Jhou M-J, Lee T-S, Lu C-J, Chen M-S. Health Data-Driven Machine Learning Algorithms Applied to Risk Indicators Assessment for Chronic Kidney Disease. Risk Management and Healthcare Policy. 2021;14(null):4401–12. pmid:34737657
- 26. Shih CC, Lu CJ, Chen GD, Chang CC. Risk Prediction for Early Chronic Kidney Disease: Results from an Adult Health Examination Program of 19,270 Individuals. Int J Environ Res Public Health. 2020;17(14).
- 27. Cao X, Lin Y, Yang B, Li Y, Zhou J. Comparison Between Statistical Model and Machine Learning Methods for Predicting the Risk of Renal Function Decline Using Routine Clinical Data in Health Screening. Risk Manag Healthc Policy. 2022;15:817–26. pmid:35502445
- 28. Uleri A, Baboudjian M, Gallioli A, Territo A, Gaya JM, Sanz I, et al. A new machine-learning model to predict long-term renal function impairment after minimally invasive partial nephrectomy: the Fundacio Puigvert predictive model. World J Urol. 2023;41(11):2985–90.
- 29. Yoo YJ, Wilkins KJ, Alakwaa F, Liu F, Torre-Healy LA, Krichevsky S, et al. Geographic and Temporal Trends in COVID-Associated Acute Kidney Injury in the National COVID Cohort Collaborative. Clin J Am Soc Nephrol. 2023;18(8):1006–18. pmid:37131278
- 30. Sun S, Annadi RR, Chaudhri I, Munir K, Hajagos J, Saltz J, et al. Short- and Long-Term Recovery after Moderate/Severe AKI in Patients with and without COVID-19. Kidney360. 2022;3(2):242–57. pmid:35373118
- 31. Aklilu AM, Kumar S, Nugent J, Yamamoto Y, Coronel-Moreno C, Kadhim B, et al. COVID-19-Associated Acute Kidney Injury and Longitudinal Kidney Outcomes. JAMA Intern Med. 2024. pmid:38407862
- 32. Koraishy FM, Hooks-Anderson D, Salas J, Rauchman M, Scherrer JF. Fast GFR decline and progression to CKD among primary care patients with preserved GFR. Int Urol Nephrol. 2018;50(3):501–8. pmid:29404927
- 33. Austin PC. Balance diagnostics for comparing the distribution of baseline covariates between treatment groups in propensity-score matched samples. Stat Med. 2009;28(25):3083–107. pmid:19757444
- 34. Roger C, Debuyzer E, Dehl M, Bulaid Y, Lamrani A, Havet E, et al. Factors associated with hospital stay length, discharge destination, and 30-day readmission rate after primary hip or knee arthroplasty: Retrospective Cohort Study. Orthop Traumatol Surg Res. 2019;105(5):949–55. pmid:31208932
- 35. Inabnit LS, Blanchette C, Ruban C. Comorbidities and length of stay in chronic obstructive pulmonary disease patients. COPD. 2018;15(4):355–60. pmid:30273058
- 36. Tefera GM, Feyisa BB, Umeta GT, Kebede TM. Predictors of prolonged length of hospital stay and in-hospital mortality among adult patients admitted at the surgical ward of Jimma University medical center, Ethiopia: prospective observational study. J Pharm Policy Pract. 2020;13:24. pmid:32549990
- 37. Marfil-Garza BA, Belaunzaran-Zamudio PF, Gulias-Herrero A, Zuniga AC, Caro-Vega Y, Kershenobich-Stalnikowitz D, et al. Risk factors associated with prolonged hospital length-of-stay: 18-year retrospective study of hospitalizations in a tertiary healthcare center in Mexico. PLoS One. 2018;13(11):e0207203. pmid:30408118
- 38. Su G, Xu H, Marrone G, Lindholm B, Wen Z, Liu X, et al. Chronic kidney disease is associated with poorer in-hospital outcomes in patients hospitalized with infections: Electronic record analysis from China. Sci Rep. 2017;7(1):11530. pmid:28912532
- 39. Tam-Tham H, Ravani P, Zhang J, Weaver RG, Quinn RR, James MT, et al. Association of Initiation of Dialysis With Hospital Length of Stay and Intensity of Care in Older Adults With Kidney Failure. JAMA Netw Open. 2020;3(2):e200222. pmid:32108896
- 40. Han TS, Murray P, Robin J, Wilkinson P, Fluck D, Fry CH. Evaluation of the association of length of stay in hospital and outcomes. Int J Qual Health Care. 2022;34(2). pmid:34918090
- 41. Sud M, Yu B, Wijeysundera HC, Austin PC, Ko DT, Braga J, et al. Associations Between Short or Long Length of Stay and 30-Day Readmission and Mortality in Hospitalized Patients With Heart Failure. JACC Heart Fail. 2017;5(8):578–88.
- 42. Peixoto SG, Wolf JM, Glaeser AB, Maccari JG, Nasi LA. Longer length of stay, days between discharge/first readmission, and pulmonary involvement >/ = 50% increase prevalence of admissions in ICU in unplanned readmissions after COVID-19 hospitalizations. J Med Virol. 2022;94(8):3750–6.
- 43. Mastaneh Z, Mouseli A, Mohseni S, Dadipoor S. Predictors of hospital length of stay and mortality among COVID-19 inpatients during 2020–2021 in Hormozgan Province of Iran: A retrospective cohort study. Health Sci Rep. 2023;6(6):e1329. pmid:37324249
- 44. Lomba GSB, Silva P, Rosario NFD, Medeiros T, Alves LS, Silva AA, et al. Post-discharge all-cause mortality in COVID-19 recovered patients hospitalized in 2020: the impact of chronic kidney disease. Rev Inst Med Trop Sao Paulo. 2024;66:e1. pmid:38198374
- 45. Bento GAO, Leite VLT, Campos RP, Vaz FB, Daher EF, Duarte DB. Reduction of estimated glomerular filtration rate after COVID-19-associated acute kidney injury. J Bras Nefrol. 2023;45(4):488–94. pmid:37311051
- 46. Jensen SK, Heide-Jorgensen U, Vestergaard SV, Gammelager H, Birn H, Nitsch D, et al. Kidney function before and after acute kidney injury: a nationwide population-based cohort study. Clin Kidney J. 2023;16(3):484–93. pmid:36865015
- 47. Melsom T, Nair V, Schei J, Mariani L, Stefansson VTN, Harder JL, et al. Correlation Between Baseline GFR and Subsequent Change in GFR in Norwegian Adults Without Diabetes and in Pima Indians. Am J Kidney Dis. 2019;73(6):777–85. pmid:30704883
- 48. Baba M, Shimbo T, Horio M, Ando M, Yasuda Y, Komatsu Y, et al. Longitudinal Study of the Decline in Renal Function in Healthy Subjects. PLoS One. 2015;10(6):e0129036. pmid:26061083
- 49. van der Burgh AC, Rizopoulos D, Ikram MA, Hoorn EJ, Chaker L. Determinants of the Evolution of Kidney Function With Age. Kidney Int Rep. 2021;6(12):3054–63. pmid:34901574
- 50. Buyadaa O, Salim A, Morton JI, Magliano DJ, Shaw JE. Rate of decline in kidney function and known age-of-onset or duration of type 2 diabetes. Sci Rep. 2021;11(1):14705. pmid:34282181
- 51. Liu P, Quinn RR, Lam NN, Elliott MJ, Xu Y, James MT, et al. Accounting for Age in the Definition of Chronic Kidney Disease. JAMA Intern Med. 2021;181(10):1359–66. pmid:34459844
- 52. Douville P, Martel AR, Talbot J, Desmeules S, Langlois S, Agharazii M. Impact of age on glomerular filtration estimates. Nephrol Dial Transplant. 2009;24(1):97–103. pmid:18772177
- 53. Tangri N, Inker LA, Hiebert B, Wong J, Naimark D, Kent D, et al. A Dynamic Predictive Model for Progression of CKD. Am J Kidney Dis. 2017;69(4):514–20. pmid:27693260
- 54. Ferguson T, Ravani P, Sood MM, Clarke A, Komenda P, Rigatto C, et al. Development and External Validation of a Machine Learning Model for Progression of CKD. Kidney Int Rep. 2022;7(8):1772–81. pmid:35967110
- 55. Aoki J, Kaya C, Khalid O, Kothari T, Silberman MA, Skordis C, et al. CKD Progression Prediction in a Diverse US Population: A Machine-Learning Model. Kidney Med. 2023;5(9):100692. pmid:37637863
- 56. Chauhan K, Nadkarni GN, Fleming F, McCullough J, He CJ, Quackenbush J, et al. Initial Validation of a Machine Learning-Derived Prognostic Test (KidneyIntelX) Integrating Biomarkers and Electronic Health Record Data To Predict Longitudinal Kidney Outcomes. Kidney360. 2020;1(8):731–9. pmid:35372952
- 57. Tsai MH, Jhou MJ, Liu TC, Fang YW, Lu CJ. An integrated machine learning predictive scheme for longitudinal laboratory data to evaluate the factors determining renal function changes in patients with different chronic kidney disease stages. Front Med (Lausanne). 2023;10:1155426. pmid:37859858
- 58. Xiao J, Ding R, Xu X, Guan H, Feng X, Sun T, et al. Comparison and development of machine learning tools in the prediction of chronic kidney disease progression. J Transl Med. 2019;17(1):119. pmid:30971285
- 59. Hong MS, Lee YH, Kong JM, Kwon OJ, Jung CW, Yang J, et al. Personalized Prediction of Kidney Function Decline and Network Analysis of the Risk Factors after Kidney Transplantation Using Nationwide Cohort Data. J Clin Med. 2022;11(5).
- 60. Kaufman HW, Wang C, Wang Y, Han H, Chaudhuri S, Usvyat L, et al. Machine Learning Case Study: Patterns of Kidney Function Decline and Their Association With Clinical Outcomes Within 90 Days After the Initiation of Renal Dialysis. Adv Kidney Dis Health. 2023;30(1):33–9.
- 61. Berney SC, Gordon IR, Opdam HI, Denehy L. A classification and regression tree to assist clinical decision making in airway management for patients with cervical spinal cord injury. Spinal Cord. 2011;49(2):244–50. pmid:20697418
- 62. Kilincer Bozgul SM, Kurtulmus IA, Yargucu Zihni F, Akad Soyer N, Yagmur B, Gunes A, et al. Classification and Regression Tree Analysis in Adult Hemophagocytic Syndrome: Identifying Acute Heart Failure Predictors in ICU Patients. Journal of inflammation research. 2024;17(null):9711–23. pmid:39618930
- 63. Abu-Hanna A, de Keizer N. Integrating classification trees with local logistic regression in Intensive Care prognosis. Artificial Intelligence in Medicine. 2003;29(1):5–23. pmid:12957778
- 64. Trujillano J, Badia M, Serviá L, March J, Rodriguez-Pozo A. Stratification of the severity of critically ill patients with classification trees. BMC Medical Research Methodology. 2009;9(1):83. pmid:20003229
- 65. Diamantidis CJ, Cook DJ, Redelosa CK, Vinculado RB, Cabajar AA, Vassalotti JA. CKD and Rapid Kidney Function Decline During the COVID-19 Pandemic. Kidney Med. 2023;5(9):100701. pmid:37649727
- 66. Mahmood MA, Lata P, editors. A Machine Learning Approach to Predict Renal Diseases with SARS-CoV-2. 2021 Emerging Technology in Computing, Communication and Electronics (ETCCE); 2021 21–23 Dec. 2021.
- 67. Vaid A, Chan L, Chaudhary K, Jaladanki SK, Paranjpe I, Russak A, et al. Predictive Approaches for Acute Dialysis Requirement and Death in COVID-19. Clin J Am Soc Nephrol. 2021;16(8):1158–68. pmid:34031183
- 68. Herzog AL, von Jouanne-Diedrich HK, Wanner C, Weismann D, Schlesinger T, Meybohm P, et al. COVID-19 and the kidney: A retrospective analysis of 37 critically ill patients using machine learning. PLoS One. 2021;16(5):e0251932. pmid:34015009
- 69. Rudolfer SM, Paliouras G, Peers IS. A comparison of logistic regression to decision tree induction in the diagnosis of carpal tunnel syndrome. Comput Biomed Res. 1999;32(5):391–414. pmid:10529299
- 70. Tsien CL, Fraser HS, Long WJ, Kennedy RL. Using classification tree and logistic regression methods to diagnose myocardial infarction. Stud Health Technol Inform. 1998;52 Pt 1:493–7. pmid:10384505
- 71. Chaudhuri AK, Sinha D, Banerjee DK, Das A. A novel enhanced decision tree model for detecting chronic kidney disease. Network Modeling Analysis in Health Informatics and Bioinformatics. 2021;10(1):29.