- Research
- Open access
- Published:
Machine learning derivation of two cardiac arrest subphenotypes with distinct responses to treatment
Journal of Translational Medicine volume 23, Article number: 16 (2025)
Abstract
Introduction
Cardiac arrest (CA), characterized by its heterogeneity, poses challenges in patient management. This study aimed to identify clinical subphenotypes in CA patients to aid in patient classification, prognosis assessment, and treatment decision-making.
Methods
For this study, comprehensive data were extracted from the Medical Information Mart for Intensive Care IV (MIMIC-IV) 2.0 database. We excluded patients under 18 years old, those not initially admitted to the intensive care unit (ICU), or treated in the ICU for less than 72 h. A total of 57 clinical parameters relevant to CA patients were selected for analysis. These included demographic data, vital signs, and laboratory parameters. After an extensive literature review and expert consultations, key factors such as temperature (T), sodium (Na), creatinine (CR), glucose (GLU), heart rate (HR), PaO2/FiO2 ratio (P/F), hemoglobin (HB), mean arterial pressure (MAP), platelets (PLT), and white blood cell count (WBC) were identified as the most significant for cluster analysis. Consensus cluster analysis was utilized to examine the mean values of these routine clinical parameters within the first 24 h post-ICU admission to categorize patient classes. Furthermore, in-hospital and 28-day mortality rates of patients across different CA subphenotypes were assessed using multivariate logistic and Cox regression analysis.
Results
After applying exclusion criteria, 719 CA patients were included in the study, with a median age of 67.22 years (IQR: 55.50-79.34), of whom 63.28% were male. The analysis delineated two distinct subphenotypes: Subphenotype 1 (SP1) and Subphenotype 2 (SP2). Compared to SP1, patients in SP2 exhibited significantly higher levels of P/F, HB, MAP, PLT, and Na, but lower levels of T, HR, GLU, WBC, and CR. SP2 patients had a notably higher in-hospital mortality rate compared to SP1 (53.01% for SP2 vs. 39.36% for SP1, P < 0.001). 28-day mortality decreased continuously for both subphenotypes, with a more rapid decline in SP2. These differences remained significant after adjusting for potential covariates (adjusted OR = 1.82, 95% CI: 1.26–2.64, P = 0.002; HR = 1.84, 95% CI: 1.40–2.41, P < 0.001).
Conclusions
The study successfully identified two distinct clinical subphenotypes of CA by analyzing routine clinical data from the first 24 h following ICU admission. SP1 was characterized by a lower rate of in-hospital and 28-day mortality when compared to SP2. This differentiation could play a crucial role in tailoring patient care, assessing prognosis, and guiding more targeted treatment strategies for CA patients.
Introduction
The latest statistics from the American Heart Association (AHA) indicate that the incidence of out-of-hospital cardiac arrest (OHCA) stands at 140.7 cases per 100,000 population, while in-hospital cardiac arrest (IHCA) occurs at a rate of 17.16 cases per 1,000 hospitalized patients [1]. Cardiac arrest (CA) is a significant health concern, causing nearly 370,000 deaths annually in the United States [1] and almost 1 million deaths each year in China [2]. These numbers have been rising, especially in the period following the COVID-19 pandemic [1]. The high mortality rate associated with CA is largely attributed to its heterogeneity, making the identification of specific CA subphenotypes critical for the development of precise and effective treatment plans.
Now, there remains ambiguity about the various terms created to classify and study CA, and consensus documents from various cardiovascular societies characterize them distinctively [3,4,5]. North American (American Heart Association/Heart Rhythm Society/American Council of Cardiology), European (European Heart Society), and Asian-Pacific (Asia-Pacific Heart Rhythm Society) all have unique definitions of CA [6]. They differed in whether the emphasis is placed on timing, etiology, or situational context surrounding the cardiac arrest. However, the above-mentioned traditional classification methods inevitably have the following problems: (1) Unable to accurately reflect the pathophysiological state: The pathophysiological state of each patient after CA can be significantly different, but these classification criteria do not accurately reflect these differences; (2) Inability to provide effective support for treatment decisions: Different types of cardiac arrest require different treatment strategies. The above classification methods ignore individual differences and cannot provide enough clinical information to guide treatment. Therefore, this study focused on the above two issues, hoping to use a new tool to derive a new clinical classification method for CA patients. This classification method does not rely on external factors such as timing, etiology, or situational context surrounding the CA, but uses demographic variables, severity scores, comorbidities, vital signs, laboratory parameters, medication, and special treatments to depict the high-latitude stereoscopic characteristics of each CA patient, and uses computer tools to classify them in high-latitude digital space. This classification method is based on the pathophysiological state of CA patients, which can evaluate the prognosis of patients and help with clinical treatment decisions.
The machine learning (ML) classifier model has become one of the most indispensable tools in modern medical research for identifying various disease subphenotypes. Among them, consensus cluster analysis is a suitable clustering method that has been widely used to identify sepsis [7] and acute respiratory distress syndrome (ARDS) subphenotypes [8]. Until the initiation of this study, the application of similar methods for identifying clinical subphenotype models in CA patients had not been widely explored by researchers. This study sought to develop a consensus cluster analysis approach to identify clinical subphenotypes of CA patients based on data collected within 24 h of admission to the intensive care unit (ICU). This methodology aims to assist clinicians in classifying and assessing the prognosis of CA patients, thereby enabling the potential for early warning and precise interventions for CA in the future.
Methods
Data source
All the data were extracted from the Medical Information Mart for Intensive Care IV (MIMIC-IV) 2.0 database, which contained all medical record numbers corresponding to patients admitted to the ICU or emergency department between 2008 and 2019 in the Beth Israel Deaconess Medical Center (BIDMC). Access to the database was granted following the completion of NIH-required online training by one of the authors (Mengyuan Diao, with certification ID: 1630201). The Institutional Review Board (IRB) at BIDMC provided a waiver for informed consent and approved the use of this resource for research purposes. Consequently, the study was conducted using publicly available and anonymized data, negating the need for individual patient consent.
Study population and outcome
In this study, patients who experienced CA were included as research participants, regardless of the cause of the arrest. However, the study excluded patients who were under 18 years of age, as well as those who were either initially admitted to the ICU or treated in the ICU for less than 72 h. A flow diagram detailing the study design and participant selection is presented in Fig. 1. The primary outcome measured in the study was in-hospital mortality, while the secondary outcome focused on the 28-day mortality.
Data extraction and variable selection
The extracted data for this study encompassed a comprehensive range of parameters including demographic variables, severity scores, comorbidities, vital signs, laboratory parameters, medication, and special treatments. The demographic variables considered were age, race, sex, and body mass index (BMI). Comorbidities included in the data were hypertension, heart failure, cerebral infarction, chronic obstructive pulmonary disease (COPD), cirrhosis of the liver, chronic kidney disease, malignant cancer, diabetes, myocardial infarction, arrhythmia, coronary heart disease, and cardiomyopathy. Vital signs incorporated into the study were heart rate (HR), systolic blood pressure (SBP), diastolic blood pressure (DBP), mean arterial pressure (MAP), respiratory rate (RR), temperature (T), PaO2/FiO2(P/F), SPO2, and total input and output values. Laboratory parameters consisted of hemoglobin (HB), platelets (PLT), white blood cells (WBC), bicarbonate, chloride, creatinine (CR), glucose (GLU), sodium (Na), potassium, bilirubin, pH levels, PO2, PCO2, international normalized ratio (INR), base excess (BE), alanine aminotransferase (ALT), aspartate transaminase (AST), blood urea nitrogen (BUN), and lactate (LAC). Severity scoring included the Charlson score and the Sequential Organ Failure Assessment (SOFA) score. The medications considered were vasoactive drugs, antiarrhythmics, glucocorticoids, and sodium bicarbonate. Special treatments accounted for were positive end-expiratory pressure (PEEP), percutaneous transluminal coronary intervention (PCI), extracorporeal membrane oxygenation (ECMO), continuous renal replacement therapy (CRRT), intra-aortic balloon pump (IABP), and mechanical ventilation. All these parameters were average values within the first 24 h post-ICU admission, except for input and output, which were cumulative totals within the same timeframe.
For feature selection, variables with a missing fraction exceeding 30% were excluded. Data were imputed with the use of multiple imputation for variables that were missing by less than 30%. The proportion of missing data is detailed in supplementary Fig. 1 (sFigure 1). Selection of variables was informed by prior literature and their potential association with the onset and progression of CA, as identified through literature review and expert discussions. The final variables chosen for consensus cluster analysis included T, Na, CR, GLU, HR, P/F, HB, MAP, PLT, and WBC. The consensus clustering analysis utilized the k-means clustering method as its internal algorithm, with each variable representing a different physiological system or function. Correlations between the selected parameters were generally low, as indicated in supplementary Fig. 2 (sFigure 2), with the highest correlation observed between HB and MAP (r = 0.4).
Subphenotype classification
In this study, consensus cluster analysis was employed to identify clinical subphenotypes of CA in ICU patients. The analysis utilized a pre-defined subsampling parameter set at 80%, with 50 iterations. The potential number of clusters (k) was limited to a range from 2 to 6, ensuring that the number of clusters remained clinically manageable and relevant. The optimal number of clusters was determined through a comprehensive examination of various metrics. These included the consensus matrix (CM) heat map, the cumulative distribution function (CDF), and cluster-consensus plots that focused on within-cluster consensus scores. The proportion of ambiguously clustered pairs (PAC) and the Bayesian information criterion (BIC) also played a crucial role in this determination. The within-cluster consensus score, which varies between 0 and 1, represented the average consensus value for all pairs of individuals within the same cluster. A score closer to 1 indicated greater stability and homogeneity within that cluster. PAC, calculated in the range of 0 to 0.9, measured the proportion of sample pairs with consensus values within pre-set boundaries. A lower PAC value signified improved cluster stability. Lastly, the BIC, ranging between 20,000 and 20,500, served as a criterion for the selection of the number of clusters, with lower BIC values indicating a more parsimonious model.
Statistical analysis
Statistical analyses in this study were conducted using the R software tool, version 4.1.2. After the CA patients clusters were identified, the differences between these clusters were analyzed. Measurement data were represented as medians with their interquartile ranges (IQR), and the nonparametric rank sum test was used to compare these measurements across different groups. Counting data, such as frequencies, were expressed in percentages, and the Chi-square test was employed to compare these frequencies between groups. For multivariable logistic regression, four distinct models were developed, each adjusted for different confounding variables. Model 1 served as the baseline with no adjustments. Model 2 included adjustments from Model 1, plus factors such as age and sex. Model 3 built upon Model 2 by also adjusting for the Charlson score. Finally, Model 4 included all adjustments from Model 3, with the addition of the SOFA score. The odds ratios (ORs) and their 95% confidence intervals (CIs) were calculated using logistic regression to assess the prognosis for different patient classes. In this analysis, a p-value of less than 0.05 (two-tailed) was considered to indicate statistical significance.
Results
Characteristics of the cohorts
In this study, a total of 76,540 ICU patients from the MIMIC-IV database were initially considered. After applying the exclusion criteria, 719 patients who experienced CA were ultimately enrolled. The patient cohort in this study had a median age of 67.2 years, with an interquartile range (IQR) of 55.5 to 79.3 years. Among these patients, 63.2% (455 out of 719) were male. The racial demographics were predominantly white (389/719, 54.1%), followed by black (81/719, 11.2%), and others (249/719, 34.6%). The most common comorbidities among the patients were hypertension (489/719, 68.0%), heart failure (265/719, 36.8%), and chronic kidney disease (176/719, 24.4%). These demographic details are further outlined in Table 1.
Regarding treatments within the first 24 h post-ICU admission, a significant proportion of the patients received vasoactive drugs (74.4%, 535/719), and nearly all were put on ventilation on the first day (99.0%, 712/719). Additionally, 29.7% (214/719) received antiarrhythmic drugs, and 29.2% (210/719) were administered sodium bicarbonate during the same period. The median Sequential Organ Failure Assessment (SOFA) score across the patient cohort was 7 (IQR: 4–9), indicative of a generally severe level of illness among these patients.
Among included patients, 74.4%(535/719) were administered vasoactive agents dose within 24 h of ICU admission, 99.0%(712/719) received mechanical ventilation, 29.7%(214/719) were given antiarrhythmic agents dose, and 29.2%(210/719) were treated with sodium bicarbonate. Across the dataset, Charlson score had a median score of 6 (IQR: 4–8), indicating a high risk of comorbidity.
Derivation of two subphenotypes
The CDF plot (sFigure 3A) shows the consensus distributions for each cluster, where the curve had negligible variation when k = 2. The relative change in the area under the CDF curve is shown in the delta plot (sFigure 3B), revealing significant differences in area when k = 3 or 4, indicating the relative increased area became considerably small. The mean cluster consensus score was comparable between a scenario of two clusters (sFigure 3C), and the two clusters showed favorably low PACs by the criteria (take [0, 0.9] as the predetermined boundary, sFigure 3D). Then, we computed the optimal BIC values using Least Common Ancestors analysis (LCA), which was found to be minimum (20100) when k = 2, indicating the model reached the optimal clustering number (sFigure 4). The consensus matrix (CM) heatmap (Fig. 2) shows that consensus cluster analysis identifies clusters 2 and 4 with clear boundaries, indicating good cluster stability over repeated iterations.
Through consensus clustering analysis, two distinct patient groups emerged from the parameters of CA patients within the first 24 h of their admission to the ICU. For simplicity, we referred to the two classes as subphenotype 1 (SP1) and subphenotype 2 (SP2), followed by creating 2D images using t-distributed stochastic neighbor embedding (t-SNE) to mark the differences between before and after clustering (Fig. 3A) for easier exploration and visualization of two subphenotypes. Abnormal clinical variables of the two subphenotypes are shown in Fig. 4.
(A) T-distributed stochastic neighbor embedding (t-SNE) plot. This nonlinear dimension reduction technique is used to visualize high-latitude data. (B) Selected variables by subphenotype in CA and the differences in the standardized values of each variable by subphenotype. All continuous variables were transformed into z-scores (mean: 0, standard deviation: −1 to 1)
WBC, white blood cell count; MAP, mean arterial pressure
Chord diagram showing abnormal clinical variables by suphenotype. In (A), the ribbons connect an individual subphenotype to an organ or system if the group mean is greater or less than the overall mean for the entire cohort. For example, subphenotype 1 (light blue) is more likely to have patients with acid–base imbalance (the ribbons connect to these portions of the circle) than patients with subphenotype 2 (light red), who are more likely to have cardiovascular, pulmonary, and hepatic dysfunction. In (B) and (C), each subphenotype is highlighted separately, and the ribbons connect to different patterns of clinical variables and organ or system dysfunctions located at the top of the circle
Characteristics of two subphenotypes
Patients within both subphenotypes were of similar age. Among those with SP1, 57.6% were males. These patients also had lower Charlson Comorbidity Index and SOFA scores, and they required fewer vasoactive drugs and CRRT. In terms of laboratory parameters, patients in SP1 exhibited elevated levels of HB, Bicarbonate, Chloride, pH, PO2, and BE, contrasted with diminished levels of Potassium, Bilirubin, INR, and LAC. Similarly, 73.8% were male in SP2. However, patients in SP2 showed higher PEEP and a higher comorbidities burden. In addition, SP2 patients demonstrated elevated GLU, BUN, ALT and AST levels (Table 1). The distinct variables of both subphenotypes are depicted in Fig. 3B. In comparison to SP2, SP1 is distinguished by markedly higher levels of P/F, HB, MAP, PLT and Na, alongside lower values of T, HR, GLU, WBC and CR.
Clinical outcomes of two subphenotypes
As shown in Fig. 5A, SP1 had a lower proportion of in-hospital mortality than SP2, where KM survival curves showed that SP1 had a lower 28-day mortality than SP2 (P < 0.001) (Fig. 5B). The univariate model 1 was adjusted three times by including different parameters and analyzed using multivariate logistic and COX regression analysis, respectively. Results showed that in either model, the in-hospital mortality (OR = 1.82, 95% CI: 1.26–2.64, p = 0.002) and 28-day mortality (HR = 1.84, 95% CI: 1.40–2.41, p < 0.001) of SP2 patients were higher than SP1 (Table 2).
We also analyzed the relationship between different vital signs and mortality (Table 3), and results showed that patients with MAP ≥ 65mmHg had lower mortality than those with MAP < 65mmHg, independent of a whole group or in SP1 or SP2 individually. The in-hospital mortality and 28-day mortality of SP1 patients with temperature 32 ∼ 36℃ were lower than those with temperature 36 ∼ 37.5℃(OR = 2.91, 95% Cl: 1.46–5.78], HR = 2.37 [95% Cl: 1.52–3.69]). However, the analyses for SP2 patients were not significantly different for patients with temperature 36–37.5 °C and others. In the whole cohort, patients also tended to have a lower mortality rate when PO2 ≥ 80mmHg. Nevertheless, PO2 ≥ 80mmHg showed the opposite outcome in SP2 as in SP1. Similarly, compared to patients with PCO2 < 35mmHg, those with 35mmHg ≤ PCO2 ≤ 45mmHg or PCO2 ≥ 45mmHg had lower mortality, independent of a whole group or in SP1 or SP2.
Discussion
In medicine, subphenotype and disease stage are two distinct concepts that can sometimes be difficult to distinguish. Is it possible that the discrimination that is being observed simply reflects two different stages of progression rather than separate clusters? Let me begin with our answer - “NO”. The two subphenotypes that have been identified are clearly not two stages of CA progression, but rather refer to a group of CA patients who are further subdivided according to different characteristics or responses. The stages of CA progression can include prodromal phase, onset phase, CA phase, and biological death phase. The subphenotypes identified in this study are certainly in one of these stages of development, but it is more of a coexisting condition. The identification of subphenotypes usually relies on in-depth analysis of patient clinical characteristics (such as symptoms and signs) and biomarkers (such as specific molecules in blood and urine). These features and markers can help physicians classify patients into subgroups with similar pathophysiological mechanisms or different responses to therapy.
The analysis of CA patient data identified two distinct clinical subphenotypes, referred to as SP1 and SP2. This was achieved through the use of consensus cluster analysis, employing a range of variables for the study, including T, Na, CR, GLU, HR, P/F, HB, MAP, PLT, and WBC. Results showed that both subphenotypes differed in several dimensions, such as demographics, vital signs, laboratory results, and organ dysfunction, which differs from how patients are grouped using traditional methods. Furthermore, we refined the parameters of the univariate model on three separate occasions. It was consistently observed that patients categorized under the SP2 had higher in-hospital and 28-day mortality rates compared to those with the SP1. The findings of this study are anticipated to serve as a vital reference for clinicians in classifying patients, assessing their prognosis, and making informed treatment decisions for CA patients.
ML has been widely used for classifying various disease subphenotypes, among which consensus cluster analysis is one of the most popular ML clustering algorithms, and has been widely used for identifying subphenotypes of diseases such as ARDS [8], sepsis [9], septic acute kidney injury (septic-AKI) [10], and liver dysfunction [11]. Under comprehensive consideration, consensus cluster analysis was selected for defining CA subphenotypes. CA subphenotypes can be determined after medical information collection and routine examination of patients following their admission, which is helpful for the timing of advanced treatment intervention, flexible adjustment of treatment plans, and screening of clinical trial subjects. After two critical care medicine experts discussed and reached a consensus, 52 relevant parameters were screened following the patient’s admission to the ICU. To avoid excessive confounding leading to decreased clinical utility, we selected the most representative 10 parameters (T, Na, CR, GLU, HR, P/F, HB, MAP, PLT, and WBC) for cluster analysis. Only the mean values within 24 h after ICU admission were selected to achieve the early differentiation of clinical subphenotypes of CA patients. Although previous studies have shown that some new parameters can achieve considerable prediction results, it wasn’t easy to carry out in practical clinical work [8, 12]. To achieve the study objective, clinically available data was used in the cluster model and two subphenotypes were derived. In these two phenotypes, SP2 patients were most closely associated with abnormalities in organs or systems, notably including the cardiovascular and hematologic systems, as well as hepatic dysfunction. Conversely, SP1 patients exhibited considerably more favorable characteristics than SP2. Since the statistical analysis showed that SP1 patients had overall fewer clinical anomalies despite a high proportion of acid-base imbalance, we may be able to classify patients with CA more quickly based on limited laboratory tests. The multiple cluster analyses showed consistency across clinical subphenotypes, which could help identify patients who would benefit from future intervention. Nonetheless, further understanding of the pathophysiological mechanisms underlying CA subphenotypes is necessary.
Among the 10 most representative parameters, there were 5 parameters which were higher or lower than SP2 patients, respectively. Results showed that patients with SP2 had higher HR and lower MAP than those with SP1, in addition to significant history of underlying hypertension, heart failure, and a higher proportion of vasoactive drug use. A higher MAP within a specific range refers to better perfusion, leading to a better prognosis. A study recently reported an insignificant difference in outcome between the MAP of patients with CA between 77mmHg and 63mmHg [13]. However, the grouping method used in above study was fundamentally different from the subphenotypes classification method used in our study. Patients were grouped based on different target blood pressure in above study and ML was applied to cluster and classify patients under multi-parameter conditions in our study, which could be the primary reason why the results of the two studies are partially biased. Additionally, patients with potentially poor long-term outcomes were excluded from the sample. Hence, the influence of MAP on CA patients’ outcome remains to be determined. Na was used in this study as a representative of electrolytes. The Na levels in both subphenotypes were similar and within the normal range but were slightly lower in SP2 patients than in SP1. Moreover, hypo- and hypernatremia were associated with a decreased probability of favorable neurological outcomes compared with normal Na [14]. It should also be noted that SP2 patients had lower Na even though they used sodium bicarbonate at a higher rate than SP1, suggesting that Na in SP2 patients might be lower before the use of sodium bicarbonate, and the difference between the two groups may be more significant. The main function of PLT is coagulation and hemostasis, and they are often used to evaluate the coagulation function of patients in clinic. The PLT of two subphenotypes in this study was within the normal range but slightly reduced in SP2 than in SP1. According to the neurologic outcome at 6 months post-CA, patients with good prognosis had a mean PLT of 230.31*10^9/L at admission, and patients with poor prognosis had a mean PLT of 197.30*10^9/L at admission [15], which were similar to our results and end outcomes. P/F is an important indicator of respiratory function and is widely used to distinguish high-risk patients with adverse clinical outcomes [16]. Our results showed that SP2 patients had lower P/F than SP1, suggesting that SP2 patients had poor respiratory function and more severe organ or tissue ischemia and hypoxia. Multiple factors could contribute to a lower P/F in CA patients, such as aspiration pneumonia, pulmonary embolism, systemic inflammatory reaction, pulmonary exudation, and ARDS, among which some were also common CA causes.
Furthermore, our results showed that patients with SP2 had abnormally high levels of CR, while patients with SP1 had normal levels of CR, which suggesting worse renal function in SP2 patients. Meanwhile, SP2 patients had a higher proportion of chronic kidney disease comorbidities. A decrease in CR of > 0.2 mg/dl in the first 24 h may indicate a good prognosis, while a constant or even elevated serum CR level suggests a poor prognosis [17]. Excessively high GLU can lead to immunosuppression and oxidative stress, leading to poor patient outcomes. A study reported that higher GLU levels were associated with poor neurological outcomes in those patients with CA treated with targeted temperature management (TTM) [18]. Our data showed that SP2 patients had higher GLU than SP1. The statistical analysis revealed infection as the most significant difference between the two subphenotypes since SP2 patients had a higher WBC than SP1, indicating a severe infection. However, it could have happened before CA onset, or the infection could be the cause of CA. Furthermore, a study reported that WBC had no significant ability to distinguish infectionin in CA patients receiving TTM [19]. Although WBC showed apparent difference between the two subphenotypes, but its reliability needs to be demonstrated by more clinical practice in the future. The temperature between the two subphenotypes not statistically significant, which was related to the TTM commonly adopted in clinical practice. Despite some controversy, TTM at 32–36 °C for at least 24 h post-CA remained the primary neuroprotective approach following OHCA or IHCA, consistent with AHA recommendations [20,21,22]. The results of COX and multivariate logistic analysis in this study showed that, in the whole population, the in-hospital and 28-day mortality of patients with T of 32–36 °C were lower than those of 36–37.5 °C, which indicated the benifet of TTM for the prognosis of CA patients.
CA patients are classified into two subphenotypes (SP1 and SP2) on the basis of different clinical features and physiological responses (on the basis of 10 vital signs and laboratory parameters that represent the pathophysiology of CA). Compared with the previous classification methods, these subphenotypes are more helpful for doctors to diagnose, treat and predict the prognosis of patients more accurately. The variables used for clustering can be obtained quickly after the patient is admitted to the hospital, so that clinicians can quickly identify the patient’s subphenotype (SP1 or SP2). Clinicians can then tailor treatment to patients on the basis of the pathophysiological features of the subphenotypes that we have so far clustered. SP1 is more likely to have patients with acid–base imbalance than SP2 patients, who are more likely to have cardiovascular, pulmonary, and hepatic dysfunction. Therefore, attention to acid-base balance (e.g., an aggressive adjunctive CRRT strategy) is warranted beyond usual care in SP1 patients. For SP2 patients, we should be alert to the occurrence of multiple organ dysfunction syndrome (MODS). This may be because: 1. Poor basic organ function; 2. CA has a greater impact on the organs of patients. Active intra-aortic ballon pump (IABP), extracorporeal membrane oxygenation (ECMO) and artificial liver support system (ALSS) may be helpful for patients with organ support. In-hospital and 28-day mortality were lower with SP1 than with SP2. We can make a preliminary prognosis assessment of CA patients according to the classification of patients after admission. This can be used as a theoretical basis for prognosis prediction in conversation with the patient’s family.
Translating theoretical research results into clinical practice is a significant challenge faced by the majority of researchers. While previous studies found some critical factors in subphenotypes identification for ARDS and septic AKI, but these indicators have not been popularized because of the difficulty in extracting these indicators in clinical practice [8, 12, 23]. However, this challenge also exists in the classification of CA subphenotypes. Unlike previous research [12, 23], we used 10 conventional clinical variables to derive clinical CA subphenotypes, which were more straightforward and easier to obtain, and their values reflected the functional states of different systems or organs, making our cluster analysis more representative and universal. In the future, we aimed to conduct an external multicenter validation to refine the underlying model.
This study had several limitations. Firstly, since CA causes are numerous and complex, using the currently known disease patterns to identify all subphenotypes might not be sufficient. We should examine the heterogeneity of CA from a higher latitude. This puts forward higher requirements for researchers’ ability to multi-system joint thinking. Secondly, the two experts believed that age is an important factor affecting the survival rate of patients with CA since younger tends to mean fewer underlying diseases, stronger immunity, and a better prognosis. Considering that the potential influence of age on patients with CA is fundamental and multifaceted, this study was not temporarily included. Thirdly, including more potential variables, such as shock type and microbiological data et cetera could provide new insights. However, these were not present in the MINIC-IV 2.0 database. Lastly, the variables for clinical subphenotypes in this study were derived from a single-center retrospective database in the United States. Therefore, whether these subphenotypes can be generalized to more diverse populations of severely ill patients with CA in other parts of the world remains to be seen. We would invest more time and energy to verify and adjust it further. In the next few years, we planned to collect patients with CA information from multiple centers to establish the CA database of ICU patients in China.
Conclusions
Analysis of CA patients data retrieved from the MIMIC-IV 2.0 database revealed two clinical subphenotypes of CA, namely SP1 and SP2. Consensus cluster analysis was performed using the mean values of clinical, vital signs, and laboratory indicators as the analysis variables. The SP1 ad considerably higher levels of P/F, HB, MAP, PLT, and Na, and lower T, HR, GLU, WBC, and CR levels than SP2. The in-hospital and 28-day mortality of patients with SP2 was higher than patients with SP1. The study outcomes are envisaged to provide helpful information to the clinicians regarding patient classification, prognosis evaluation, and making treatment decisions for prospective CA patients.
References
Tsao CW, et al. Heart disease and stroke statistics-2022 update: a report from the American Heart Association. Circulation. 2022;145(8):e153–639.
Gu XM, et al. Meta-analysis of the success rate of heartbeat recovery in patients with prehospital cardiac arrest in the past 40 years in China. Mil Med Res. 2020;7(1):34.
Al-Khatib SM, et al. 2017 AHA/ACC/HRS guideline for management of patients with ventricular arrhythmias and the prevention of sudden cardiac death: a report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines and the Heart Rhythm Society. Circulation. 2018;138(13):e272–391.
Priori SG et al. 2015 ESC guidelines for the management of patients with ventricular arrhythmias and the prevention of sudden cardiac death: the task force for the management of patients with ventricular arrhythmias and the prevention of sudden cardiac death of the European Society of Cardiology (ESC). Endorsed by: Association for European Paediatric and Congenital Cardiology (AEPC). Eur Heart J. 2015;36(41):2793–2867.
Stiles MK, et al. 2020 APHRS/HRS expert consensus statement on the investigation of decedents with sudden unexplained death and patients with sudden cardiac arrest, and of their families. Heart Rhythm. 2021;18(1):e1–50.
Elfassy MD, et al. Understanding etiologies of cardiac arrest: seeking definitional clarity. Can J Cardiol. 2022;38(11):1715–8.
Seymour CW, et al. Derivation, validation, and potential treatment implications of novel clinical phenotypes for sepsis. JAMA. 2019;321(20):2003–17.
Maddali MV, et al. Validation and utility of ARDS subphenotypes identified by machine-learning models using clinical data: an observational, multicohort, retrospective analysis. Lancet Respir Med. 2022;10(4):367–77.
Hu C, et al. Application of machine learning for clinical subphenotype identification in sepsis. Infect Dis Ther. 2022;11(5):1949–64.
Chaudhary K, et al. Utilization of deep learning for subphenotype identification in sepsis-associated acute kidney injury. Clin J Am Soc Nephrol. 2020;15(11):1557–65.
Miao H, et al. Identification of subphenotypes of sepsis-associated liver dysfunction using cluster analysis. Shock. 2023;59(3):368–74.
Wiersema R, et al. Two subphenotypes of septic acute kidney injury are associated with different 90-day mortality and renal recovery. Crit Care. 2020;24(1):150.
Roedl K, Kluge S. Blood-pressure targets in comatose survivors of cardiac arrest. N Engl J Med. 2023;388(3):285.
Shida H, et al. Early prognostic impact of serum sodium level among out-of-hospital cardiac arrest patients: a nationwide multicentre observational study in Japan (the JAAM-OHCA registry). Heart Vessels. 2022;37(7):1255–64.
Kim HJ, et al. Time course of platelet counts in relation to the neurologic outcome in patients undergoing targeted temperature management after cardiac arrest. Resuscitation. 2019;140:113–9.
Villar J, et al. A universal definition of ARDS: the PaO2/FiO2 ratio under a standard ventilatory setting–a prospective, multicenter validation study. Intensive Care Med. 2013;39(4):583–92.
Hasper D, et al. Changes in serum creatinine in the first 24 hours after cardiac arrest indicate prognosis: an observational cohort study. Crit Care. 2009;13(5):R168.
Daviaud F, et al. Blood glucose level and outcome after cardiac arrest: insights from a large registry in the hypothermia era. Intensive Care Med. 2014;40(6):855–62.
Schuetz P, et al. Serum procalcitonin, C-reactive protein and white blood cell levels following hypothermia after cardiac arrest: a retrospective cohort study. Eur J Clin Invest. 2010;40(4):376–81.
Andersen LW, et al. In-hospital cardiac arrest: a review. JAMA. 2019;321(12):1200–10.
Mody P, et al. Targeted temperature management for cardiac arrest. Prog Cardiovasc Dis. 2019;62(3):272–8.
Sandroni C, et al. ERC-ESICM guidelines on temperature control after cardiac arrest in adults. Intensive Care Med. 2022;48(3):261–9.
Sinha P, et al. Development and validation of parsimonious algorithms to classify acute respiratory distress syndrome phenotypes: a secondary analysis of randomised controlled trials. Lancet Respir Med. 2020;8(3):247–57.
Funding
1. Zhejiang Provincial Medical and Health Technology Project (grant. WKJ-ZJ-2315).
2. The Construction Fund of Key Medical Disciplines of Hangzhou (OO20200485).
3. Science and Technology Development Project of Hangzhou (grant. 202204A10).
Author information
Authors and Affiliations
Contributions
The specifc division of labor was as follows: Conception, WZ; Funding, WH, and MD; Investigations, WZ; Methods, WZ, CW, and PN; Project management, SZ, HZ, and YZ.
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare that there are no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Zhang, W., Wu, C., Ni, P. et al. Machine learning derivation of two cardiac arrest subphenotypes with distinct responses to treatment. J Transl Med 23, 16 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12967-024-05975-1
Received:
Accepted:
Published:
DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12967-024-05975-1