Skip to main content

Coronary health index based on immunoglobulin light chains to assess coronary heart disease risk with machine learning: a diagnostic trial

Abstract

Background

Recent studies suggest a connection between immunoglobulin light chains (IgLCs) and coronary heart disease (CHD). However, current diagnostic methods using peripheral blood IgLCs levels or subtype ratios show limited accuracy for CHD, lacking comprehensive assessment and posing challenges in early detection and precise disease severity evaluation. We aim to develop and validate a Coronary Health Index (CHI) incorporating total IgLCs levels and their distribution. Additionally, we aim to evaluate its effectiveness by integrating patient data and using machine learning models through diagnostic trial.

Methods

The CHI was developed and combined with other clinical data. Nine machine learning models were screened to identify optimal diagnostic performance, with the XGBoost model emerging as the top performer. Performance was assessed based on accuracy, sensitivity, and its ability to identify severe CHD cases characterized by complex lesions (SYNTAX score > 33).

Results

The XGBoost model demonstrated high accuracy and sensitivity in diagnosing CHD, with an area under the curve (AUC) of 0.927. It also accurately identified patients with severe CHD, achieving an AUC of 0.991. An online web tool was introduced for broader external validation, confirming the model’s effectiveness.

Conclusions

Combining the CHI with the XGBoost model offers significant advantages in diagnosing CHD and assessing disease severity. This approach can guide clinical interventions and improve large-scale CHD screening.

Background

Cardiovascular disease, particularly coronary heart disease (CHD), poses a formidable global challenge, both in treatment and prevention [1]. The intricate treatments, coupled with costly interventions and often asymptomatic early stages, make managing and mitigating its incidence exceedingly difficult, posing a significant threat to global health [2]. According to estimates, CHD claimed the lives of 9.14 million people globally in 2019, while 197 million individuals were living with the condition. This burden is escalating rapidly in low- and middle-income countries such as China and Southeast Asia [3]. The alarming incidence and mortality rates underscore the urgent need for efficient diagnostic and therapeutic strategies to manage CHD effectively. Furthermore, CHD encompasses a wide spectrum of myocardial ischemia conditions resulting from coronary artery atherosclerosis, ranging from minor myocardial insufficiency (angina pectoris) to severe myocardial necrosis (myocardial infarction) [4]. The majority of clinicians have profoundly acknowledged the pivotal role of precision medicine in enhancing patient treatment outcomes and quality of life. However, we face a novel challenge: how to promptly and conveniently differentiate CHD patients based on varying degrees of severity, as this differentiation directly impacts the selection of the most suitable treatment modalities [5]. Nevertheless, we still have significant deficiencies in this area at present.

Although notable advancements have been made in the diagnosis and treatment of CHD over the past several decades, numerous challenges persist. CHD diagnosis has traditionally relied on imaging techniques, and although many are noninvasive, such as echocardiography and computed tomography, can clearly visualize the structure and function of the heart without harming the patient, effectively identifying coronary artery stenosis and plaque formation. Nevertheless, these techniques are not flawless, as the presence of false positives and false negatives may lead to diagnostic errors. Others are invasive and carry high risks, like coronary angiography and intravascular ultrasound offer more precise diagnostic information. However, their procedures are accompanied by risks, including vascular puncture, bleeding, infection, or allergic reactions that may arise from the use of contrast agents, causing additional discomfort and burden to patients; additionally, all techniques are costly and equipment-dependent, limiting access [6, 7].

Other studies have demonstrated the successful application of biochemical markers, such as troponin and brain natriuretic peptide (BNP), in diagnosing CHD. However, despite their desirable diagnostic performance, these biochemical markers currently in use in clinical practice, such as troponin and BNP, also have some limitations. Troponin, for example, is primarily employed for diagnosing acute coronary syndrome but is insufficient for long-term prognosis prediction or overall risk assessment in CHD. Similarly, the specificity of BNP is a crucial biomarker in the assessment of heart failure, where elevated levels of BNP often signify the presence of left ventricular dysfunction and increased cardiac stress, however, it is limited because its serum concentration is influenced by various noncardiac factors [8, 9]. Consequently, researchers are now aiming to uncover novel biomarkers that can be rapidly and noninvasively measured and are highly specific and sensitive; such biomarkers would not only increase diagnostic precision but also potentially reduce diagnostic costs, thus benefiting a larger patient population.

Extensive research has confirmed the relationships between atherosclerotic diseases, including CHD, and the immune system, highlighting the crucial roles played by different components of the immune system in the initiation, development, and progression of atherosclerosis [10]. Developments in the field of immunology have garnered immunoglobulin light chains (IgLCs) significant attention from researchers as a biomarker of great potential. Alterations in the levels of total immunoglobulin light chains (TLCs) and free immunoglobulin light chains (FLCs) have been used as key indicators for monitoring disease activity in conditions such as multiple myeloma and certain autoimmune disorders [11, 12]. IgLCs are crucial components of the immunoglobulins produced by B cells and play indispensable roles in the immune system. Each immunoglobulin molecule is composed of exactly two heavy chains and two light chains, the latter of which exist in two subtypes, κ and λ [13]. In healthy individuals, IgLCs exist in a stable proportion, the κ:λ light chain ratio is approximately 2:1 [14].

Studies have revealed an unexpected association between IgLCs and cardiovascular events, potentially indicating their role as important biomarkers for cardiovascular disease (CVD) [15, 16]. Patients with heart failure, for example, have presented with abnormal IgLCs levels that are highly correlated with disease severity and the probability of adverse clinical outcomes [17]. Another study found that high levels of serum free κ light chains are independently associated with the presence of CVD, potentially indicating their suitability as biomarkers for early disease detection [18]. Similarly, there seems to be a potential connection between IgLCs and CHD. Existing studies have confirmed the correlation between variations in IgLCs levels and the occurrence, progression, and prognosis of CHD. As early as 2015, a study already found that, in patients with ST-elevation myocardial infarction (STEMI), the levels of FLCs could predict the future necessity for percutaneous coronary intervention (PCI) [19]. Another recent study has shown that, in CHD, patients with elevated κ/λ ratios had an improved left ventricular ejection fraction. These serum FLCs have also been suggested to play pivotal roles in the pathogenic mechanisms underlying the disease [20]. However, despite the correlation between IgLCs and CHD, the application of IgLCs in diagnosing CHD still faces challenges. The accuracy of predicting risk using IgLCs levels or subtype ratios alone is insufficient, primarily due to significant individual differences influenced by genetics, environment, lifestyle, and other factors. Additionally, the pathogenesis of CHD is complex, encompassing inflammation, lipid metabolism, endothelial dysfunction, platelet aggregation, and other processes. Therefore, for predicting and diagnosing CHD, a deeper understanding of the interactions between IgLCs and other biomarkers as well as clinical indicators is required, and more comprehensive and precise diagnostic strategies need to be explored.

Machine learning (ML) models offer a novel solution for enhancing diagnostic accuracy by analyzing large datasets to precisely identify subtle differences among biomarkers, thereby strengthening the discriminatory power of diagnosis [21]. This study aims to leverage algorithmic advantages to explore and optimize the utilization of IgLCs in distinguishing CHD patients. Although IgLCs hold immense potential as biomarkers for cardiovascular diseases, there remains a need for further research on how to optimally utilize them to differentiate CHD patients from high-risk individuals and for initial classification of CHD patients. Additionally, the application of ML models in improving the diagnostic accuracy and predictive capability based on IgLCs methods is yet to be fully explored.

To address this gap, in our study, we have constructed a novel assessment tool named the “Coronary Health Index” (CHI), grounded on the distinctive variations of the two light chain subtypes of IgLCs within the human body. Furthermore, we intend to integrate the CHI with multifaceted clinical data and employ machine learning to develop a predictive model. Our central objective is to utilize this model to comprehensively and systematically evaluate the efficacy of IgLCs in diagnosing CHD, while also delving into their potential value in CHD screening, early detection, as well as stratifying disease severity and progression. In this study, our model has preliminarily demonstrated its potential role in stratifying the severity or progression of CHD. We anticipate that this research will deepen our scientific understanding of how IgLCs influence the pathogenesis of CHD and provide insights and inspiration for precision management and optimized treatment strategies of cardiovascular diseases.

Methods

MethodsStudy subject inclusion and exclusion criteria

In this study, gender was not considered as a biological variable. This study was approved by the institutional review board and ethics committee of the First Hospital, Jilin University (2024 − 729) and was conducted according to the ethical standards established in the Declaration of Helsinki. Informed consent was obtained from all participants. We aim to investigate the diagnostic effectiveness of IgLCs by retrospectively recruiting patients suspected of CHD who were admitted to the First Hospital of Jilin University from January 2024 to April 2024.

The inclusion criteria for the CHD patient group were rigorously defined to ensure the validity and reliability of the study results. Specifically, the inclusion criteria for the CHD patient group included the presentation of relevant symptoms (such as chest pain, n, shortness of breath, fatigue, or palpitations), age below 80 years, a suitable physical condition for undergoing coronary angiography, and first-time CHD with no previous treatment history. Patients who had taken anticoagulant medications, such as aspirin, in the three months prior to the study, those with a poor overall condition, those who were unsuitable for surgical treatment, or those with other contraindications for blood sampling were excluded. All suspected CHD patients after enrollment need to undergo coronary angiography, and volunteers who do not obtain this data will also be excluded from the study cohort. For the volunteers finally included in the CHD group, we will further categorize them into three subgroups based on their SYNTAX scores, including mild lesions (SYNTAX score = 1–22), moderate lesions (SYNTAX score = 22.5–32), and severe lesions (SYNTAX score = 32.5–49.5).

For the control group of this study, we selected patients who were admitted with suspected CHD due to other similar symptoms but were confirmed to have no abnormalities in their coronary arteries after undergoing coronary angiography. These patients may exhibit similar symptoms to those in the CHD patient group, but they were excluded from a CHD diagnosis based on the absence of coronary stenosis or obstruction, as confirmed by rigorous coronary angiography combined with other clinical manifestations. The baseline information and other examination results of the volunteers in the two groups who were ultimately enrolled underwent desensitization processing via our internal Laboratory Information System. After personal privacy information of the patients was deleted, these data were utilized for subsequent statistical analysis.

Sample collection and preparation

To investigate the diagnostic effectiveness of IgLCs in patients suspected of CHD, a total of 486 samples were collected. At the same time, we also collected other clinical information of these patients from the case system, including basic information such as age, sex, and other test data. All participants were strictly instructed to fast for 12 h and avoid strenuous activities prior to blood sampling to ensure that the levels of the collected blood components would be stable and could be accurately measured.

After collection, the blood samples were preprocessed within 2 h to ensure they would be fresh and suitable for further analysis. During processing, the samples were centrifuged at a constant temperature of 4 °C and a speed of 3000 rpm for 10 min, allowing adequate separation of the blood components. The rigorously processed blood samples were then submitted for TLC and FLC detection within 2 h after preprocessing.

TLC and FLC detection and calculation of the CHI

TLCs and FLCs were detected and their concentrations measured on a Siemens BN II Automated Protein Analyzer (Siemens Healthcare Diagnostics Ltd., Erlangen, Germany). In accordance with the manufacturer’s instructions, the instrument was calibrated with quality control samples prior to testing the study samples. The plasma samples were tested for measuring the concentrations TLC κ, TLC λ, FLC κ, and FLC λ using the reagents provided by the manufacturer, and the data were recorded for further analysis. The ratios TLC κ/TLC λ and FLC κ/FLC λ are automatically generated by the BN II Automated Protein Analyzer. Drawing upon the biological observation that the ratio of κ-type to λ-type free light chains (FLC κ/FLC λ) remains relatively stable in the peripheral blood of healthy individuals, we devised and calculated the CHI. This indicator is defined as the ratio of the proportion of κ-type free light chains to total κ-type light chains (FLC κ/TLC κ) divided by the proportion of λ-type free light chains to total λ-type light chains (FLC λ/TLC λ), expressed as (FLC κ/TLC κ) / (FLC λ/TLC λ). This novel approach seeks to quantify the relative variations in these proportions, with the goal of uncovering potential associations between these changes and coronary health status.

Establishment and selection of machine learning models

We have fully leveraged the powerful capabilities of the DxAI platform (https://www.xsmartanalysis.com) to complete all analytical work related to ML models. A binary logistic regression model was constructed using all the collected clinical data to conduct univariate analysis. The relationship between each feature and CHD as well as the control group was observed. A p-value less than 0.05 was considered statistically significant, and these significant features were then utilized for subsequent machine learning model construction.

We employed a resampling-based 5-fold cross-validation training/validation framework to systematically evaluate the average AUC scores of various machine learning models across different data partitions, providing a comprehensive assessment of their performance. Specifically, we allocated 20% of the data as the validation set to assess model generalizability. We employed nine distinct machine learning algorithms to generate the classifier: XGBoost, logistic regression (LR), Light GBM, random forest (RF), adaptive boosting (AdaBoost), Gaussian naive Bayes (GNB), multilayer perceptron (MLP), Decision Tree (DT) classification, and k-nearest neighbors (KNN). The model performance was evaluated using Receiver Operating Characteristic (ROC) curves, and the Area Under the Curve (AUC) was calculated for each model. Furthermore, the DeLong test was performed to assess whether the differences between the performances of the different models were statistically significant (p < 0.05). The model that demonstrated the best performance in both the training set and the validation set was selected as the subsequent research object. After selecting the best-performing ML model, we utilized the ROC curves generated from 10-fold cross-validation to evaluate the model’s classification performance in both the training and validation sets. We also employed learning curves (LCs) to assess the model’s performance trends as the number of training samples increases. Furthermore, calibration curves (CCs) were used to evaluate the consistency between the model’s predicted probabilities and the actual sample probabilities. Lastly, decision curve analysis (DCA) was applied to quantify the clinical benefit of the model. We did not perform an in-depth tuning of the XGBoost model. This was primarily due to our aim of maintaining the simplicity of the model and mitigating the risk of overfitting that could arise from excessive tuning. Furthermore, given the limitations of time and computational resources, we chose to proceed with the XGBoost model using its default parameters for subsequent analysis.

Data analysis

Baseline data analysis as well as the construction of binary logistic regression and machine learning models were conducted on the DxAI platform (https://www.xsmartanalysis.com). All data analyses were performed via R version 4.2.3 and Python version 3.11.4. For single-factor regression analysis, the statsmodels Python package version 0.11.1 was utilized. The classification models were implemented in Python for subsequent performance comparison: eXtreme Gradient Boosting (XGBoost) was implemented with xgboost 2.0.1, Light Gradient Boosting Machine (LightGBM) with lightgbm version 3.2.1, and the other methods with scikit-learn 1.1.3. Apart from the construction of predictive models, all other statistical analysis results were generated via the CNSknowall platform (https://cnsknowall.com), a comprehensive web service for biomedical data analysis and visualization.

Statistics

The data are presented and the mean and standard deviation (SD) of each group. Comparisons of sex distributions between the control and CHD groups were conducted with the chi-square test, whereas the other indicators were compared with the t test or Mann‒Whitney U test. When comparing data among multiple groups, we first employed ANOVA; if it yielded significant differences, we then proceeded to use Fisher’s least significant difference (LSD) method for a more detailed, pairwise comparison. A p value < 0.05was considered to indicate statistical significance.

Results

Subsection describing the patient recruitment

Initially, 1921 samples and patient data were collected. After rigorous screening, 1435 were excluded, leaving 486 final cases. Exclusion reasons included age > 80 (N = 132), recent anticoagulant use (N = 549), ineligibility for angiography, and other factors (N = 5). Based on angiography, the 486 cases were divided into Control (N = 137) and CHD (N = 349) groups. Based on coronary angiography and SYNTAX scores, the CHD group was divided into mild (N = 125, SYNTAX 1–22), moderate (N = 124, SYNTAX 22.5–32), and severe (N = 100, SYNTAX 32.5–49.5) lesions for modeling to assess CHI’s distinction among CHD severities (Fig. 1). Descriptive stats were computed (Table 1), with BMI having the highest missing rate (23.66%), but most below 16%. No imputation was done due to low missing rates’ minimal impact on analysis.

Fig. 1
figure 1

Details of the sample inclusion criteria and grouping procedure. After initial screening, the 486 included samples were divided into control and CHD groups. The CHD group was classified into mild, moderate, and severe lesion groups via the SYNTAX scoring system

Table 1 Descriptive statistics of the characteristics of the sample

CHD patients present with unique baseline characteristics and a lower CHI than controls

To verify the overall disparities between volunteers in the two groups categorized based on coronary angiography data, we thoroughly compared the baseline characteristics of the CHD group with those of the control group. The outcomes revealed differences (p<0.05) in multiple crucial physiological and biochemical parameters between the two groups. Notably, CHD patients presented with higher levels of various biomarkers, such as age, troponin, BNP, D-dimer level (DD), neutrophil count (NEU#), neutrophil-to-lymphocyte ratio (NLR), myoglobin, and creatinine kinase-MB (CK-MB), and lower diastolic blood pressure, lymphocyte count (LYM#), and lymphocyte percentage (LYM%) (Table 2). After comparing the control group with the CHD group, we found that the ratios of TLCκ/TLCλ (p = 0.120) was not significantly different, whereas the FLCκ/FLCλ ratio (p = 0.0102) demonstrated a statistically significant difference (Fig. 2). However, when we convert the differences in IgLCs into CHI, significant statistical differences are shown between the CHD group and the Control group (p<0.01).

Fig. 2
figure 2

CHD patients show distinct features and lower CHI than controls. Comparison of the classical diagnostic indicators as well as the novel CHI between the control group and the CHD group

Table 2 Baseline characteristics of the control and CHD groups

Feature selection for machine learning models

We plan to explore the diagnostic value of CHI for CHD through the construction of ML models. In the initial stage, we have already conducted a univariate analysis using a binary logistic regression model for all baseline characteristics. Based on the significance level of p < 0.05 and the predictive strength suggested by odds ratios (ORs), we have identified a set of key indicators that will serve as the foundation for the subsequent construction of ML models. The results revealed that CHI, age, troponin, BNP, HDL, TC, LYM#, NEU#, NLR, LYM%, and myoglobin were significantly different between the groups (Fig. 3; Table 3). These indicators were then used to establish the diagnostic models.

Fig. 3
figure 3

Logic regression is used for feature selection. Results of the univariate regression analysis used to screen for baseline characteristics significantly associated with CHD

Table 3 Univariate logistic regression analyses for selecting variables for model development

XGBoost classification model exhibits optimal performance

In our preliminary screening to identify the optimal machine learning model for classifying samples as CHD or control patients, we compared the performance of various models by calculating the average AUC of their ROC curves across both the training and validation sets. The model selection process was conducted on the DxAI platform, employing a five-fold cross-validation method with oversampling as the validation approach. The validation set comprised 20% of the total data, as mentioned in our methodology section, where we comprehensively evaluated nine machine learning models. Among these models, the XGBoost classifier model stood out, achieving an exceptional average AUC of 0.917 ± 0.023 (Fig. 4A). Individual AUC values for the training and validation sets are also provided to offer a more comprehensive view of the model’s performance. Using 5-fold cross-validation and variables such as CHI, Age, Troponin, BNP, HDL, TC, CKMB, Myoglobin, LYM#, NLR#, NEU, and LYM% as inputs, the XGBoost classifier model achieved nearly perfect performance in the training set, exhibiting an AUC of 1.0 ± 0.000, while metrics such as accuracy (0.996), sensitivity (0.995), and specificity (1.0) were all close to or reached 1.0 (Fig. 4B, Supplementary Table 1). In the validation set, the XGBoost classifier model maintained this good performance, with an AUC value of 0.920 ± 0.019; its accuracy (0.852), sensitivity (0.834), and specificity (0.940) in the validation set also demonstrated the good generalization ability of the XGBoost classifier model (Fig. 4C, Supplementary Table 2). The results of the DeLong test statistically validated the superiority of the XGBoost model over the other models in terms of AUC (p < 0.001) (Fig. 4D). After a comprehensive comparison of various metrics, including AUC, accuracy, specificity, and sensitivity, across both the training and validation sets for each model, the XGBoost model exhibited superior performance in both, with consistent results. Based on its performance in both sets, we can confidently conclude that the XGBoost classifier model is the optimal choice.

Fig. 4
figure 4

Screening of machine learning classification models. (A) The forest plot displays the results of ROC curve analysis for the CHD prediction models, with the data indicating the means (dots) and SDs (error bars) of the model AUCs. Through 5-fold cross-validation, XGBoost emerged as the top performer in the training set in terms of AUC. (B) Plots of the model ROC curves in the training set. (C) Plots of the model ROC curves in the validation set. The AUC value of the XGBoost classification model was the highest. (D) The DeLong test results indicate that the area under the ROC curve of the XGBoost model is significantly different from the AUCs of the other models

The XGBoost model excels in identifying coronary heart disease patients

After initially selecting the optimal model, we conducted a more thorough evaluation of the XGBoost classification model’s efficacy in diagnosing CHD patients (with respect to the actual patient outcomes through calibration curve analysis) and its potential benefits for clinical practice (through decision curve analysis). Specifically, the training, validation, and tests comprised 65%, 20%, and 15% of the original data; we trained XGBoost model with the training set through 10-fold cross-validation, then assessed model performance in the test set. Subsequently, in the training set, validation set, and test set, the XGBoost model achieved AUCs of 0.949 ± 0.003, 0.882 ± 0.088, and 0.925, respectively, while the accuracies were 0.858, 0.803, and 0.878, respectively (Fig. 5A, B and C, Supplementary Table 3). Although the AUC in the validation set did not surpass the training set AUC, the XGBoost model clearly achieved a successful fit, thereby validating its suitability for classification in this specific dataset (Fig. 5D).

The results of decision curve (Fig. 5E) and calibration curve (Fig. 5F) analyses also indicated that the XGBoost model had good clinical utility and calibration, respectively, in identifying patients with CHD. Decision curve analysis demonstrated that the model performed exceptionally well across multiple intervention thresholds, offering net benefits over “treat all” and “treat none” approaches and providing effective guidance for clinical decision-making. Calibration curve analysis indicated that the model’s predicted probabilities align well with the actual observed outcomes, with the red (fitted) line closely resembling the reference (diagonal) line, without significant overestimation or underestimation of risks. The XGBoost model exhibits not only exceptional effectiveness but also substantial clinical practicability and reliability when it comes to diagnosing CHD patients.

Fig. 5
figure 5

Performance evaluation of the XGBoost classification model. (A) 10-fold cross-validation ROC curves in the training set. (B) 10-fold cross-validation ROC curves in the validation set. (C) ROC curve in the test set. (D) The learning curve shows that model performance in both the training set and the validation set improves steadily and that the difference between them remains stable, showing that the model neither overfits nor underfits the data and is able to effectively learn from the training data and generalize to the validation data. (E) The decision curve highlights the model’s excellent performance across multiple thresholds, indicating that it could help guide clinical decision-making. (F) The calibration curve shows close alignment between the predicted and actual probabilities, demonstrating the accuracy of the model’s predictive performance

CHI decreases with severity of CHD patients

Beyond merely rapidly identifying CHD patients within high-risk populations, a precise assessment of the severity of an individual patient’s disease is paramount for crafting personalized treatment strategies and enhancing patient outcomes. Leveraging the SYNTAX scoring system, we have systematically divided the CHD group samples into mild, moderate, and severe categories. Our preliminary analysis unveiled striking disparities in the Coronary Heart Index (CHI) across these four groups, notably observing a decline in CHI as the SYNTAX score escalated, with the control group exhibiting the highest CHI values (Fig. 6A).

Subsequently, we embarked on developing a methodology to distinguish severe CHD patients from healthy controls. Our baseline data analysis disclosed significant variations in a multitude of indicators between these two groups, as detailed in Table 4. Employing univariate regression analysis, we pinpointed a total of 12 salient features for subsequent model selection and construction (Fig. 6B; Table 5). Following examination, we harnessed these screened variables to formulate classification models utilizing nine distinct algorithms tailored for the CHD versus control classification task.

Fig. 6
figure 6

CHI decreases with severity of CHD patients. (A) As the severity of CHD increases, the CHI decreases, as depicted in a set of box plots. (B) The forest plot displays the ROC curve results for the CHD prediction models, with dots and error bars indicating the means and SDs of the AUC. Through 2-fold cross-validation, XGBoost emerged as the top performer in the training set according to the AUC

Table 4 Comparison of baseline characteristics between the control and severe CHD groups
Table 5 Univariate logistic regression selects clinical data for CHD severity classification model development

XGBoost also accurately identifies severe coronary heart disease patients

Through rigorous comparisons, XGBoost again emerged as the optimal model for distinguishing between the control and severe groups (Supplementary Fig. 1, Supplementary Table 4, Supplementary Table 5). We first employed 10-fold cross-validation and conducted ROC curve analysis in the training and validation sets, achieving average AUC values of 1.0 ± 0.000 and 0.97 ± 0.036, respectively, indicating that the XGBoost model demonstrated excellent performance in predicting severe CHD patients (Fig. 7A and B); this performance was validated in the test set, in which ROC curve analysis yielded an AUC of 0.991 (Fig. 7C).

The learning curves showed similar trends between the training and validation sets, with small differences between them, indicating strong generalizability. As the sample size increased, the curves remained stable (Fig. 7D). Decision curve analysis revealed that the model provided greater net clinical benefits than strategies assuming all or none of the patients had severe CHD, confirming its clinical utility (Fig. 7E). Calibration curve analysis showed that the model was overconfident in predicting severe CHD, with a Brier score of 0.09 (Fig. 7F); nevertheless, considering the F1 score (0.857), accuracy (Table 6), and other evaluation metrics such as the AUC, we concluded that the model’s predictions are relatively trustworthy.

Fig. 7
figure 7

XGBoost model identifies patients with severe CHD. (A) 10-fold cross-validation ROC curves in the training set. (B) 10-fold cross-validation ROC curves in the validation set. (C) ROC curve in the test set. (D) The learning curves exhibit similar trends between the training and validation sets with small differences, indicating strong generalization capabilities and near-optimal performance. (E) Decision curve analysis reveals that the model outperforms alternative treatment strategies and is clinically effective. (F) The calibration curve reveals the model’s overconfidence in predicting severe CHD

Table 6 Comprehensive summary of the performance metrics of the model in the three sets

Development of online prediction tool for external validation

To obtain a wider range of external validation data, we have developed and presented an online web tool through the Roche i-Research platform. This tool generates predictive models based on current algorithms and can predict the risk of a positive result by setting parameters (Fig. 8A). Upon inputting relevant parameters, the tool calculates the probability of a subject having CHD (https://jg4dbugczt6zmxqjt7asg4.streamlit.app/). Subsequently, we conducted a preliminary external validation of the tool to assess its practical application effects. Samples were collected according to a unified standard from the Clinical Laboratory of Lequn Branch, The First Hospital of Jilin University, totaling 59 blood samples from patients with CHD. According to the SYNTAX score, 12 of the cases belong to severely complex lesions. By comparing the prediction results with clinical diagnoses, we found that the accuracy of the tool in diagnosing coronary heart disease was 86.44%, and the accuracy for diagnosing patients with severe lesions reached 100%, demonstrating good clinical application value and accuracy (Supplementary Table 6).

Fig. 8
figure 8

Development of online prediction tool. The visualization of the prediction model through Roche i-Research platform

Discussion

Despite significant advancements in the diagnosis and treatment of CHD in recent decades, there are still notable deficiencies in routinely screening high-risk populations to accurately identify asymptomatic CHD patients. Moreover, in line with the growing demand for precision medicine, there remains a gap in utilizing blood samples to rapidly and conveniently differentiate CHD patients based on varying degrees of severity, thereby guiding clinicians to adopt appropriate treatment strategies. Recent studies have uncovered potential associations between IgLCs and CHD, yet conventional diagnostic markers such as the FLCκ/λ have proven ineffective for direct application in CHD screening and diagnosis, necessitating further exploration and optimization. In this study, we developed a novel index based on IgLCs called the CHI, which could not only aid in the early diagnosis of CHD but also precisely identified patients with severe CHD. By including it as an input to an XGBoost classification model, furthermore, the model demonstrates exceptional accuracy and sensitivity, ensuring its reliability in clinical applications.

Owing to the long-term progression of coronary artery atherosclerosis, CHD often develops subtly, with patients potentially remaining asymptomatic for extended periods. Indeed, the proportion of asymptomatic CHD patients even exceeds that of symptomatic patients [22]. Current diagnostic gold standards, which often involve invasive procedures and potential radiation risks, limit their applicability in screening a broader population of asymptomatic individuals with potential CHD. Prior studies have attempted to explore the potential correlation between the presence of FLCs in the blood and the development of CHD. One study revealed that there was no significant difference in the levels of FLCs between patients with ST-elevation myocardial infarction (STEMI) and healthy controls [19]. However, unlike in that study, the indicator we used in our research was the FLC κ/λ ratio, which is widely adopted in the diagnosis of multiple myeloma [23]. Our research showed no significant difference in TLCκ/TLCλ ratio between control and CHD groups, but a statistically significant difference was found in FLCκ/FLCλ ratio. This suggests that, while TLCκ/TLCλ changes may not be substantial in CHD, FLCκ/FLCλ alterations may indicate specific pathological processes. However, a previous study, using a different population or methodology, found no significant difference in the FLCκ/FLCλ between healthy controls and patients with type 2 diabetes, stable angina, non-STEMI, or STEMI. Interestingly, that study classified participants based on a median FLC κ/λ ratio of 0.63, and those with a κ/λ ratio above 0.63 were more likely to exhibit improved cardiac function over time [20]. This suggests that, although the FLCκ/FLCλ ratio may not differentiate between healthy and diseased states in all studies, it may still hold prognostic value in certain subpopulations. Therefore, we should concentrate on developing an assessment system for IgLCs that is grounded in FLCs and offers greater universality and precision.

Based on the information we have gathered, the ratio of kappa and lambda free light chains (FLC κ/FLC λ) in peripheral blood remains relatively stable under healthy conditions [14]. However, in disease states, the balance between the production and clearance of light chains is disrupted, leading to excessive accumulation of a specific light chain type in the blood [11]. This indicates that changes in the light chain ratio may not only be a direct outcome of certain diseases but also serve as an early signal of disease risk. Nevertheless, such a single ratio change may not comprehensively reflect the complex impact of diseases on the metabolism of light chains in the immune system, as found in our research results. In another study on multiple myeloma, variations in the levels of FLCs and TLCs suggested that when one type of monoclonal IgLCs (κ or λ) is overproduced, the production of the other type of light chain tends to be inhibited [24]. Therefore, we have decided to explore a more comprehensive method of assessing light chain ratio changes, specifically, a composite indicator that can reflect the relative changes in the proportions of light chains in both their free and total (bound) states.

Specifically, we sought to compare the proportions of the κ and λ types of FLC with the respective type of TLC, that is, the ratio of FLC κ/TLC κ to FLC λ/TLC λ. This method of calculation not only takes into account the total amount of IgLCs but also considers their distribution in different states, thereby enabling a more accurate reflection of the impact of disease on IgLCs metabolism. Our results indicated that this ratio, which we subsequently named the CHI, was significantly lower among CHD patients than control patients, suggesting that it may serve as a potential biomarker for CHD risk assessment. Moreover, to exclude the possibility that the differences in this index reflect underlying kidney damage [25], we additionally tested three major renal function assessment indicators in the peripheral blood of the patients, namely, creatinine, urea, and uric acid, and found no abnormalities in the CHD group. This discovery represents an important step in understanding the relationship between the immune system and cardiovascular health.

In our study, we found that the mean TLC κ/λ ratio was approximately 1.5, and the mean ratio of FLC κ/λ was approximately 1, which falls within the normal range provided by the reagent manufacturer. Therefore, a clear challenge arises: in terms of utilizing IgLCs for diagnosing diseases other than multiple myeloma and plasma cell dyscrasias, such as CHD, current approaches involving concentrations or ratios of κ and/or λ light chains cannot meet current diagnostic needs. Therefore, we postulate that the diagnostic model we devised, which incorporates the CHI, a comprehensive index that takes into account both the total amount of IgLCs and their distribution in different states, into an XGBoost-based model, has the potential to serve as a solution to this challenge. Among the nine machine learning models constructed using CHI and other clinical indicators, we identified the XGBoost-based model as the most suitable classifier for this dataset, achieving a diagnostic accuracy rate of over 85% for CHD in the final test set. Several evaluations were also conducted to further verify its generalization ability and reliability. These evaluations confirmed the stability and consistency of the model across different subsets of the data.

Notably, in this study, we found that as the severity of CHD increased (that is, as the SYNTAX score increased), the CHI gradually decreased. According to multiple pieces of evidence, coronary artery bypass grafting (CABG) is generally preferred over PCI for treating CHD when the SYNTAX score exceeds 33 or in cases involving distal bifurcation lesions or multivessel disease. However, for patients with a limited lesion range, shorter life expectancy, or high surgical risk, PCI may be a more suitable option [5, 26, 27]. The aforementioned discovery once again underscores the pivotal role of precision medicine in the treatment of CHD. Given the complex and variable conditions under which CHD can develop, a precise and personalized treatment plan is crucial, and the prompt and accurate identification of these patients is key to achieving this goal. Encouragingly, our research findings indicate that the XGBoost model has outstanding performance in the classification of patients with severe CHD according to the SYNTAX score. The AUC in the test set reached 0.991, demonstrating that the model had nearly perfect performance in identifying severe CHD patient. Moreover, the model also exhibited an accuracy of 0.857, which, alongside its desirable consistency and generalization capabilities, making it a reliable choice for CHD patient classification. Therefore, the predictive tool we have developed can serve as an auxiliary means for clinicians, providing valuable reference opinions when selecting appropriate treatment methods for patients with varying conditions of CHD.

However, it is important to acknowledge the limitations of this study, which should be considered in future research endeavors. While the CHI demonstrated promising results, but the sample size currently utilized for modeling merely meets the minimum standard, its applicability and generalizability must be further validated in larger and more diverse patient populations. This will provide a more comprehensive understanding of the index’s effectiveness across different demographic and clinical profiles. Furthermore, the current study relies heavily on the accuracy and consistency of laboratory data for IgLCs. However, it is recognized that discrepancies in detection standards and methodologies among different laboratories can potentially impact the accuracy and reproducibility of the CHI. To address this limitation, future research should focus on developing standardized detection protocols and implementing rigorous quality control measures to ensure the reliability and comparability of results across various laboratories. Lastly, regarding the online prediction tool we have developed, it is still in the testing phase. Its primary objective is to gather more external validation data in a broader context, in order to further verify the clinical utility of this tool. However, we must admit that the main limitation of this tool currently is its inability to automatically calculate CHI, which may pose certain limitations in practical applications. In our subsequent work, we will continue to optimize this tool. Despite these limitations, the findings of this study offer valuable insights into the potential of the CHI as a diagnostic or prognostic tool. As such, ongoing efforts to refine and validate the index in broader contexts will be crucial for advancing its clinical applications and ultimately improving patient outcomes.

Conclusions

Our findings indicate that the CHI, when incorporated into an XGBoost-based model, can serve as a noninvasive, efficient, and reliable diagnostic tool that may complement existing imaging techniques and assist clinicians in making informed treatment decisions. Use of this innovative approach could lead to more precise and effective therapeutic strategies, ultimately improving patient outcomes. Future studies should aim to validate the model’s performance in larger, more diverse patient populations, explore the integration of additional biomarkers and clinical factors, and refine the model’s ability to identify subtle changes in disease progression. Additionally, research on the cost-effectiveness and clinical feasibility of implementing such a precision medicine approach in real-world settings is crucial to ensure its widespread adoption and successful implementation. Furthermore, there is a need for more longitudinal studies in the future to track and evaluate the changes in CHI over different time periods within the same patient, as well as variations across subgroups such as different age groups, genders, or those with specific complications. This will enable a more accurate determination of its long-term effectiveness in predicting the progression of CHD and assessing treatment outcomes. Such research will enhance our comprehensive understanding of the clinical significance of CHI and provide robust support for its widespread adoption and successful implementation in precision medicine.

Data availability

The study is currently ongoing. To protect the privacy of study participants, the data supporting the findings of this study are available from the corresponding author, Taiyu Zhai, upon reasonable request.

Abbreviations

CHD:

Coronary heart disease

BNP:

Brain natriuretic peptide

IgLCs:

Immunoglobulin light chains

TLCs:

Total immunoglobulin light chains

FLCs:

Free immunoglobulin light chains

CVD:

Cardiovascular disease

CHI:

Coronary health index

ML:

Machine learning

LR:

Logistic regression

RF:

Random forest

AdaBoost:

Adaptive boosting

GNB:

Gaussian naive Bayes

MLP:

Multilayer perceptron

SVM:

Support vector machine

KNN:

k-nearest neighbors

ROC:

Receiver operating characteristic curve

AUC:

Area under the curve

LCs:

Learning curves

CCs:

Calibration curves

DCA:

Decision curve analysis

References

  1. Erdil N, GENE/CELL THERAPY SIGNALING, AND ADVANCED NANOBIOMATERIALS. Adv Biology Earth Sci. 2024;9(Special Issue):58–80.

    Article  Google Scholar 

  2. Malakar AK, Choudhury D, Halder B, Paul P, Uddin A, Chakraborty S. A review on coronary artery disease, its risk factors, and therapeutics. J Cell Physiol. 2019;234(10):16812–23.

    Article  CAS  PubMed  Google Scholar 

  3. Safiri S, Karamzad N, Singh K, Carson-Chahhoud K, Adams C, Nejadghaderi SA, Almasi-Hashiani A, Sullman MJM, Mansournia MA, Bragazzi NL, et al. Burden of ischemic heart disease and its attributable risk factors in 204 countries and territories, 1990–2019. Eur J Prev Cardiol. 2022;29(2):420–31.

    Article  PubMed  Google Scholar 

  4. Theofilis P, Antonopoulos AS, Sagris M, Papanikolaou A, Oikonomou E, Tsioufis K, Tousoulis D. Silent myocardial ischemia: from pathophysiology to diagnosis and treatment. Biomedicines 2024, 12(2).

  5. Gaba P, Christiansen EH, Nielsen PH, Murphy SA, O’Gara PT, Smith PK, Serruys PW, Kappetein AP, Park SJ, Park DW, et al. Percutaneous coronary intervention vs coronary artery bypass graft surgery for left main disease in patients with and without Acute Coronary syndromes: a pooled analysis of 4 randomized clinical trials. JAMA Cardiol. 2023;8(7):631–9.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Lee PH, Hong SJ, Kim HS, Yoon YW, Lee JY, Oh SJ, Lee JS, Kang SJ, Kim YH, Park SW, et al. Quantitative coronary angiography vs intravascular Ultrasonography to Guide Drug-Eluting Stent Implantation: a Randomized Clinical Trial. JAMA Cardiol. 2024;9(5):428–35.

    Article  PubMed  Google Scholar 

  7. Otsuka F, Joner M, Prati F, Virmani R, Narula J. Clinical classification of plaque morphology in coronary disease. Nat Rev Cardiol. 2014;11(7):379–89.

    Article  PubMed  Google Scholar 

  8. Thygesen K, Alpert JS, Jaffe AS, Chaitman BR, Bax JJ, Morrow DA, White HD. Executive Group on behalf of the Joint European Society of Cardiology /American College of Cardiology /American Heart Association /World Heart Federation Task Force for the Universal Definition of Myocardial I: fourth universal definition of myocardial infarction (2018). J Am Coll Cardiol. 2018;72(18):2231–64.

    Article  PubMed  Google Scholar 

  9. Januzzi JL, Troughton R. Are serial BNP measurements useful in heart failure management? Serial natriuretic peptide measurements are useful in heart failure management. Circulation. 2013;127(4):500–7. discussion 508.

    Article  PubMed  Google Scholar 

  10. Khan A, Roy P, Ley K. Breaking tolerance: the autoimmune aspect of atherosclerosis. Nat Rev Immunol. 2024;24(9):670–9.

  11. Gambino CM, Agnello L, Lo Sasso B, Giglio RV, Di Stefano V, Candore G, Pappalardo EM, Ciaccio AM, Brighina F, Vidali M, et al. The role of serum free light chain as biomarker of Myasthenia Gravis. Clin Chim Acta. 2022;528:29–33.

    Article  CAS  PubMed  Google Scholar 

  12. Wang J, Zhao Y, Ren Z, Sun A, Huang Z, Li X. Investigation of the diagnostic value of sFLC and other indicators in the detection of M protein in LCMM. Clin Lab 2023, 69(10).

  13. [2014 International Myeloma Working. Group updated criteria for the diagnosis of multiple myeloma]. Nihon Rinsho. 2016;74(Suppl 5):264–8.

    Google Scholar 

  14. Gudowska-Sawczuk M, Mroczko B. Free light chains kappa and lambda as new biomarkers of selected diseases. Int J Mol Sci 2023, 24(11).

  15. Matsumori A, Shimada T, Nakatani E, Shimada M, Tracy S, Chapman NM, Drayson MT, Hartz VL, Mason JW. Immunoglobulin free light chains as an inflammatory biomarker of heart failure with myocarditis. Clin Immunol. 2020;217:108455.

    Article  CAS  PubMed  Google Scholar 

  16. Cerro-Pardo I, Lindholt JS, Nunez E, Roldan-Montero R, Ortega-Villanueva L, Vegas-Dominguez C, Gomez-Guerrero C, Michel JB, Blanco-Colio LM, Vazquez J, et al. Combined Immunoglobulin Free Light Chains are Novel predictors of Cardiovascular events in patients with abdominal aortic aneurysm. Eur J Vasc Endovasc Surg. 2022;63(5):751–8.

    Article  PubMed  Google Scholar 

  17. Shantsila E, Wrigley B, Lip GY. Free light chains in patients with acute heart failure secondary to atherosclerotic coronary artery disease. Am J Cardiol. 2014;114(8):1243–8.

    Article  CAS  PubMed  Google Scholar 

  18. Matsumori A. Targeting inflammation in the diagnosis, management, and Prevention of Cardiovascular diseases. Glob Heart. 2022;17(1):80.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Shantsila E, Tapp LD, Lip GY. Free light chains in patients with acute coronary syndromes: relationships to inflammation and renal function. Int J Cardiol. 2015;185:322–7.

    Article  PubMed  Google Scholar 

  20. Basile U, La Rosa G, Napodano C, Pocino K, Cappannoli L, Gulli F, Cianfrocca C, Di Stasio E, Biasucci LM. Free light chains a novel biomarker of cardiovascular disease. A pilot study. Eur Rev Med Pharmacol Sci. 2019;23(6):2563–9.

    CAS  PubMed  Google Scholar 

  21. Li X, Xu C, Shang C, Wang Y, Xu J, Zhou Q. Machine learning predicts the risk of hemorrhagic transformation of acute cerebral infarction and in-hospital death. Comput Methods Programs Biomed. 2023;237:107582.

    Article  PubMed  Google Scholar 

  22. Hamdan M, Kossaify A. Silent myocardial ischemia revisited, another silent killer, emphasis on the Diagnostic Value of Stress Echocardiography with focused update and review. Adv Biomed Res. 2023;12:245.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Dimopoulos MA, Merlini G, Bridoux F, Leung N, Mikhael J, Harrison SJ, Kastritis E, Garderet L, Gozzetti A, van de Donk N, et al. Management of multiple myeloma-related renal impairment: recommendations from the International Myeloma Working Group. Lancet Oncol. 2023;24(7):e293–311.

    Article  PubMed  Google Scholar 

  24. Moghimi M, Kashkooli Behroozi M, Maghbooli M, Jafari S, Mazloomzadeh S, Pezeshgi A. Association between abnormal serum free light chains ratio and known prognostic factors in lymphoma; a nephrology viewpoint. J Ren Inj Prev. 2017;6(2):148–52.

    Article  CAS  PubMed  Google Scholar 

  25. Sirac C, Batuman V, Sanders PW. The proximal tubule toxicity of Immunoglobulin Light Chains. Kidney Int Rep. 2021;6(5):1225–31.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Disney L, Ramaiah C, Ramaiah M, Keshavamurthy S. Left main coronary artery disease in diabetics: percutaneous coronary intervention or coronary artery bypass grafting? Int J Angiol. 2021;30(3):194–201.

    Article  PubMed  PubMed Central  Google Scholar 

  27. Thuijs D, Kappetein AP, Serruys PW, Mohr FW, Morice MC, Mack MJ, Holmes DR Jr., Curzen N, Davierwala P, Noack T, et al. Percutaneous coronary intervention versus coronary artery bypass grafting in patients with three-vessel or left main coronary artery disease: 10-year follow-up of the multicentre randomised controlled SYNTAX trial. Lancet. 2019;394(10206):1325–34.

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

The authors are grateful to the i-Research team of Roche for their help to improve the model. In this manuscript, AI (DxAI platform) was involved in selecting appropriate statistical analysis methods and generating the final analysis results during the baseline analysis of patients.

Funding

This manuscript is supported by the National Natural Science Foundation of China (82302591), the Health Commission of Jilin Province (2022JC059) and the Project of Jilin Provincial Department of Education (JJKH20241327KJ).

Author information

Authors and Affiliations

Authors

Contributions

W.B.R was responsible for data collection and analysis, and drafted the initial version of the paper. Z.C.Z was responsible for sample collection and assisted in the literature review. Y.F.W, J.Y.W, L.L and L.S participated in the collection of samples and data. T.Y.Z was responsible for the research design and participated in writing and revising the initial version of the paper. J.H participated in data interpretation and paper revision, providing important insights into research methods and discussion of results.

Corresponding authors

Correspondence to Taiyu Zhai or Jing Huang.

Ethics declarations

Ethics approval and consent to participate

This study was approved by the institutional review board and ethics committee of the First Hospital, Jilin University (2024 − 729) and was conducted according to the ethical standards established in the Declaration of Helsinki.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ren, W., Zhang, Z., Wang, Y. et al. Coronary health index based on immunoglobulin light chains to assess coronary heart disease risk with machine learning: a diagnostic trial. J Transl Med 23, 22 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12967-024-06043-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12967-024-06043-4

Keywords