• Department of Anesthesiology, The First Affiliated Hospital of Xinjiang Medical University, Urumqi, 830054, P. R. China;
DAI Xiaowen, Email: 582124869@qq.com
Export PDF Favorites Scan Get Citation

Objective  To develop and compare the predictive performance of five machine learning models for adverse postoperative outcomes in cardiac surgery patients, and to identify key decision factors through SHAP interpretability analysis.Methods  A retrospective collection of perioperative data (including demographic information, preoperative, intraoperative, and postoperative indicators) with 88 variables was conducted from adult cardiac surgery patients at the First Affiliated Hospital of Xinjiang Medical University in 2023. Adverse postoperative outcomes were defined as the occurrence of acute kidney injury and/or in-hospital mortality during the postoperative hospitalization period following cardiac surgery. Patients were divided into an adverse outcome group and a favorable outcome group based on the presence of adverse postoperative outcomes. After screening feature variables using the Least Absolute Shrinkage and Selection Operator (LASSO) regression method, five machine learning models were constructed: Extreme Gradient Boosting (XGBoost), Random Forest (RF), Gradient Boosting Machine (GBM), Light Gradient Boosting Machine (LightGBM), and Generalized Linear Model (GLM). The dataset was split into a training set (n=447) and a test set (n=192) using stratified sampling. Model performance was evaluated using Receiver Operating Characteristic (ROC) curves, Decision Curve Analysis (DCA), and F1 Score. The SHAP method was applied to analyze feature contribution.Results: A total of 639 patients were included, comprising 395 males and 244 females, with a median age of 62 (55, 69) years. The adverse outcome group consisted of 191 patients, while the favorable outcome group included 448 patients, resulting in an adverse postoperative outcome incidence of 29.9%. Univariate analysis showed no significant differences between the two groups for any variables (P>0.05). Using LASSO regression, 16 feature variables were selected (including cardiopulmonary bypass support time, blood glucose on postoperative day 3, creatine kinase-MB isoenzyme, systemic inflammatory response index, etc.), and five machine learning models (GLM, RF, GBM, LightGBM, XGBoost) were constructed. Evaluation results demonstrated that the XGBoost model exhibited the best predictive performance on both the training and test sets, with AUC values of 0.761 [95%CI (0.719, 0.800) ] and 0.759 [95%CI (0.692, 0.818) ], respectively. It also significantly outperformed other models in precision, positive predictive value (PPV), and balanced accuracy. Decision curve analysis further confirmed its clinical utility across various risk thresholds. SHAP analysis indicated that variables such as cardiopulmonary bypass support time, blood glucose on postoperative day 3, creatine kinase-MB isoenzyme, and inflammatory markers (SIRI, NLR, CAR) had high contributions to the prediction.Conclusion: The XGBoost model effectively predicts adverse postoperative outcomes in cardiac surgery patients. Clinically, attention should be focused on cardiopulmonary bypass support time, postoperative blood glucose control, and monitoring of inflammatory levels to improve patient prognosis.

Copyright © the editorial department of Chinese Journal of Clinical Thoracic and Cardiovascular Surgery of West China Medical Publisher. All rights reserved