Objective To construct the prediction model of hospitalization expenses for ischemic heart disease, reveal the key factors affecting hospitalization expenses, and analyze the interaction between variables. Methods Patients from Sichuan medical insurance comprehensive service platform from January 2020 to December 2021 were extracted. The training set and test set were divided according to the ratio of 7∶3. Six machine learning models were constructed and trained by ten-fold cross validation, and was explained by SHAP theory. Results XGBoost model had the best performance among these models, with a R2 of 0.60, RMSE of 9 969.71 yuan, and MAE of 5 242.90 yuan in the test set. SHAP results showed that the five variables with the greatest impact on hospitalization expenses were surgery, length of stay, hospital grade, disease type and DRG. Hospitalization costs were higher when grade 3 or 4 procedures were performed, the length of stay was prolonged, the hospitalization was in a tertiary hospital, and payments were made for acute myocardial infarction and non-DRG. With the prolongation of hospital stay, the hospitalization expenses increased faster when the patient had grade 4 surgery and was in a tertiary hospital. In addition, DRG payment will reduce the length of hospital stay and the hospitalization expenses of patients with different disease types. Conclusion The interpretable XGBoost model constructed in this study has a good predictive performance for the hospitalization expenses of patients with ischemic heart disease. Combined with SHAP theory, it can effectively identify the key factors affecting the hospitalization expenses and analyze their interactions.
Citation:
TU Lang, LI Shulei, FAN Ruoxi, YU Jingxin, ZHOU Xiaoyuan. Prediction of hospitalization expenses for ischemic heart disease: machine learning based on SHAP theory. Chinese Journal of Evidence-Based Medicine, 2025, 25(8): 881-887. doi: 10.7507/1672-2531.202503190
Copy
Copyright © the editorial department of Chinese Journal of Evidence-Based Medicine of West China Medical Publisher. All rights reserved
1. |
|
2. |
|
3. |
|
4. |
|
5. |
|
6. |
|
7. |
|
8. |
|
9. |
|
10. |
|
11. |
|
12. |
|
13. |
|
14. |
四川省医疗保障局. 关于省本级开展基本医疗保险区域点数法总额预算和按疾病诊断相关分组(DRG)付费工作的实施意见(川医保规〔2021〕11号). 2021.
|
15. |
|
16. |
|
17. |
|
18. |
|
19. |
|
20. |
|
21. |
|
22. |
Lundberg SM, Lee SI. A unified approach to interpreting model predictions. 2017.
|
23. |
|
24. |
|
25. |
|
26. |
|
27. |
|
28. |
|
29. |
|
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
- 11.
- 12.
- 13.
- 14. 四川省医疗保障局. 关于省本级开展基本医疗保险区域点数法总额预算和按疾病诊断相关分组(DRG)付费工作的实施意见(川医保规〔2021〕11号). 2021.
- 15.
- 16.
- 17.
- 18.
- 19.
- 20.
- 21.
- 22. Lundberg SM, Lee SI. A unified approach to interpreting model predictions. 2017.
- 23.
- 24.
- 25.
- 26.
- 27.
- 28.
- 29.