ObjectiveTo identify the risk factors of bone metastasis in breast cancer and construct a predictive model. MethodsThe data of breast cancer patients met inclusion and exclusion criteria from 2010 to 2015 were obtained from the SEER*Stat database. Additionally, the data of breast cancer patients diagnosed with distant metastasis in the Affiliated Hospital of Southwest Medical University from 2021 to 2023 were collected. The patients from the SEER database were randomly divided into training (70%) and validation (30%) sets using R software, and the breast cancer patients from the Affiliated Hospital of Southwest Medical University were included in the validation set. The univariate and multivariate logistic regressions were used to identify risk factors of breast cancer bone metastasis. A nomogram predictive model was then constructed based on these factors. The predictive effect of the nomogram was evaluated using the area under the receiver operating characteristic curve (AUC), calibration curve, and decision curve analysis. ResultsThe study included 8 637 breast cancer patients, with 5 998 in the training set and 2 639 (including 68 patients in the Affiliated Hospital of Southwest Medical University) in the validation set. The statistical differences in the race and N stage were observed between the training and validation sets (P<0.05). The multivariate logistic regression analysis revealed that being of white race, having a low histological grade (Ⅰ–Ⅱ), positive estrogen and progesterone receptors status, negative human epidermal growth factor receptor 2 status, and non-undergoing surgery for the primary breast cancer site increased the risk of breast cancer bone metastasis (P<0.05). The nomogram based on these risk factors showed that the AUC (95% CI) of the training and validation sets was 0.676 (0.533, 0.744) and 0.690 (0.549, 0.739), respectively. The internal calibration using 1 000 Bootstrap samples demonstrated that the calibration curves for both sets closely approximated the ideal 45-degree reference line. The decision curve analysis indicated a stronger clinical utility within a certain probability threshold range. ConclusionsThis study constructs a nomogram predictive model based on factors related to the risk of breast cancer bone metastasis, which demonstrates a good consistency between actual and predicted outcomes in both training and validation sets. The nomogram shows a stronger clinical utility, but further analysis is needed to understand the reasons of the lower differentiation of nomogram in both sets.
With the increasing availability of clinical and biomedical big data, machine learning is being widely used in scientific research and academic papers. It integrates various types of information to predict individual health outcomes. However, deficiencies in reporting key information have gradually emerged. These include issues like data bias, model fairness across different groups, and problems with data quality and applicability. Maintaining predictive accuracy and interpretability in real-world clinical settings is also a challenge. This increases the complexity of safely and effectively applying predictive models to clinical practice. To address these problems, TRIPOD+AI (transparent reporting of a multivariable prediction model for individual prognosis or diagnosis+artificial intelligence) introduces a reporting standard for machine learning models. It is based on TRIPOD and aims to improve transparency, reproducibility, and health equity. These improvements enhance the quality of machine learning model applications. Currently, research on prediction models based on machine learning is rapidly increasing. To help domestic readers better understand and apply TRIPOD+AI, we provide examples and interpretations. We hope this will support researchers in improving the quality of their reports.
Objective The management of pulmonary nodules is a common clinical problem, and this study constructed a nomogram model based on FUT7 methylation combined with CT imaging features to predict the risk of adenocarcinoma in patients with pulmonary nodules. Methods The clinical data of 219 patients with pulmonary nodules diagnosed by histopathology at the First Affiliated Hospital of Zhengzhou University from 2021 to 2022 were retrospectively analyzed. The FUT7 methylation level in peripheral blood were detected, and the patients were randomly divided into training set (n=154) and validation set (n=65) according to proportion of 7:3. They were divided into a lung adenocarcinoma group and a benign nodule group according to pathological results. Single-factor analysis and multi-factor logistic regression analysis were used to construct a prediction model in the training set and verified in the validation set. The receiver operating characteristic (ROC) curve was used to evaluate the discrimination of the model, the calibration curve was used to evaluate the consistency of the model, and the clinical decision curve analysis (DCA) was used to evaluate the clinical application value of the model. The applicability of the model was further evaluated in the subgroup of high-risk CT signs (located in the upper lobe, vascular sign, and pleural sign). Results Multivariate logistic regression analysis showed that female, age, FUT7_CpG_4, FUT7_CpG_6, sub-solid nodules, lobular sign and burr sign were independent risk factors for lung adenocarcinoma (P<0.05). A column-line graph prediction model was constructed based on the results of the multifactorial analysis, and the area under the ROC curve was 0.925 (95%CI 0.877 - 0.972 ), and the maximum approximate entry index corresponded to a critical value of 0.562, at which time the sensitivity was 89.25%, the specificity was 86.89%, the positive predictive value was 91.21%, and the negative predictive value was 84.13%. The calibration plot predicted the risk of adenocarcinoma of pulmonary nodules was highly consistent with the risk of actual occurrence. The DCA curve showed a good clinical net benefit value when the threshold probability of the model was 0.02 - 0.80, which showed a good clinical net benefit value. In the upper lobe, vascular sign and pleural sign groups, the area under the ROC curve was 0.903 (95%CI 0.847 - 0.959), 0.897 (95%CI 0.848 - 0.945), and 0.894 (95%CI 0.831 - 0.956). Conclusions This study developed a nomogram model to predict the risk of lung adenocarcinoma in patients with pulmonary nodules. The nomogram has high predictive performance and clinical application value, and can provide a theoretical basis for the diagnosis and subsequent clinical management of pulmonary nodules.
ObjectiveTo analyze the relevant risk factors affecting postoperative relapse-free survival (RFS) in the primary gastrointestinal stromal tumors (GIST) and develop a Nomogram predictive model of postoperative RFS for the GIST patients. MethodsThe patients diagnosed with GIST by postoperative pathology from January 2011 to December 2020 at the First Hospital of Lanzhou University and Gansu Provincial People’s Hospital were collected, and then were randomly divided into a training set and a validation set at a ratio of 7∶3 using R software function. The univariate and multivariate Cox regression analysis were used to identify the risk factors affecting the RFS for the GIST patients after surgery, and then based on this, the Nomogram predictive model was constructed to predict the probability of RFS at 3- and 5-year after surgery for the patients with GIST. The effectiveness of the Nomogram was evaluated using the area under the receiver operating characteristic curve (AUC), consistency index (C-index), and calibration curve, and the clinical utility of the Nomogram and the modified National Institutes of Health (M-NIH) classification standard was evaluated using the decision curve analysis (DCA). ResultsA total of 454 patients were included, including 317 in the training set and 137 in the validation set. The results of multivariate Cox regression analysis showed that the tumor location, tumor size, differentiation degree, American Joint Committee onCancer TNM stage, mitotic rate, CD34 expression, treatment method, number of lymph node detection, and targeted drug treatment time were the influencing factors of postoperative RFS for the GIST patients (P<0.05). The Nomogram predictive model was constructed based on the influencing factors. The C-index of the Nomogram in the training set and validation set were 0.731 [95%CI (0.679, 0.783)] and 0.685 [95%CI (0.647, 0.722)], respectively. The AUC (95%CI) of distinguishing the RFS at 3- and 5-year after surgery were 0.764 (0.681, 0.846) and 0.724 (0.661, 0.787) in the training set and 0.749 (0.625, 0.872) and 0.739 (0.647, 0.832) in the validation set, respectively. The calibration curve results showed that a good consistency of the 3-year and 5-year recurrence free survival rates between the predicted results and the actual results in the training set, while which was slightly poor in the validation set. There was a higher net benefit for the 3-year recurrence free survival rate after GIST surgery when the threshold probability range was 0.19 to 0.57. When the threshold probability range was 0.44 to 0.83, there was a higher net benefit for the 5-year recurrence free survival rate after GIST surgery. And within the threshold probability ranges, the net benefit of the Nomogram was better than the M-NIH classification system at the corresponding threshold probability. ConclusionsThe results of this study suggest that the patients with GIST located in the other sites (mainly including the esophagus, duodenum, and retroperitoneum), with tumor size greater than 5 cm, poor or undifferentiated differentiation, mitotic rate lower than 5/50 HPF, negative CD34 expression, ablation treatment, number of lymph nodes detected more than 4, and targeted drug treatment time less than 3 months need to closely pay attentions to the postoperative recurrence. The discrimination and clinical applicability of the Nomogram predictive model are good.
ObjectiveTo study the correlation between Periostin, interleukin-33 (IL-33), and chronic cough after thoracoscopic lobectomy in patients with coronary artery bypass grafting (CABG) combined with lung cancer. Methods A total of 102 lung cancer and coronary heart disease patients at Tianjin Chest Hospital from January 2022 to January 2024 were prospectively enrolled, and they were divided into a chronic cough group (n=42) and a non-chronic cough group (n=60) based on whether chronic cough occurred after surgery. Serum levels of Periostin and IL-33 were measured on the 1st, 7th, and 14th days post-lobectomy. The Pearson method was employed to analyze the correlation between Periostin and IL-33 levels and the severity of cough. Univariate and multivariate logistic regression analyses were conducted to identify factors influencing the occurrence of chronic cough. Additionally, receiver operating characteristic (ROC) curve analysis was utilized to assess the potential value of serum Periostin and IL-33 levels in predicting postoperative chronic cough. Results In patients with chronic cough, the peripheral blood Periostin and IL-33 levels measured on days 7 and 14 were significantly higher than those in patients with non-chronic cough, and the interactions between the two groups and at different time points were significant (P<0.001). The degree of cough was positively correlated with the levels of Periostin and IL-33 on days 7 and 14 (P<0.05), but had no significant correlation with the levels on day 1 (P>0.05). In patients with lung cancer, after thoracoscopic lobectomy, Periostin [OR=1.619, 95%CI (1.295, 2.025)] and IL-33 [OR=1.831, 95%CI (1.216, 2.758)] on day 7 and Periostin [OR=1.952, 95%CI (1.306, 2.918)] and IL-33 [OR=1.742, 95%CI (1.166, 2.603)] on day 14 were identified as risk factors for chronic cough. ROC curve analysis showed that the sensitivity of Periostin on day 7 was 69.05%, the specificity was 71.67%, and the area under the curve (AUC) was 0.756 [95%CI (0.616, 0.893)]. The sensitivity of Periostin on day 14 increased to 71.43% and the specificity was 76.67%, AUC was 0.762 [95%CI (0.633, 0.898)]. At the same time, the critical value of IL-33 on day 7 was 45.03 pg/mL, the sensitivity and specificity were both 83.33%, the AUC was 0.884 [95%CI (0.789, 0.980)], and the critical value of IL-33 on day 14 was 56.01 pg/mL, the sensitivity was 85.71%, the specificity was 80.00%, and the AUC was 0.899 [95%CI (0.799, 0.999)]. Joint logistic regression analysis of Periostin and IL-33 levels on days 7 and 14 showed showed that the sensitivity was 95.24%, the specificity was 95.00%, and the AUC reached 0.993 [95%CI (0.979, 1.000)]. ConclusionPeriostin and IL-33 levels, measured at various time points, are abnormally elevated following thoracoscopic lobectomy in patients with combined CABG and lung cancer. These levels significantly correlate with cough severity. Given their predictive potential for chronic cough, these markers are deemed valuable biomarkers.
ObjectiveTo explore the risk factors which affect the severity of acute pancreatitis by using machine learning algorithms. MethodsA retrospective review was conducted of medical records from 262 patients hospitalized for acute pancreatitis at the Second Affiliated Hospital of Zhengzhou University between October 2022 and February 2024. Patients were classified according to the revised edition Atlanta Classification into mild cases (n=146) and non-mild cases (n=116). LASSO analysis was employed to identify predictors for non-mild acute pancreatitis. Six machine learning algorithms, including extreme gradient boosting, random forest, logistic regression, decision tree, support vector machine, and K-nearest neighbors were integrated to construct predictive models. Model performance was evaluated by comparing the following metrics: area under the curve (AUC), sensitivity, specificity, accuracy, F1 score, calibration curves, and decision curves. ResultsThrough LASSO regression analysis, six feature variables, including heart rate, white blood cell count, neutrophil count, C-reactive protein, albumin, and calcium ion were selected to train and test machine learning models. Results showed that extreme gradient boosting achieved the highest AUC value of 0.93 on the test set, making it the optimal model. The sensitivity, specificity, accuracy, Brier score, and F1 score of the extreme gradient boosting model were 0.97, 0.70, 0.85, 0.108, and 0.84. ConclusionThe prediction model developed using extreme gradient boosting has high clinical utility value, helps to predict the severity of acute pancreatitis at an early stage and is valuable in guiding clinical decision-making.
ObjectiveTo establish and internally validate a predictive model for poorly differentiated adenocarcinoma based on CT imaging and tumor marker results. MethodsPatients with solid and partially solid lung nodules who underwent lung nodule surgery at the Department of Thoracic Surgery, the Affiliated Brain Hospital of Nanjing Medical University in 2023 were selected and randomly divided into a training set and a validation set at a ratio of 7:3. Patients' CT features, including average density value, maximum diameter, pleural indentation sign, and bronchial inflation sign, as well as patient tumor marker results, were collected. Based on postoperative pathological results, patients were divided into a poorly differentiated adenocarcinoma group and a non-poorly differentiated adenocarcinoma group. Univariate analysis and logistic regression analysis were performed on the training set to establish the predictive model. The receiver operating characteristic (ROC) curve was used to evaluate the model's discriminability, the calibration curve to assess the model's consistency, and the decision curve to evaluate the clinical value of the model, which was then validated in the validation set. ResultsA total of 299 patients were included, with 103 males and 196 females, with a median age of 57.00 (51.00, 67.25) years. There were 211 patients in the training set and 88 patients in the validation set. Multivariate analysis showed that carcinoembryonic antigen (CEA) value [OR=1.476, 95%CI (1.184, 1.983), P=0.002], cytokeratin 19 fragment antigen (CYFRA21-1) value [OR=1.388, 95%CI (1.084, 1.993), P=0.035], maximum tumor diameter [OR=6.233, 95%CI (1.069, 15.415), P=0.017], and average density [OR=1.083, 95%CI (1.020, 1.194), P=0.040] were independent risk factors for solid and partially solid lung nodules as poorly differentiated adenocarcinoma. Based on this, a predictive model was constructed with an area under the ROC curve of 0.896 [95%CI (0.810, 0.982)], a maximum Youden index corresponding cut-off value of 0.103, sensitivity of 0.750, and specificity of 0.936. Using the Bootstrap method for 1000 samplings, the calibration curve predicted probability was consistent with actual risk. Decision curve analysis indicated positive benefits across all prediction probabilities, demonstrating good clinical value. ConclusionFor patients with solid and partially solid lung nodules, preoperative use of CT to measure tumor average density value and maximum diameter, combined with tumor markers CEA and CYFRA21-1 values, can effectively predict whether it is poorly differentiated adenocarcinoma, allowing for early intervention.
Objective To identify independent risk factors for in-hospital all-cause mortality in patients with sepsis and to integrate them into the quick Sequential Organ Failure Assessment (qSOFA) score to construct modified models, thereby improving the ability of the original qSOFA to predict mortality risk. Methods This retrospective study included adult patients who met the Sepsis-3 criteria for sepsis and were admitted to the Intensive Care Unit or Emergency Intensive Care Unit of Zigong Fourth People’ s Hospital between January 2018 and December 2023. Demographic characteristics, vital signs, comorbidities, and laboratory parameters were collected, and the Sequential Organ Failure Assessment (SOFA) and qSOFA scores were calculated. Multivariable logistic regression analysis was used to identify independent predictors of in-hospital mortality. Independent predictors were dichotomized according to cut-off values derived from receiver operating characteristic (ROC) curves and combined with qSOFA to construct new models. The ROC analysis with bootstrap validation was used to assess predictive performance, and comparative performance was further evaluated using net reclassification improvement (NRI) and integrated discrimination improvement (IDI). Results A total of 218 patients were included. Multivariable logistic regression analysis identified blood urea nitrogen (BUN) [odds ratio (OR)=1.100, 95% confidence interval (CI) (1.040, 1.170)] and qSOFA [OR=2.610, 95%CI (1.450, 4.920)] as independent risk factors for in-hospital mortality, whereas high-density lipoprotein cholesterol (HDL-C) was an independent protective factor [OR=0.250, 95%CI (0.065, 0.841)]. After dichotomization by ROC-derived cut-off values, BUN and HDL-C were incorporated into qSOFA to generate B-qSOFA, H-qSOFA, and BH-qSOFA. Bootstrap ROC analysis showed that BH-qSOFA exhibited the highest discriminatory ability compared with all combined models as well as the conventional SOFA and qSOFA scores [area under the curve=0.803, 95%CI (0.735, 0.863)]. NRI and IDI analyses demonstrated that BH-qSOFA provided incremental prognostic improvement over qSOFA (NRI=0.969, IDI=0.165), B-qSOFA (NRI=0.644, IDI=0.054), and H-qSOFA (NRI=0.804, IDI=0.091) (all P<0.05). Conclusions Elevated BUN and qSOFA and decreased HDL-C are independent predictors of in-hospital mortality in sepsis. The BH-qSOFA model is simple and clinically practical, exhibits superior predictive performance over the original qSOFA. It may serve as a useful early instrument for prognostic risk stratification in patients with sepsis.
ObjectiveTo develop and validate a Nomogram for predicting severe immune-related adverse events (irAEs) in patients with advanced non-small cell lung cancer (NSCLC) undergoing immunotherapy based on clinical features and inflammatory indicators. MethodsA total of 423 patients with advanced NSCLC treated with immunotherapy between January 2023 and January 2025 at Tianjin Fourth Center Hospital and Tianjin Cancer Hospital Airport Hospital were enrolled. Patients were divided into a severe irAEs group (≥grade 3, n=76) and a non-severe irAEs group (n=347), then randomly allocated into training and validation cohorts (7:3 ratio) . Clinical data, neutrophil-to-lymphocyte ratio (NLR), and interleukin-6/C-reactive protein (IL-6/CRP) levels were collected. Independent risk factors for severe irAEs during immunotherapy in advanced NSCLC were identified through logistic regression analysis, and a nomogram model was constructed accordingly. The discriminative ability, accuracy, and clinical utility of the model were evaluated using receiver operating characteristic (ROC) curves, calibration curves, and decision curve analysis (DCA). ResultsAmong the 423 included patients [274 males, 149 females, aged 44-78 (60.77±5.91) years], the overall incidence of irAEs was 57.92% (245/423), with severe irAEs occurring in 17.97% (76/423). Multivariate analysis revealed that Eastern Cooperative Oncology Group (ECOG) performance score ≥2, programmed death-ligand 1 (PD-L1) expression [tumor proportion score (TPS) ≥50%], combination therapy regimen, low NLR values, and high IL-6/CRP ratio were independent risk factors for severe irAEs during immunotherapy in advanced NSCLC (P<0.05). The area under the ROC curve (AUC) was 0.948 [95%CI (0.912, 0.985)] in the training cohort and 0.946 [95%CI (0.917, 0.976)] in the validation cohort. Calibration curves and DCA demonstrated good consistency and clinical net benefit of the model. ConclusionThe nomogram integrating clinical features and inflammatory markers effectively predicts the risk of severe irAEs in advanced NSCLC patients receiving immunotherapy, exhibiting excellent discrimination, calibration, and clinical practicality.
ObjectiveTo explore the risk factors of lymph node metastasis in patients with colorectal cancer, and construct a risk prediction model to provide reference for clinical diagnosis and treatment.MethodsThe clinicopathological data of 416 patients with colorectal cancer who underwent radical resection of colorectal cancer in the Department of Gastrointestinal Surgery of the Second Affiliated Hospital of Nanchang University from May 2018 to December 2019 were retrospectively analyzed. The correlation between lymph node metastasis and preoperative inflammatory markers, clinicopathological factors and tumor markers were analyzed. Logistic regression was used to analyze the risk factors of lymph node metastasis, and R language was used to construct nomogram model for evaluating the risk of colorectal cancer lymph node metastasis before surgery, and drew a calibration curve and compared with actual observations. The Bootstrap method was used for internal verification, and the consistency index (C-index) was calculated to evaluate the accuracy of the model.ResultsThe results of univariate analysis showed that factors such as sex, age, tumor location, smoking history, hypertension and diabetes history were not significantly related to lymph node metastasis (all P>0.05). The factors related to lymph node metastasis were tumor size, T staging, tumor differentiation level, fibrinogen, neutrophil/lymphocyte ratio (NLR), platelet/lymphocyte ratio (PLR), fibrinogen/albumin ratio (FAR), fibrinogen/prealbumin ratio (FpAR), CEA, and CA199 (all P<0.05). The results of logistic regression analysis showed the FpAR [OR=3.630, 95%CI (2.208, 5.968), P<0.001], CA199 [OR=2.058, 95%CI (1.221, 3.470), P=0.007], CEA [OR=2.335, 95%CI (1.372, 3.975), P=0.002], NLR [OR=2.532, 95%CI (1.491, 4.301), P=0.001], and T staging were independent risk factors for lymph node metastasis. The above independent risk factors were enrolled to construct regression equation and nomogram model, the area under the ROC curve of this equation was 0.803, and the sensitivity and specificity were 75.2% and 73.5%, respectively. The consistency index (C-index) of the nomogram prediction model in this study was 0.803, and the calibration curve showed that the result of predicting lymph node metastasis was highly consistent with actual observations.ConclusionsFpAR>0.018, NLR>3.631, CEA>4.620 U/mL, CA199>21.720 U/mL and T staging are independent risk factors for lymph node metastasis. The nomogram can accurately predict the risk of lymph node metastasis in patients with colorectal cancer before surgery, and provide certain assistance in the formulation of clinical diagnosis and treatment plans.