Email Alert | RSS

Chinese Journal of Antituberculosis ›› 2026, Vol. 48 ›› Issue (6): 830-839.doi: 10.19982/j.issn.1000-6621.20260040

• Original Articles • Previous Articles     Next Articles

Construction of an optimal prediction model for treatment outcomes in retreatment pulmonary tuberculosis

Du Xilong1, Maiwulajiang·Yimamu 2, Na Yan1, Paziliya·Yasheng 1, Guo Gang3, Maiweilanjiang·Abulimiti 4, Zhang Liping5, Zheng Yanling5()   

  1. 1 School of Public Health, Xinjiang Medical University, Urumqi 830017, China
    2 Kashgar Prefecture Center for Disease Control and Prevention, Xinjiang Uygur Autonomous Region, Kashgar 844000, China
    3 Urumqi Maternal and Child Health Hospital, Xinjiang Uygur Autonomous Region, Urumqi 830037, China
    4 Kashgar Prefecture Tuberculosis Control Institute (Kashgar Prefecture Pulmonary Hospital), Xinjiang Uygur Autonomous Region, Kashgar 844000, China
    5 School of Medical Engineering and Technology, Xinjiang Medical University, Urumqi 830017, China
  • Received:2026-01-22 Online:2026-06-10 Published:2026-05-25
  • Contact: Zheng Yanling E-mail:zhengyl_math@sina.cn
  • Supported by:
    Outstanding Young Talents in Xinjiang-Youth Science and Technology Innovation Talent Project(2024TSYCCX0080);Outstanding Young Talents in Xinjiang-Youth Science and Technology Innovation Talent Project(2024TSYCJC0061);College Students’ Innovation and Entrepreneurship Training Program Project(S202510760111)

Abstract:

Objective: Based on the clinical data of patients with retreatment pulmonary tuberculosis, this study systematically compared the performance of nine machine learning models in treatment outcome classification to construct the optimal prediction model and provide a predictive tool for optimizing the management of retreatment pulmonary tuberculosis patients. Methods: A total of 1396 patients with retreatment pulmonary tuberculosis registered and managed by the Kashgar in Xinjiang Uyghur Autonomous Region from January 1 to December 31, 2022 were enrolled. They were randomly divided into a training set (978 cases) and a test set (418 cases) at a 7∶3 ratio. Feature selection was performed using Random Forest and Cramér’s V coefficient. Nine machine learning models were constructed, including Logistic Regression, Support Vector Machine, Decision Tree, Random Forest, XGBoost, LightGBM, Gradient Boosting Tree, Multilayer Perceptron, and CatBoost. Model performance was evaluated using accuracy, precision, recall, F1-score, area under the curve (AUC), and average precision (AP). The optimal model was selected and interpreted using SHapley Additive exPlanations (SHAP) analysis. Results: The CatBoost model exhibited the best overall performance on the test set, with an accuracy of 89.2%, precision of 88.5%, recall of 89.2%, F1-score of 0.885, AUC of 0.829, and AP of 0.941. SHAP analysis revealed that 2-month sputum smear examination (absolute MeanSHAP value=0.595), treatment regimen (absolute MeanSHAP value=0.367), and treatment mode (absolute MeanSHAP value=0.290) were the three features with the highest contribution to predicting treatment outcomes in retreatment pulmonary tuberculosis. Conclusion: The CatBoost model performed robustly and excellently in predicting adverse treatment outcomes in patients with retreatment pulmonary tuberculosis. Combined with SHAP interpretation, it can effectively identify key predictors and provide an effective tool for risk assessment of adverse treatment outcomes in retreatment pulmonary tuberculosis patients.

Key words: Tuberculosis, pulmonary, Retreatment, Treatment outcome, Models, statistical, Forecasting

CLC Number: