Email Alert | RSS    帮助

中国防痨杂志 ›› 2026, Vol. 48 ›› Issue (1): 94-105.doi: 10.19982/j.issn.1000-6621.20250311

• 论著 • 上一篇    下一篇

基于logistic回归、决策树和神经网络构建成人慢性肾脏病合并活动性结核病的风险预测模型与应用

陈德盼1,2, 李翔3, 张开义4, 李敏1,2, 夏加伟1, 高楚伊1, 杨亚涛1, 张乐1()   

  1. 1云南省传染性疾病临床医学中心/昆明市第三人民医院重症医学科,昆明650041
    2大理大学公共卫生学院,大理671000
    3云南省传染性疾病临床医学中心/昆明市第三人民医院影像科,昆明650041
    4云南省传染性疾病临床医学中心/昆明市第三人民医院结核科,昆明650041
  • 收稿日期:2025-07-31 出版日期:2026-01-10 发布日期:2025-12-31
  • 通信作者: 张乐 E-mail:9035643@qq.com
  • 基金资助:
    云南省科技厅重大科技专项计划项目(202402AA310011);云南省临床医学中心科研项目(2024YNLCYXZX0208);重症传染性疾病救治医学技术中心[2023-SW(技)-22];昆明市卫生科研课题项目(2025-12-05-003)

To construct a risk prediction model for adult chronic kidney disease complicated with active tuberculosis based on logistic regression, decision tree, and neural network

Chen Depan1,2, Li Xiang3, Zhang Kaiyi4, Li Min1,2, Xia Jiawei1, Gao ChuYi1, Yang Yatao1, Zhang Le1()   

  1. 1Clinical Medical Center for Infectious Diseases of Yunnan Province/Department of Critical Care Medicine, Third People’s Hospital of Kunming, Kunming 650041, China
    2School of Public Health, Dali University, Dali 671000, China
    3Clinical Medical Center for Infectious Diseases of Yunnan Province/Department of Imaging, Third People’s Hospital of Kunming, Kunming 650041, China
    4Clinical Medical Center for Infectious Diseases of Yunnan Province/Department of Tuberculosis, Third People’s Hospital of Kunming, Kunming 650041, China
  • Received:2025-07-31 Online:2026-01-10 Published:2025-12-31
  • Contact: Zhang Le E-mail:9035643@qq.com
  • Supported by:
    The Major Science and Technology Special Program of the Department of Science and Technology of Yunnan Province(202402AA310011);Research Project of Clinical Medical Center of Yunnan Province(2024YNLCYXZX0208);Medical Technology Center for Severe Infectious Diseases (2023-SW(Tech)-22);Health Research Project of Kunming City(2025-12-05-003)

摘要:

目的: 探究logistic回归模型、决策树模型和神经网络模型在预测慢性肾脏病(chronic kidney disease,CKD)患者合并活动性结核病中的应用价值。方法: 回顾性分析2021年1月至2024年1月昆明市第三人民医院收治的CKD患者392例,其中合并活动性结核病患者266例,为观察组,未合并活动性结核病患者126例,为对照组。收集两组患者的临床资料及实验室指标,采用logistic回归、决策树和神经网络三种模型筛选影响因素并构建风险预测模型,通过受试者工作特征(receiver operating characteristic,ROC)曲线比较模型的预测效能。结果: logistic回归模型的ROC曲线下面积(AUC)为0.726(95%CI: 0.676~0.777),敏感度为45.1%,特异度为92.1%,同时识别出饮酒、咯血、低淋巴细胞计数、低红细胞压积、高尿酸、高纤维蛋白原降解产物和低白细胞介素10(IL-10)水平为独立预测因子(P值均<0.05)。决策树模型的AUC为0.825(95%CI: 0.783~0.868),敏感度为62.0%,特异度为82.5%,该模型以纤维蛋白原降解产物为首要分层变量,进一步纳入CD4+ T淋巴细胞、淋巴细胞计数、红细胞压积、食欲不振和尿酸等变量构建决策路径。神经网络模型的预测效能最优,其AUC为0.876(95%CI: 0.843~0.909),敏感度为60.9%,特异度高达98.4%,其特征重要性分析显示,IL-10、红细胞压积、纤维蛋白原降解产物和CD4+ T淋巴细胞为排名前4的预测因子。结论: 神经网络模型对CKD患者合并活动性结核病的预测能力最佳。本研究识别出的关键预测因子,如低淋巴细胞计数、低CD4+ T淋巴细胞计数、高纤维蛋白原降解产物等有助于界定筛查重点人群。三种模型优势互补,可协同应用于临床:对于存在免疫功能抑制(如淋巴细胞、CD4+ T淋巴细胞计数低下)、凝血纤溶激活(如纤维蛋白原降解产物升高)及咯血等特征的CKD患者,应加强结核病筛查。

关键词: 结核, 肾脏病学, Logistic模型, 决策树, 神经网络(计算机), 预测

Abstract:

Objective: To evaluate the application value of logistic regression model, decision tree model, and neural network model in predicting active tuberculosis among patients with chronic kidney disease (CKD). Methods: A retrospective analysis was conducted on 392 CKD patients admitted to the Third People’s Hospital of Kunming between January 2021 and January 2024. Among them, 266 patients with active tuberculosis were included in the observation group, and 126 patients without active tuberculosis were included in the control group. Clinical data and laboratory indicators were collected for both groups. Logistic regression, decision tree and neural network models were used to identify influencing factors and construct the risk prediction models. The model performance was compared using receiver operating characteristic (ROC) curve. Results: The area under the curve (AUC) of the logistic regression model was 0.726 (95%CI: 0.766-0.777), with a sensitivity of 45.1% and a specificity of 92.1%. Alcohol consumption, hemoptysis, low lymphocyte count, low hematocrit, high uric acid, high fibrinogen degradation products, and low IL-10 level were identified as independent predictors (all P values <0.05). The AUC of the decision tree model was 0.825 (95%CI: 0.783-0.868), with a sensitivity of 62.0% and a specificity of 82.5%. Fibrinogen degradation products served as the primary stratification variable, and further included variables such as CD4+ T lymphocytes, lymphocyte count, hematocrit, loss of appetite, and uric acid to construct a decision path. The neural network model demonstrated the best predictive performance, with an AUC of 0.876 (95%CI: 0.843-0.909), a sensitivity of 60.9%, and a specificity of 98.4%. Feature importance analysis indicated that IL-10, hematocrit, fibrinogen degradation products, and CD4+ T lymphocytes were the top four predictors. Conclusion: The neural network model exhibited the best predictive ability for detecting CKD patients with active tuberculosis. Key predictors identified in this study—such as low lymphocyte count, low CD4+ T lymphocyte count, and high fibrinogen degradation product—may help define the key population for screening. The three models provide complementary strenghths and may be applied synergistically applied in clinical practice. Enhanced tuberculosis screening is warrented for CKD patients with immunosuppression (e.g., low lymphocyte count, CD4+ T lymphocyte count), activation of coagulation and fibrinolysis (e.g., increased fibrinogen degradation products), or hemoptysis.

Key words: Tuberculosis, Nephrology, Logistic models, Decision trees, Neural networks (computer), Forecasting

中图分类号: