中国组织工程研究 ›› 2021, Vol. 25 ›› Issue (27): 4300-4306.doi: 10.12307/2021.186

• 数字化骨科 digital orthopedics • 上一篇    下一篇

基于机器学习算法预测全膝关节置换后住院时长

陈潮锋,石宇雄,梁锦成,何智军,何达东   

  1. 广州市番禺区中医院骨关节科,广东省广州市   510000
  • 收稿日期:2020-10-19 修回日期:2020-10-22 接受日期:2020-11-19 出版日期:2021-09-28 发布日期:2021-04-10
  • 通讯作者: 陈潮锋,广州市番禺区中医院骨关节科,广东省广州市 510000
  • 作者简介:陈潮锋,男,1985 年生,广东省广州市人,汉族,2008 年广州中医药大学毕业,副主任医师。
  • 基金资助:
    广东省中医药局科研项目(20171202),项目负责人:陈潮锋

Prediction algorithm of hospitalization duration after total knee arthroplasty based on machine learning

Chen Chaofeng, Shi Yuxiong, Liang Jincheng, He Zhijun, He Dadong   

  1. Department of Osteoarthritis, Panyu Hospital of Chinese Medicine, Guangzhou 510000, Guangdong Province, China
  • Received:2020-10-19 Revised:2020-10-22 Accepted:2020-11-19 Online:2021-09-28 Published:2021-04-10
  • Contact: Chen Chaofeng, Department of Osteoarthritis, Panyu Hospital of Chinese Medicine, Guangzhou 510000, Guangdong Province, China
  • About author:Chen Chaofeng, Associate chief physician, Department of Osteoarthritis, Panyu Hospital of Chinese Medicine, Guangzhou 510000, Guangdong Province, China
  • Supported by:
    the Scientific Research Project of Guangdong Bureau of Traditional Chinese Medicine, No. 20171202 (to CCF)

摘要:

文题释义:
机器学习:机器学习方法是通过监督、无监督或半监督方法反映高维数据的性质,可以有效降低数据维数,提高数据的理解能力,能够对高维数、大量和复杂关系数据进行分析。
人工神经网络:人工神经元就是受自然神经元静息和动作电位的产生机制启发而建立的一个运算模型。以网络拓扑知识为理论基础,模拟人脑的神经系统对复杂信息的处理机制。该模型以并行分布的处理能力、高容错性、智能化和自学习等能力为特征,将信息的加工和存储结合在一起的一种数学模型。

背景:全膝关节置换后住院时长与患者的预后密切相关,但影响住院时长的相关因素并没有深入的研究。
目的:通过临床数据建立全膝关节置换后住院时长的预测模型,使用7种机器学习算法构建模型,评估不同算法的效能,获取与住院时长最相关的影响因素。
方法:通过医院病历系统,收集广州市番禺区中医院2012年1月至2019年12月符合纳入标准的行全膝关节置换的患者共777例,详细录入患者临床资料及既往病史资料。使用逻辑回归、多元自适应回归、K近邻、支持向量机、随机森林、极限梯度提算法和人工神经网络等7种机器学习构建算法模型,采用10倍交叉验证法对模型效能进行验证。计算并比较7个模型的受试者工作特征曲线下的面积、正确率、灵敏度、特异度、精度和平衡F分数。并评估人工神经网络模型预测变量的重要性,神经网络构架和绘制性状相关热图。
结果与结论:①共纳入777个样本,其中包括618例住院时长≤6 d的患者和159例住院时长>6 d的患者;②两组年龄、术前血红蛋白、手术方式、糖尿病病史、缺血性心脏病病史、脑血管疾病史、输血情况比较差异有显著性意义(P < 0.05);③逻辑回归、多元自适应回归、K最近邻、支持向量机、随机森林、极限梯度提算法和人工神经网络受试者工作特征曲线下的面积依次为:0.770,0.778,0.609,0.570,0.594,0.586和0.903,其中人工神经网络的预测效能最佳,同时通过正确率、灵敏度、特异度、精度和平衡F分数的比较判断认为人工神经网络相较于其他模型能够更准确的预测结局指标,其次是逻辑回归、多元自适应回归算法;④在人工神经网络模型中,手术时长和年龄在预测变量中重要性最高,领先于其他预测变量,而心力衰竭病史、心血管疾病、缺血性心脏病、手术方式、血红蛋白、输血情况、胰岛素使用情况、糖尿病病史和阻塞性睡眠呼吸暂停等因素重要性较前两者低,但仍是高相关性的预测变量,神经网络构架图也证实了上述因素的重要性;⑤结果证实,人工神经网络、逻辑回归和多元自适应回归算法这3种机器学习算法均可用于全膝关节置换后住院时长的预测,但人工神经网络的预测效果比逻辑回归和元自适应回归算法更准确。手术时长、年龄、心力衰竭病史、心血管疾病、缺血性心脏病、手术方式和血红蛋白等指标在人工神经网络模型中与住院时长密切相关,同时使用神经网络模型可以对患者进行个性化预测。人工神经网络预测模型的识别效能较高,这有助于全膝关节置换患者缩短住院时长提高医院病床利用率负担。
https://orcid.org/0000-0002-4158-6816 (陈潮锋) 

中国组织工程研究杂志出版内容重点:人工关节;骨植入物;脊柱;骨折;内固定;数字化骨科;组织工程

关键词: 骨, 膝, 关节, 机器学习, 关节置换, 住院时长, 人工神经网络, 预测模型

Abstract: BACKGROUND: The length of hospital stay after total knee arthroplasty is closely related to the prognosis of patients, but the related factors that affect the length of hospital stay have not been studied in depth.  
OBJECTIVE: The prediction model of length of stay after total knee arthroplasty was established based on clinical data. Seven machine learning algorithms were used to construct the model, evaluate the effectiveness of different algorithms, and obtain the most relevant influencing factors of length of stay.
METHODS:  Through the hospital medical record system, a total of 777 patients who underwent total knee arthroplasty that met the inclusion criteria from January 2012 to December 2019 were collected. The patient’s clinical data and past medical history were entered in detail. Seven kinds of machine learning, such as logistic regression, multiple adaptive regression, K-nearest neighbor, support vector machine, random forest, extreme gradient extraction algorithm, and artificial neural network, were used to build an algorithm model, and the 10-fold cross-validation method is used to verify the effectiveness of the model. The area under the receiver operating characteristic curve, accuracy, sensitivity, specificity, accuracy and F1 score of the seven models were calculated and compared. The importance of the predictive variables of the artificial neural network model, the neural network architecture and draw heat maps related to the traits were evaluated.  
RESULTS AND CONCLUSION: (1) We included a total of 777 samples, including 618 patients who were hospitalized for less than or equal to 6 days and 159 patients who were hospitalized for more than 6 days. (2) There were significant differences in age, preoperative hemoglobin, operation method, diabetes history, ischemic heart disease history, cerebrovascular disease history and blood transfusion between the two groups (P < 0.05). (3) Logistic regression, multiple adaptive regression, K-nearest neighbor, support vector machine, random forest, extreme gradient algorithm and artificial neural network area under the receiver operating characteristic curve were 0.770, 0.778, 0.609, 0.570, 0.594, 0.586, and 0.903 in sequence. The prediction efficiency of artificial neural network was the best. Simultaneously, through the comparison of accuracy, sensitivity, specificity, precision and F1 score, it is found that the artificial neural network performs best, followed by logistic regression and multiple adaptive regression algorithms. (4) In the artificial neural network model, the length and age of surgery were the most important among the predictors, leading the other predictors, and the history of heart failure, cardiovascular disease, ischemic heart disease, surgical methods, hemoglobin, blood transfusion, insulin use, history of diabetes, and obstructive sleep apnea were less important than the first two, but they were still highly correlated predictors. The neural network architecture also confirms the importance of these factors. (5) It is concluded that the three machine learning algorithms of artificial neural network, logistic regression and multiple adaptive regression algorithm can be used to predict the length of hospitalization after total knee arthroplasty, but the prediction effect of artificial neural network was more than that of logistic regression and meta-adaptive regression algorithm accurate. The length of surgery, age, history of heart failure, cardiovascular disease, ischemic heart disease, surgical method and hemoglobin are closely related to the length of hospitalization in the artificial neural network model. Simultaneously, using the neural network model can personally predict the patient. The artificial neural network prediction model has a high recognition efficiency, which helps to improve the utilization rate of hospital beds and better plan the length of hospital stay.

Key words: bone, knee, joint, machine learning, arthroplasty, length of hospital stay, artificial neural network, prediction model

中图分类号: