Fu Mei R, Wang Yao, Li Chenge, Qiu Zeyuan, Axelrod Deborah, Guth Amber A, Scagliola Joan, Conley Yvette, Aouizerat Bradley E, Qiu Jeanna M, Yu Gary, Van Cleave Janet H, Haber Judith, Cheung Ying Kuen
Rory Meyers College of Nursing, Tandon School of Engineering of NYU, New York University, New York, NY, USA.
Electrical and Computer Engineering, Tandon School of Engineering of NYU, New York University, New York, NY, USA.
Mhealth. 2018 May 29;4:17. doi: 10.21037/mhealth.2018.04.02. eCollection 2018.
In the digital era when mHealth has emerged as an important venue for health care, the application of computer science, such as machine learning, has proven to be a powerful tool for health care in detecting or predicting various medical conditions by providing improved accuracy over conventional statistical or expert-based systems. Symptoms are often indicators for abnormal changes in body functioning due to illness or side effects from medical treatment. Real-time symptom report refers to the report of symptoms that patients are experiencing at the time of reporting. The use of machine learning integrating real-time patient-centered symptom report and real-time clinical analytics to develop real-time precision prediction may improve early detection of lymphedema and long term clinical decision support for breast cancer survivors who face lifelong risk of lymphedema. Lymphedema, which is associated with more than 20 distressing symptoms, is one of the most distressing and dreaded late adverse effects from breast cancer treatment. Currently there is no cure for lymphedema, but early detection can help patients to receive timely intervention to effectively manage lymphedema. Because lymphedema can occur immediately after cancer surgery or as late as 20 years after surgery, real-time detection of lymphedema using machine learning is paramount to achieve timely detection that can reduce the risk of lymphedema progression to chronic or severe stages. This study appraised the accuracy, sensitivity, and specificity to detect lymphedema status using machine learning algorithms based on real-time symptom report.
A web-based study was conducted to collect patients' real-time report of symptoms using a mHealth system. Data regarding demographic and clinical information, lymphedema status, and symptom features were collected. A total of 355 patients from 45 states in the US completed the study. Statistical and machine learning procedures were performed for data analysis. The performance of five renowned classification algorithms of machine learning were compared: Decision Tree of C4.5, Decision Tree of C5.0, gradient boosting model (GBM), artificial neural network (ANN), and support vector machine (SVM). Each classification algorithm has certain user-definable hyper parameters. Five-fold cross validation was used to optimize these hyper parameters and to choose the parameters that led to the highest average cross validation accuracy.
Using machine leaning procedures comparing different algorithms is feasible. The ANN achieved the best performance for detecting lymphedema with accuracy of 93.75%, sensitivity of 95.65%, and specificity of 91.03%.
A well-trained ANN classifier using real-time symptom report can provide highly accurate detection of lymphedema. Such detection accuracy is significantly higher than that achievable by current and often used clinical methods such as bio-impedance analysis. Use of a well-trained classification algorithm to detect lymphedema based on symptom features is a highly promising tool that may improve lymphedema outcomes.
在移动健康已成为医疗保健重要场所的数字时代,计算机科学的应用,如机器学习,已被证明是一种强大的医疗保健工具,通过提供比传统统计或基于专家的系统更高的准确性来检测或预测各种医疗状况。症状通常是身体功能因疾病或医疗治疗副作用而发生异常变化的指标。实时症状报告是指患者在报告时正在经历的症状报告。利用机器学习整合以患者为中心的实时症状报告和实时临床分析来开发实时精准预测,可能会改善对淋巴水肿的早期检测,并为面临终身淋巴水肿风险的乳腺癌幸存者提供长期临床决策支持。淋巴水肿与20多种令人痛苦的症状相关,是乳腺癌治疗最令人痛苦和恐惧的晚期不良反应之一。目前淋巴水肿无法治愈,但早期检测有助于患者及时接受干预,以有效管理淋巴水肿。由于淋巴水肿可在癌症手术后立即出现,也可在手术后20年才出现,因此利用机器学习实时检测淋巴水肿对于实现及时检测至关重要,这可以降低淋巴水肿进展到慢性或严重阶段的风险。本研究评估了基于实时症状报告使用机器学习算法检测淋巴水肿状态的准确性、敏感性和特异性。
开展了一项基于网络的研究,使用移动健康系统收集患者的症状实时报告。收集了有关人口统计学和临床信息、淋巴水肿状态及症状特征的数据。来自美国45个州的355名患者完成了该研究。进行了统计和机器学习程序以进行数据分析。比较了机器学习的五种著名分类算法的性能:C4.5决策树、C5.0决策树、梯度提升模型(GBM)、人工神经网络(ANN)和支持向量机(SVM)。每种分类算法都有某些用户可定义的超参数。采用五折交叉验证来优化这些超参数,并选择导致平均交叉验证准确性最高的参数。
使用机器学习程序比较不同算法是可行的。人工神经网络在检测淋巴水肿方面表现最佳,准确率为93.75%,敏感性为95.65%,特异性为91.03%。
使用实时症状报告训练有素的人工神经网络分类器可以提供高度准确的淋巴水肿检测。这种检测准确性明显高于目前常用的临床方法,如生物阻抗分析所能达到的准确性。使用训练有素的分类算法基于症状特征检测淋巴水肿是一种非常有前景的工具,可能会改善淋巴水肿的治疗效果。