1Machine Intelligence in Clinical Neuroscience (MICN) Laboratory, Department of Neurosurgery, Clinical Neuroscience Center, University Hospital Zurich, University of Zurich, Zurich, Switzerland.
2Amsterdam UMC, Vrije Universiteit Amsterdam, Department of Neurosurgery, Amsterdam Movement Sciences, Amsterdam.
Neurosurg Focus. 2021 Nov;51(5):E8. doi: 10.3171/2021.8.FOCUS21386.
What is considered "abnormal" in clinical testing is typically defined by simple thresholds derived from normative data. For instance, when testing using the five-repetition sit-to-stand (5R-STS) test, the upper limit of normal (ULN) from a population of spine-healthy volunteers (10.5 seconds) is used to identify objective functional impairment (OFI), but this fails to consider different properties of individuals (e.g., taller and shorter, older and younger). Therefore, the authors developed a personalized testing strategy to quantify patient-specific OFI using machine learning.
Patients with disc herniation, spinal stenosis, spondylolisthesis, or discogenic chronic low-back pain and a population of spine-healthy volunteers, from two prospective studies, were included. A machine learning model was trained on normative data to predict personalized "expected" test times and their confidence intervals and ULNs (99th percentiles) based on simple demographics. OFI was defined as a test time greater than the personalized ULN. OFI was categorized into types 1 to 3 based on a clustering algorithm. A web app was developed to deploy the model clinically.
Overall, 288 patients and 129 spine-healthy individuals were included. The model predicted "expected" test times with a mean absolute error of 1.18 (95% CI 1.13-1.21) seconds and R2 of 0.37 (95% CI 0.34-0.41). Based on the implemented personalized testing strategy, 191 patients (66.3%) exhibited OFI. Type 1, 2, and 3 impairments were seen in 64 (33.5%), 91 (47.6%), and 36 (18.8%) patients, respectively. Increasing detected levels of OFI were associated with statistically significant increases in subjective functional impairment, extreme anxiety and depression symptoms, being bedridden, extreme pain or discomfort, inability to carry out activities of daily living, and a limited ability to work.
In the era of "precision medicine," simple population-based thresholds may eventually not be adequate to monitor quality and safety in neurosurgery. Individualized assessment integrating machine learning techniques provides more detailed and objective clinical assessment. The personalized testing strategy demonstrated concurrent validity with quality-of-life measures, and the freely accessible web app (https://neurosurgery.shinyapps.io/5RSTS/) enabled clinical application.
临床检测中的“异常”通常是通过源自正态数据的简单阈值来定义的。例如,在使用五次重复坐立测试(5R-STS)进行测试时,使用脊柱健康志愿者群体的正常上限(10.5 秒)来确定客观功能障碍(OFI),但这并未考虑到个体的不同特性(例如,高矮、老少)。因此,作者开发了一种个性化测试策略,使用机器学习来量化患者特定的 OFI。
纳入了来自两项前瞻性研究的椎间盘突出症、椎管狭窄症、脊椎滑脱症或椎间盘源性慢性下腰痛患者以及脊柱健康志愿者群体。基于简单的人口统计学数据,使用机器学习模型对正态数据进行训练,以预测个性化“预期”测试时间及其置信区间和正常上限(99%分位数)。OFI 定义为测试时间大于个性化 ULN 的情况。根据聚类算法将 OFI 分为 1 型至 3 型。开发了一个网络应用程序来在临床上部署该模型。
总体而言,共纳入了 288 名患者和 129 名脊柱健康个体。该模型预测“预期”测试时间的平均绝对误差为 1.18 秒(95%置信区间为 1.13-1.21),R2 为 0.37(95%置信区间为 0.34-0.41)。基于实施的个性化测试策略,191 名患者(66.3%)出现 OFI。1 型、2 型和 3 型损伤分别见于 64 名(33.5%)、91 名(47.6%)和 36 名(18.8%)患者。OFI 检测水平的增加与主观功能障碍、极度焦虑和抑郁症状、卧床不起、极度疼痛或不适、无法进行日常生活活动以及工作能力受限等显著增加有关。
在“精准医学”时代,基于人群的简单阈值最终可能不足以监测神经外科的质量和安全性。整合机器学习技术的个体化评估提供了更详细和客观的临床评估。个性化测试策略与生活质量测量具有同时效度,并且免费的网络应用程序(https://neurosurgery.shinyapps.io/5RSTS/)可实现临床应用。