Watts Jeremy, Khojandi Anahita, Vasudevan Rama, Ramdhani Ritesh
Annu Int Conf IEEE Eng Med Biol Soc. 2020 Jul;2020:5406-5409. doi: 10.1109/EMBC44109.2020.9175311.
More than one million people currently live with Parkinson's Disease (PD) in the U.S. alone. Medications, such as levodopa, can help manage PD symptoms. However, medication treatment planning is generally based on patient history and limited interaction between physicians and patients during office visits. This limits the extent of benefit that may be derived from the treatment as disease/patient characteristics are generally non-stationary. Wearable sensors that provide continuous monitoring of various symptoms, such as bradykinesia and dyskinesia, can enhance symptom management. However, using such data to overhaul the current static medication treatment planning approach and prescribe personalized medication timing and dosage that accounts for patient/care-giver/physician feedback/preferences remains an open question. We develop a model to prescribe timing and dosage of medications, given the motor fluctuation data collected using wearable sensors in real-time. We solve the resulting model using deep reinforcement learning (DRL). The prescribed policy determines the optimal treatment plan that minimizes patient's symptoms. Our results show that the model-prescribed policy outperforms the static a priori treatment plan in improving patients' symptoms, providing a proof-of-concept that DRL can augment medical decision making for treatment planning of chronic disease patients.
仅在美国,目前就有超过100万人患有帕金森病(PD)。左旋多巴等药物有助于控制帕金森病症状。然而,药物治疗方案通常基于患者病史,且在门诊期间医生与患者之间的互动有限。由于疾病/患者特征通常是不稳定的,这限制了治疗可能带来的益处程度。可穿戴传感器能够持续监测各种症状,如运动迟缓及运动障碍,从而加强症状管理。然而,利用这些数据彻底改革当前静态的药物治疗方案,并根据患者/护理人员/医生的反馈/偏好开出个性化的用药时间和剂量,仍是一个悬而未决的问题。我们开发了一个模型,根据使用可穿戴传感器实时收集的运动波动数据来开出药物的用药时间和剂量。我们使用深度强化学习(DRL)来求解所得模型。所规定的策略确定了能使患者症状最小化的最佳治疗方案。我们的结果表明,在改善患者症状方面,模型规定的策略优于静态的先验治疗方案,这为深度强化学习可增强慢性病患者治疗规划的医疗决策提供了概念验证。