Lo Yu-Lung, Chen Yi-Sheng, Wang Po-Yu, Chang Ching-Min, Wei Guan-Ting, Hung Wei-Chun
Department of Mechanical Engineering, National Cheng Kung University (NCKU), Tainan 701401, Taiwan.
Academy of Innovative Semiconductor and Sustainable Manufacturing (AISSM), National Cheng Kung University (NCKU), Tainan 701401, Taiwan.
Biomed Opt Express. 2024 Jul 29;15(8):4909-4924. doi: 10.1364/BOE.529032. eCollection 2024 Aug 1.
This study utilizes a Mueller matrix-based system to extract accurate glucose levels from human fingertips, addressing challenges in skin complexity. Integration of domain knowledge and data science aims to enhance prediction accuracy using a Random Forest model. The primary goal is to improve glucose level predictions by selecting effective features based on the Pearson product-moment correlation coefficient (PPMCC). The interpolation compensates for delayed glucose concentration. This study integrates domain knowledge and data science, combining a Mueller matrix-based system and a random forest model. It is noted that 16 effective features were identified from 27 test points collected from a healthy volunteer in the laboratory. These features were divided into training and prediction sets in a ratio of 8:2. As a result, the regression coefficient, R, was 0.8907 and the mean absolute relative difference (MARD) was 6.8%, respectively. This significantly improves prediction accuracy, demonstrating the model's robustness and reliability in accurately forecasting outcomes based on the identified features. In addition, in the Institutional Review Board (IRB) tests at NCKU's hospital, all data passed the same preprocessing and model. The measurement results from an individual diabetic patient demonstrate high accuracy for blood glucose concentrations below 150 mg/dL, with acceptable deviation at higher levels and no severe error zones. Over a three-month period, data from the participating diabetic patient showed a MARD of 4.44% with the R of 0.836, and the other patient recorded a MARD of 7.79% with the R of 0.855. The study shows the proposed approach accurately extracts glucose levels. Integrating domain knowledge, data science, and effective strategies significantly improves prediction accuracy.
本研究利用基于穆勒矩阵的系统从人体指尖提取准确的血糖水平,解决皮肤复杂性方面的挑战。领域知识与数据科学的整合旨在使用随机森林模型提高预测准确性。主要目标是通过基于皮尔逊积矩相关系数(PPMCC)选择有效特征来改善血糖水平预测。插值法可补偿延迟的葡萄糖浓度。本研究整合了领域知识和数据科学,将基于穆勒矩阵的系统与随机森林模型相结合。值得注意的是,从实验室中一名健康志愿者采集的27个测试点中识别出了16个有效特征。这些特征按8:2的比例分为训练集和预测集。结果,回归系数R为0.8907,平均绝对相对差异(MARD)为6.8%。这显著提高了预测准确性,证明了该模型基于所识别特征准确预测结果的稳健性和可靠性。此外,在国立成功大学医院的机构审查委员会(IRB)测试中,所有数据都经过了相同的预处理和模型。一名糖尿病患者的测量结果表明,对于血糖浓度低于150 mg/dL的情况具有较高的准确性,在较高水平时偏差可接受,且不存在严重误差区域。在三个月的时间里,参与研究的糖尿病患者的数据显示MARD为4.44%,R为0.836,另一名患者记录的MARD为7.79%,R为0.855。该研究表明所提出的方法能够准确提取血糖水平。整合领域知识、数据科学和有效策略可显著提高预测准确性。