IEEE J Biomed Health Inform. 2020 Jun;24(6):1717-1726. doi: 10.1109/JBHI.2019.2954553. Epub 2019 Nov 20.
We describe a novel machine-learning based method to estimate total Hemoglobin (Hb) using photoplethysmograms (PPGs) acquired non-invasively.
In a study conducted in Karnataka, India, 1583 women (pregnant and non-pregnant) of childbearing age, with Hb values ranging between 1.6 to 14.8 g/dL, had their Hb values estimated using intravenous blood samples and concurrently by a finger sensor custom designed and prototyped for this study. The finger sensor collected PPG signals at four wavelengths: 590 nm, 660 nm, 810 nm, and 940 nm. A novel feature vector was derived from these PPGs. A machine learning model comprising of a two-layer stack of regressors including Least Absolute Shrinkage and Selection Operator (LASSO), Ridge, Elastic Net, Adaptive (Ada) Boost and Support Vector Regressors (SVR) was designed and tested.
We report a statistically significant Pearson's correlation coefficient (PCC) of 0.81 (p < 0.01) between the Hb value estimated by the proposed methodology and gold standard values of Hb, with a Root Mean Square Error (RMSE) of 1.353 ± 0.042 g/dL. The performance of the stacked regressor model was significantly better than the performance of individual regressors (low RMSE, and better CC; p < 0.05). Post-hoc analysis showed that including pregnant women in the training data set significantly improved the performance of the algorithm.
This article demonstrates the feasibility of a machine learning based non-invasive hemoglobin measurement system, especially for maternal anemia detection.
By developing and demonstrating a machine learning approach on a large data set, we have demonstrated that such an approach could become the basis for a public health screening tool to detect and treat maternal anemia and could supplement global health intervention strategies.
我们描述了一种基于机器学习的新方法,用于使用非侵入式光体积描记图(PPG)估算总血红蛋白(Hb)。
在印度卡纳塔克邦进行的一项研究中,1583 名育龄妇女(孕妇和非孕妇)的 Hb 值范围在 1.6 至 14.8 g/dL 之间,她们的 Hb 值通过静脉血样和专为这项研究设计和制作的手指传感器同时进行估算。手指传感器在四个波长处采集 PPG 信号:590nm、660nm、810nm 和 940nm。从这些 PPG 中得出一个新的特征向量。设计并测试了一个由两层回归器组成的机器学习模型,包括最小绝对收缩和选择算子(LASSO)、岭回归、弹性网络、自适应(Ada)Boost 和支持向量回归器(SVR)。
我们报告了所提出的方法估算的 Hb 值与 Hb 金标准值之间具有统计学意义的皮尔逊相关系数(PCC)为 0.81(p < 0.01),根均方误差(RMSE)为 1.353 ± 0.042 g/dL。堆叠回归器模型的性能明显优于单个回归器(低 RMSE 和更好的 CC;p < 0.05)。事后分析表明,将孕妇纳入训练数据集可显著提高算法的性能。
本文证明了基于机器学习的非侵入性血红蛋白测量系统的可行性,特别是用于检测孕妇贫血。
通过在大型数据集上开发和演示机器学习方法,我们证明了这种方法可以成为检测和治疗孕妇贫血的公共卫生筛查工具的基础,并可以补充全球卫生干预策略。