Finnegan E, Villarroel M, Velardo C, Tarassenko L
Department of Engineering Science, Institute of Biomedical Engineering, University of Oxford, Oxford, UK.
J Med Eng Technol. 2019 Aug;43(6):341-355. doi: 10.1080/03091902.2019.1673844. Epub 2019 Nov 4.
There is an increasing need for fast and accurate transfer of readings from blood glucose metres and blood pressure monitors to a smartphone mHealth application, without a dependency on Bluetooth technology. Most of the medical devices recommended for home monitoring use a seven-segment display to show the recorded measurement to the patient. We aimed to achieve accurate detection and reading of the seven-segment digits displayed on these medical devices using an image taken in a realistic scenario by a smartphone camera. A synthetic dataset of seven-segment digits was developed in order to train and test a digit classifier. A dataset containing realistic images of blood glucose metres and blood pressure monitors using a variety of smartphone cameras was also created. The digit classifier was evaluated on a dataset of seven-segment digits manually extracted from the medical device images. These datasets along with the code for its development have been made public. The developed algorithm first preprocessed the input image using retinex with two bilateral filters and adaptive histogram equalisation. Subsequently, the digit segments were automatically located within the image by two techniques operating in parallel: Maximally Stable Extremal Regions (MSER) and connected components of a binarised image. A filtering and clustering algorithm was then designed to combine digit segments to form seven-segment digits. The resulting digits were classified using a Histogram of Orientated Gradients (HOG) feature set and a neural network trained on the synthetic digits. The model achieved 93% accuracy on digits found on the medical devices. The digit location algorithm achieved a F1 score of 0.87 and 0.80 on images of blood glucose metres and blood pressure monitors respectively. Very few assumptions were made of the locations of the digits on the devices so that the proposed algorithm can be easily implemented on new devices.
越来越需要将血糖仪和血压监测仪的读数快速准确地传输到智能手机移动健康应用程序中,而不依赖蓝牙技术。大多数推荐用于家庭监测的医疗设备使用七段式显示器向患者显示记录的测量值。我们的目标是通过智能手机摄像头在现实场景中拍摄的图像,准确检测和读取这些医疗设备上显示的七段数字。为了训练和测试数字分类器,开发了一个七段数字的合成数据集。还创建了一个包含使用各种智能手机摄像头拍摄的血糖仪和血压监测仪真实图像的数据集。在从医疗设备图像中手动提取的七段数字数据集上对数字分类器进行了评估。这些数据集及其开发代码已公开。所开发的算法首先使用带有两个双边滤波器的视网膜算法和自适应直方图均衡化对输入图像进行预处理。随后,通过两种并行操作的技术在图像中自动定位数字段:最大稳定极值区域(MSER)和二值化图像的连通分量。然后设计了一种滤波和聚类算法,将数字段组合成七段数字。使用定向梯度直方图(HOG)特征集和在合成数字上训练的神经网络对得到的数字进行分类。该模型在医疗设备上找到的数字上达到了93%的准确率。数字定位算法在血糖仪和血压监测仪图像上分别达到了0.87和0.80的F1分数。对设备上数字的位置几乎没有做任何假设,因此所提出的算法可以很容易地在新设备上实现。