Biomedical Engineering Department, Duke University, Durham, NC 27708, USA.
Sensors (Basel). 2022 Oct 20;22(20):8016. doi: 10.3390/s22208016.
Digital clinical measures collected via various digital sensing technologies such as smartphones, smartwatches, wearables, and ingestible and implantable sensors are increasingly used by individuals and clinicians to capture the health outcomes or behavioral and physiological characteristics of individuals. Time series classification (TSC) is very commonly used for modeling digital clinical measures. While deep learning models for TSC are very common and powerful, there exist some fundamental challenges. This review presents the non-deep learning models that are commonly used for time series classification in biomedical applications that can achieve high performance. We performed a systematic review to characterize the techniques that are used in time series classification of digital clinical measures throughout all the stages of data processing and model building. We conducted a literature search on PubMed, as well as the Institute of Electrical and Electronics Engineers (IEEE), Web of Science, and SCOPUS databases using a range of search terms to retrieve peer-reviewed articles that report on the academic research about digital clinical measures from a five-year period between June 2016 and June 2021. We identified and categorized the research studies based on the types of classification algorithms and sensor input types. We found 452 papers in total from four different databases: PubMed, IEEE, Web of Science Database, and SCOPUS. After removing duplicates and irrelevant papers, 135 articles remained for detailed review and data extraction. Among these, engineered features using time series methods that were subsequently fed into widely used machine learning classifiers were the most commonly used technique, and also most frequently achieved the best performance metrics (77 out of 135 articles). Statistical modeling (24 out of 135 articles) algorithms were the second most common and also the second-best classification technique. In this review paper, summaries of the time series classification models and interpretation methods for biomedical applications are summarized and categorized. While high time series classification performance has been achieved in digital clinical, physiological, or biomedical measures, no standard benchmark datasets, modeling methods, or reporting methodology exist. There is no single widely used method for time series model development or feature interpretation, however many different methods have proven successful.
越来越多的个人和临床医生使用各种数字传感技术(如智能手机、智能手表、可穿戴设备以及可摄入和可植入传感器)收集数字临床测量值,以捕捉个人的健康结果或行为和生理特征。时间序列分类(TSC)是用于对数字临床测量值进行建模的常用方法。尽管 TSC 的深度学习模型非常常见且功能强大,但仍存在一些基本挑战。本综述介绍了在生物医学应用中用于时间序列分类的常用非深度学习模型,这些模型可以实现高性能。
我们进行了系统综述,以描述在数字临床测量值的时间序列分类的各个阶段(包括数据处理和模型构建阶段)中使用的技术。我们在 PubMed、电气和电子工程师协会(IEEE)、Web of Science 和 SCOPUS 数据库中使用了一系列搜索词进行文献检索,以检索报告 2016 年 6 月至 2021 年 6 月五年期间数字临床测量值的学术研究的同行评议文章。我们根据分类算法和传感器输入类型对研究进行了分类和分类。
我们从四个不同的数据库(PubMed、IEEE、Web of Science 数据库和 SCOPUS)共找到了 452 篇论文。去除重复项和不相关的论文后,剩下 135 篇论文进行详细审查和数据提取。其中,使用时间序列方法生成工程特征,然后将其输入到广泛使用的机器学习分类器中的技术最为常用,也是最常获得最佳性能指标的技术(135 篇文章中有 77 篇)。统计建模(135 篇文章中有 24 篇)算法是第二常见和第二好的分类技术。
在本综述论文中,总结了生物医学应用中的时间序列分类模型和解释方法,并进行了分类。虽然在数字临床、生理或生物医学测量中已经实现了较高的时间序列分类性能,但目前还没有标准的基准数据集、建模方法或报告方法。没有一种用于时间序列模型开发或特征解释的单一通用方法,但许多不同的方法已被证明是成功的。