IEEE J Biomed Health Inform. 2021 Jul;25(7):2521-2532. doi: 10.1109/JBHI.2020.3040551. Epub 2021 Jul 27.
In the wake of Big Data, traditional Machine Learning techniques are now often integrated in the clinical workflow. Despite more capable, Deep Learning methods are not equally accepted given their unsatiated need for great amounts of training data and transversal use of the same architectures in fundamentally different areas with weakly-substantiated adaptations. To address the former, a cardiorespiratory signal synthesizer was designed by conditional sampling from a multimodally trained stochastic system of Gaussian copulas integrated in a Markov chain. With respect to the latter, a multi-branch convolutional neural network architecture was conceived to learn the best cardiac sensor-fusion strategy at every abstraction layer. The network was tailored to the tasks of cycle detection and classification for different cardiac modality combinations by a synthesizer-based data augmentation training framework and Bayesian hyperparameter optimization. The synthesizer yielded highly realistic signals in the time, frequency and phase domains for both healthy and pathological heart cycles as well as artifacts of different modalities. Benchmarking suggested that the network is able to surpass previous architectures and data augmentation provided a performance boost in realistic data availability scenarios. These included insufficient training data volume, as low as 150 cycles long, artifact contamination and absence of a classification data type in training.
在大数据时代,传统的机器学习技术现在通常被整合到临床工作流程中。尽管深度学习方法更强大,但由于它们对大量训练数据的强烈需求以及在根本不同的领域中使用相同架构的横向应用,以及在适应性方面缺乏充分的依据,因此并未得到同等接受。为了解决前者,通过从集成在马尔可夫链中的高斯 Copula 多模态训练随机系统中进行条件采样,设计了一种心肺信号合成器。至于后者,设计了一种多分支卷积神经网络架构,以在每个抽象层学习最佳的心脏传感器融合策略。该网络通过基于合成器的数据增强训练框架和贝叶斯超参数优化,针对不同的心脏模态组合的循环检测和分类任务进行了定制。该合成器在时间、频率和相位域中为健康和病理心脏周期以及不同模态的伪影生成了高度逼真的信号。基准测试表明,该网络能够超越以前的架构,并且数据增强在现实数据可用性场景中提供了性能提升。这些场景包括训练数据量不足,低至 150 个周期长,伪影污染以及训练中没有分类数据类型。