Zhang Zhifei, Song Yang, Cui Haochen, Wu Jayne, Schwartz Fernando, Qi Hairong
IEEE Trans Biomed Eng. 2017 Sep;64(9):2288-2299. doi: 10.1109/TBME.2016.2634531. Epub 2016 Dec 2.
Bucking the trend of big data, in microdevice engineering, small sample size is common, especially when the device is still at the proof-of-concept stage. The small sample size, small interclass variation, and large intraclass variation, have brought biosignal analysis new challenges. Novel representation and classification approaches need to be developed to effectively recognize targets of interests with the absence of a large training set.
Moving away from the traditional signal analysis in the spatiotemporal domain, we exploit the biosignal representation in the topological domain that would reveal the intrinsic structure of point clouds generated from the biosignal. Additionally, we propose a Gaussian-based decision tree (GDT), which can efficiently classify the biosignals even when the sample size is extremely small.
This study is motivated by the application of mastitis detection using low-voltage alternating current electrokinetics (ACEK) where five categories of bisignals need to be recognized with only two samples in each class. Experimental results demonstrate the robustness of the topological features as well as the advantage of GDT over some conventional classifiers in handling small dataset.
Our method reduces the voltage of ACEK to a safe level and still yields high-fidelity results with a short assay time.
This paper makes two distinctive contributions to the field of biosignal analysis, including performing signal processing in the topological domain and handling extremely small dataset. Currently, there have been no related works that can efficiently tackle the dilemma between avoiding electrochemical reaction and accelerating assay process using ACEK.
与大数据趋势背道而驰的是,在微器件工程中,小样本量很常见,尤其是当器件仍处于概念验证阶段时。小样本量、小类间差异和大类内差异给生物信号分析带来了新的挑战。需要开发新的表示和分类方法,以便在没有大量训练集的情况下有效地识别感兴趣的目标。
我们摒弃了传统的时空域信号分析方法,转而利用拓扑域中的生物信号表示,这种表示将揭示由生物信号生成的点云的内在结构。此外,我们提出了一种基于高斯的决策树(GDT),即使样本量极小,它也能有效地对生物信号进行分类。
本研究的动机源于使用低压交流电动力学(ACEK)进行乳腺炎检测的应用,其中需要识别五类生物信号,每类仅有两个样本。实验结果证明了拓扑特征的稳健性以及GDT在处理小数据集方面优于一些传统分类器的优势。
我们的方法将ACEK的电压降低到安全水平,并且在短检测时间内仍能产生高保真结果。
本文在生物信号分析领域做出了两项独特贡献,包括在拓扑域中进行信号处理以及处理极小的数据集。目前,尚无相关工作能够有效解决使用ACEK时避免电化学反应与加速检测过程之间的困境。