Chavez Tanny, Vohra Nagma, Wu Jingxian, Bailey Keith, El-Shenawee Magda
Electrical Engineering Department, University of Arkansas, Fayetteville, AR 72701 USA.
University of Illinois at Urbana-Champaign, Veterinary Diagnostic Laboratory, Urbana, IL 61802.
IEEE Trans Terahertz Sci Technol. 2020 Mar;10(2):176-189. doi: 10.1109/tthz.2019.2962116. Epub 2019 Dec 24.
This paper proposes a new dimension reduction algorithm based on low-dimension ordered orthogonal projection (LOOP), which is used for cancer detection with terahertz (THz) images of freshly excised human breast cancer tissues. A THz image can be represented by a data cube with each pixel containing a high dimension spectrum vector covering several THz frequencies, where each frequency represents a different dimension in the vector. The proposed algorithm projects the high-dimension spectrum vector of each pixel within the THz image into a low-dimension subspace that contains the majority of the unique features embedded in the image. The low-dimension subspace is constructed by sequentially identifying its orthonormal basis vectors, such that each newly chosen basis vector represents the most unique information not contained by existing basis vectors. A multivariate Gaussian mixture model is used to represent the statistical distributions of the low-dimension feature vectors obtained from the proposed dimension reduction algorithm. The model parameters are iteratively learned by using unsupervised learning methods such as Markov chain Monte Carlo or expectation maximization, and the results are used to classify the various regions within a tumor sample. Experiment results demonstrate that the proposed method achieves apparent performance improvement in human breast cancer tissue over existing approaches such as one-dimension Markov chain Monte Carlo. The results confirm that the dimension reduction algorithm presented in this paper is a promising technique for breast cancer detection with THz images, and the classification results present a good correlation with respect to the histopathology results of the analyzed samples.
本文提出了一种基于低维有序正交投影(LOOP)的新降维算法,该算法用于利用新鲜切除的人类乳腺癌组织的太赫兹(THz)图像进行癌症检测。太赫兹图像可以由一个数据立方体表示,每个像素包含一个覆盖多个太赫兹频率的高维光谱向量,其中每个频率代表向量中的一个不同维度。所提出的算法将太赫兹图像内每个像素的高维光谱向量投影到一个低维子空间中,该子空间包含图像中嵌入的大部分独特特征。通过依次识别其正交归一基向量来构建低维子空间,使得每个新选择的基向量代表现有基向量所不包含的最独特信息。使用多元高斯混合模型来表示从所提出的降维算法获得的低维特征向量的统计分布。通过使用马尔可夫链蒙特卡罗或期望最大化等无监督学习方法迭代学习模型参数,并将结果用于对肿瘤样本内的各个区域进行分类。实验结果表明,与一维马尔可夫链蒙特卡罗等现有方法相比,所提出的方法在人类乳腺癌组织中实现了明显的性能提升。结果证实,本文提出的降维算法是一种利用太赫兹图像进行乳腺癌检测的有前途的技术,并且分类结果与所分析样本的组织病理学结果具有良好的相关性。