A classification model for detection of ductal carcinoma in situ by Fourier transform infrared spectroscopy based on deep structured semantic model.

Affiliations

School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing, 100876, China.

Department of Breast Center, Peking University People's Hospital, Beijing, 100044, China.

Publication information

Anal Chim Acta. 2023 Apr 22;1251:340991. doi: 10.1016/j.aca.2023.340991. Epub 2023 Feb 17.

DOI:10.1016/j.aca.2023.340991

PMID:36925283

Abstract

Deep learning is now widely used in spectral data processing, but it requires a large amount of training data. The collection of biological serum spectra is limited by sample numbers and labor costs, so obtaining a large amount of serum spectral data for disease detection is impractical. In this study, we propose a spectral classification model based on the deep structured semantic model (DSSM) and successfully apply it to Fourier transform infrared (FT-IR) spectroscopy for the detection of ductal carcinoma in situ (DCIS). Unlike a traditional deep learning model, our approach matches the spectral data into positive and negative pairs according to whether the two spectra come from the same category. The DSSM structure extracts features according to the spectral similarity of the spectra pairs. This construction increases the amount of data available for model training and reduces the dimension of the spectral data.

First, the FT-IR spectra are paired. A pair is labeled positive if its two spectra come from the same category and negative if they come from different categories. Second, the two spectra in each pair are fed separately into the two deep neural networks of the DSSM structure, and the spectral similarity between the output feature maps of the two networks is calculated. The DSSM structure is trained by maximizing the conditional likelihood of the pairs from the same category. Third, once training is complete, the training set and the testing set are input into the two deep neural networks separately, and the output feature maps of the training set are stored in a reference library. Finally, a k-nearest neighbor (KNN) model classifies each unknown sample according to the Euclidean distances between its output feature map and the entries in the reference library; the category of the unknown sample is decided by the categories of its k nearest samples.

We also use principal component analysis (PCA) for dimension reduction as a comparison. The accuracies of the KNN model, the principal component analysis-k nearest neighbor (PCA-KNN) model, and the deep structured semantic model-k nearest neighbor (DSSM-KNN) model are 78.8%, 72.7%, and 97.0%, respectively, indicating that the proposed model achieves higher accuracy.
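
The pairing, similarity, and KNN steps described in the abstract can be illustrated with a minimal sketch in plain Python. It is not the authors' implementation: the function names, the toy 2-D feature vectors, the class labels, and the smoothing factor `gamma` are all assumptions made for illustration. The abstract does not state which similarity measure is used, so the sketch assumes the cosine similarity and softmax likelihood of the original DSSM formulation, and it omits the deep neural networks themselves. It does show why pairing enlarges the training set: n labeled spectra yield n(n-1)/2 unordered pairs.

```python
import math
from collections import Counter
from itertools import combinations


def make_pairs(labels):
    """Return (i, j, is_positive) for every unordered pair of sample indices."""
    return [(i, j, labels[i] == labels[j])
            for i, j in combinations(range(len(labels)), 2)]


def cosine(u, v):
    """Cosine similarity between two feature vectors (assumed similarity measure)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def positive_pair_nll(anchor, positive, negatives, gamma=10.0):
    """Negative log-likelihood of the positive partner under a softmax over
    scaled cosine similarities. Minimizing it maximizes the conditional
    likelihood of the same-category pair. gamma is an assumed hyperparameter."""
    scores = [gamma * cosine(anchor, positive)]
    scores += [gamma * cosine(anchor, n) for n in negatives]
    m = max(scores)  # subtract the maximum for numerical stability
    log_z = m + math.log(sum(math.exp(s - m) for s in scores))
    return log_z - scores[0]


def knn_predict(query, reference, ref_labels, k=3):
    """Majority vote among the k reference features nearest in Euclidean distance."""
    order = sorted(range(len(reference)),
                   key=lambda i: math.dist(query, reference[i]))
    votes = Counter(ref_labels[i] for i in order[:k])
    return votes.most_common(1)[0][0]


# Made-up 2-D features standing in for the network outputs of the training set;
# together with their labels they play the role of the reference library.
reference = [(1.0, 0.1), (0.9, 0.2), (0.8, 0.0),
             (0.1, 1.0), (0.2, 0.9), (0.0, 0.8)]
ref_labels = ["DCIS", "DCIS", "DCIS", "non-DCIS", "non-DCIS", "non-DCIS"]

pairs = make_pairs(ref_labels)
n_pos = sum(1 for _, _, same in pairs if same)
print(len(pairs), n_pos, len(pairs) - n_pos)  # 15 6 9
print(knn_predict((0.85, 0.15), reference, ref_labels, k=3))  # DCIS
```

In this toy example six samples produce fifteen training pairs, six positive and nine negative. In a real pipeline the reference features would come from the trained network rather than being written by hand, and k would be chosen by validation.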

Similar articles

Classification of multicategory edible fungi based on the infrared spectra of caps and stalks.

PLoS One. 2020 Aug 24;15(8):e0238149. doi: 10.1371/journal.pone.0238149. eCollection 2020.

Breast cancer early detection by using Fourier-transform infrared spectroscopy combined with different classification algorithms.

Spectrochim Acta A Mol Biomol Spectrosc. 2022 Dec 15;283:121715. doi: 10.1016/j.saa.2022.121715. Epub 2022 Aug 5.

A novel diagnostic method: FT-IR, Raman and derivative spectroscopy fusion technology for the rapid diagnosis of renal cell carcinoma serum.

Spectrochim Acta A Mol Biomol Spectrosc. 2022 Mar 15;269:120684. doi: 10.1016/j.saa.2021.120684. Epub 2021 Dec 14.

Discrimination of and Its Related Species Using IR Spectroscopy Combined with Feature Selection and Stacked Generalization.

Molecules. 2020 Mar 23;25(6):1442. doi: 10.3390/molecules25061442.

Diagnosis and monitoring of hepatocellular carcinoma in Hepatitis C virus patients using attenuated total reflection Fourier transform infrared spectroscopy.

Photodiagnosis Photodyn Ther. 2023 Sep;43:103677. doi: 10.1016/j.pdpdt.2023.103677. Epub 2023 Jun 29.

Classification of select category A and B bacteria by Fourier transform infrared spectroscopy.

Appl Spectrosc. 2009 Jan;63(1):14-24. doi: 10.1366/000370209787169867.

Rapid differentiation of Listeria monocytogenes epidemic clones III and IV and their intact compared with heat-killed populations using Fourier transform infrared spectroscopy and chemometrics.

J Food Sci. 2014 Jun;79(6):M1189-96. doi: 10.1111/1750-3841.12475. Epub 2014 May 6.

Rapid Diagnosis of Ductal Carcinoma In Situ and Breast Cancer Based on Raman Spectroscopy of Serum Combined with Convolutional Neural Network.

Bioengineering (Basel). 2023 Jan 4;10(1):65. doi: 10.3390/bioengineering10010065.

Colorectal Cancer and Colitis Diagnosis Using Fourier Transform Infrared Spectroscopy and an Improved K-Nearest-Neighbour Classifier.

Sensors (Basel). 2017 Nov 27;17(12):2739. doi: 10.3390/s17122739.
