Suppr超能文献

基于深度结构化语义模型的傅里叶变换红外光谱检测导管原位癌的分类模型。

A classification model for detection of ductal carcinoma in situ by Fourier transform infrared spectroscopy based on deep structured semantic model.

机构信息

School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing, 100876, China.

Department of Breast Center, Peking University People's Hospital, Beijing, 100044, China.

出版信息

Anal Chim Acta. 2023 Apr 22;1251:340991. doi: 10.1016/j.aca.2023.340991. Epub 2023 Feb 17.

Abstract

At present, deep learning is widely used in spectral data processing. Deep learning requires a large amount of data for training, while the collection of biological serum spectra is limited by sample numbers and labor costs, so it is impractical to obtain a large amount of serum spectral data for disease detection. In this study, we propose a spectral classification model based on the deep structured semantic model (DSSM) and successfully apply it to Fourier Transform Infrared (FT-IR) spectroscopy for ductal carcinoma in situ (DCIS) detection. Compared with the traditional deep learning model, we match the spectral data into positive and negative pairs according to whether the spectra are from the same category. The DSSM structure is constructed by extracting features according to the spectral similarity of spectra pairs. This new construction model increases the data amount used for model training and reduces the dimension of spectral data. Firstly, the FT-IR spectra are paired. The spectra pairs are labeled as positive pairs if they come from the same category, and the spectra pairs are labeled as negative pairs if they come from different categories. Secondly, two spectra in each spectra pair are put into two deep neural networks of the DSSM structure separately. Then the spectral similarity between the output feature maps of two deep neural networks is calculated. The DSSM structure is trained by maximizing the conditional likelihood of the spectra pairs from the same category. Thirdly, after the training of DSSM is done, the training set and testing set are input into two deep neural networks separately. The output feature maps of the training set are put into the reference library. Lastly, the k-nearest neighbor (KNN) model is used for classification according to Euclidean distances between the output feature map of each unknown sample to the reference library. The category of the unknown sample is judged according to the categories of k nearest samples. We also use principal component analysis (PCA) to reduce dimension for comparison. The accuracies of the KNN model, principal component analysis-k nearest neighbor (PCA-KNN) model, and deep structured semantic model-k nearest neighbor (DSSM-KNN) model are 78.8%, 72.7%, and 97.0%, which proves that our proposed model has higher accuracy.

摘要

目前,深度学习在光谱数据处理中得到了广泛应用。深度学习需要大量的数据进行训练,而生物血清光谱的采集受到样本数量和劳动力成本的限制,因此获得大量用于疾病检测的血清光谱数据是不切实际的。在这项研究中,我们提出了一种基于深度结构化语义模型(DSSM)的光谱分类模型,并成功地将其应用于傅里叶变换红外(FT-IR)光谱,用于导管原位癌(DCIS)的检测。与传统的深度学习模型相比,我们根据光谱是否来自同一类别,将光谱数据匹配成正例和负例对。DSSM 结构是通过根据光谱对的光谱相似性提取特征来构建的。这种新的构建模型增加了用于模型训练的数据量,并降低了光谱数据的维度。首先,将 FT-IR 光谱进行配对。如果光谱来自同一类别,则将光谱对标记为正例;如果光谱来自不同类别,则将光谱对标记为负例。其次,将每个光谱对中的两个光谱分别放入 DSSM 结构的两个深度神经网络中。然后计算两个深度神经网络的输出特征图之间的光谱相似性。通过最大化来自同一类别的光谱对的条件似然来训练 DSSM 结构。第三,完成 DSSM 的训练后,将训练集和测试集分别输入两个深度神经网络。将训练集的输出特征图放入参考库中。最后,根据每个未知样本的输出特征图到参考库的欧几里得距离,使用 K 最近邻(KNN)模型进行分类。根据 K 个最近样本的类别判断未知样本的类别。我们还使用主成分分析(PCA)进行降维比较。KNN 模型、主成分分析-K 最近邻(PCA-KNN)模型和深度结构化语义模型-K 最近邻(DSSM-KNN)模型的准确率分别为 78.8%、72.7%和 97.0%,证明了我们提出的模型具有更高的准确率。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验