Suppr超能文献

螺旋矩阵变换结合卷积神经网络算法用于基于基质辅助激光解吸电离飞行时间质谱的细菌鉴定

Helix Matrix Transformation Combined With Convolutional Neural Network Algorithm for Matrix-Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry-Based Bacterial Identification.

作者信息

Ling Jin, Li Gaomin, Shao Hong, Wang Hong, Yin Hongrui, Zhou Hu, Song Yufei, Chen Gang

机构信息

NMPA Key Laboratory for Quality Control of Therapeutic Monoclonal Antibodies, Shanghai Institute for Food and Drug Control, Shanghai, China.

Department of Biochemical Drugs and Biological Products, Shanghai Institute for Food and Drug Control, Shanghai, China.

出版信息

Front Microbiol. 2020 Nov 12;11:565434. doi: 10.3389/fmicb.2020.565434. eCollection 2020.

Abstract

Matrix-assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS) analysis is a rapid and reliable method for bacterial identification. Classification algorithms, as a critical part of the MALDI-TOF MS analysis approach, have been developed using both traditional algorithms and machine learning algorithms. In this study, a method that combined helix matrix transformation with a convolutional neural network (CNN) algorithm was presented for bacterial identification. A total of 14 bacterial species including 58 strains were selected to create an in-house MALDI-TOF MS spectrum dataset. The 1D array-type MALDI-TOF MS spectrum data were transformed through a helix matrix transformation into matrix-type data, which was fitted during the CNN training. Through the parameter optimization, the threshold for binarization was set as 16 and the final size of a matrix-type data was set as 25 × 25 to obtain a clean dataset with a small size. A CNN model with three convolutional layers was well trained using the dataset to predict bacterial species. The filter sizes for the three convolutional layers were 4, 8, and 16. The kernel size was three and the activation function was the rectified linear unit (ReLU). A back propagation neural network (BPNN) model was created without helix matrix transformation and a convolution layer to demonstrate whether the helix matrix transformation combined with CNN algorithm works better. The areas under the receiver operating characteristic (ROC) curve of the CNN and BPNN models were 0.98 and 0.87, respectively. The accuracies of the CNN and BPNN models were 97.78 ± 0.08 and 86.50 ± 0.01, respectively, with a significant statistical difference ( < 0.001). The results suggested that helix matrix transformation combined with the CNN algorithm enabled the feature extraction of the bacterial MALDI-TOF MS spectrum, which might be a proposed solution to identify bacterial species.

摘要

基质辅助激光解吸电离飞行时间质谱(MALDI-TOF MS)分析是一种快速且可靠的细菌鉴定方法。分类算法作为MALDI-TOF MS分析方法的关键部分,已通过传统算法和机器学习算法得以开发。在本研究中,提出了一种将螺旋矩阵变换与卷积神经网络(CNN)算法相结合的细菌鉴定方法。总共选择了包括58株菌株的14种细菌来创建一个内部MALDI-TOF MS光谱数据集。一维阵列型MALDI-TOF MS光谱数据通过螺旋矩阵变换转换为矩阵型数据,该数据在CNN训练期间进行拟合。通过参数优化,将二值化阈值设置为16,并将矩阵型数据的最终大小设置为25×25,以获得一个小尺寸的干净数据集。使用该数据集对具有三个卷积层的CNN模型进行了良好训练,以预测细菌种类。三个卷积层的滤波器大小分别为4、8和16。内核大小为3,激活函数为修正线性单元(ReLU)。创建了一个没有螺旋矩阵变换和卷积层的反向传播神经网络(BPNN)模型,以证明螺旋矩阵变换与CNN算法相结合是否效果更好。CNN和BPNN模型的受试者工作特征(ROC)曲线下面积分别为0.98和0.87。CNN和BPNN模型的准确率分别为97.78±0.08和86.50±0.01,具有显著的统计学差异(<0.001)。结果表明,螺旋矩阵变换与CNN算法相结合能够对细菌MALDI-TOF MS光谱进行特征提取,这可能是一种鉴定细菌种类的解决方案。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43d9/7693542/6edd6dd56c5f/fmicb-11-565434-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验