Suppr超能文献

从老化牙科复合材料中洗脱单体的 LC-MS/MS 数据的逻辑回归分析:一种有监督的机器学习方法。

Logistic Regression Analysis of LC-MS/MS Data of Monomers Eluted from Aged Dental Composites: A Supervised Machine-Learning Approach.

机构信息

Chemistry, University of Illinois Chicago, Chicago, Illinois 60607, United States.

Materials and Environmental Engineering, University of Illinois Chicago, Chicago, Illinois 60607, United States.

出版信息

Anal Chem. 2023 Mar 28;95(12):5205-5213. doi: 10.1021/acs.analchem.2c04362. Epub 2023 Mar 14.

Abstract

Compound identification by database searching that matches experimental with library mass spectra is commonly used in mass spectrometric (MS) data analysis. Vendor software often outputs scores that represent the quality of each spectral match for the identified compounds. However, software-generated identification results can differ drastically depending on the initial search parameters. Machine learning is applied here to provide a statistical evaluation of software-generated compound identification results from experimental tandem MS data. This task was accomplished using the logistic regression algorithm to assign an identification probability value to each identified compound. Logistic regression is usually used for classification, but here it is used to generate identification probabilities without setting a threshold for classification. Liquid chromatography coupled with quadrupole-time-of-flight tandem MS was used to analyze the organic monomers leached from resin-based dental composites in a simulated oral environment. The collected tandem MS data were processed with vendor software, followed by statistical evaluation of these results using logistic regression. The assigned identification probability to each compound provides more confidence in identification beyond solely by database matching. A total of 21 distinct monomers were identified among all samples, including five intact monomers and chemical degradation products of bisphenol A glycidyl methacrylate (BisGMA), oligomers of bisphenol-A ethoxylate methacrylate (BisEMA), triethylene glycol dimethacrylate (TEGDMA), and urethane dimethacrylate (UDMA). The logistic regression model can be used to evaluate any database-matched liquid chromatography-tandem MS result by training a new model using analytical standards of compounds present in a chosen database and then generating identification probabilities for candidates from unknown data using the new model.

摘要

通过与实验库质谱相匹配的数据库搜索来鉴定化合物,这在质谱(MS)数据分析中被广泛应用。供应商软件通常会输出代表鉴定化合物每个光谱匹配质量的分数。然而,软件生成的鉴定结果可能会因初始搜索参数的不同而有很大的差异。在这里,我们应用机器学习为实验串联 MS 数据的软件生成的化合物鉴定结果提供统计评估。这个任务是使用逻辑回归算法来为每个鉴定的化合物分配一个鉴定概率值来完成的。逻辑回归通常用于分类,但在这里,它用于生成鉴定概率,而不设置分类的阈值。我们使用液相色谱-四极杆飞行时间串联质谱法分析了在模拟口腔环境中从树脂基牙科复合材料中浸出的有机单体。采集到的串联 MS 数据由供应商软件进行处理,然后使用逻辑回归对这些结果进行统计评估。为每个化合物分配的鉴定概率提供了比仅通过数据库匹配更高的鉴定置信度。在所有样品中总共鉴定出 21 种不同的单体,包括五种完整的单体和双酚 A 缩水甘油甲基丙烯酸酯(BisGMA)的化学降解产物、双酚 A 乙氧基化物甲基丙烯酸酯(BisEMA)的低聚物、三乙二醇二甲基丙烯酸酯(TEGDMA)和尿烷二甲基丙烯酸酯(UDMA)。逻辑回归模型可以通过使用在选定数据库中存在的化合物的分析标准来训练新模型,然后使用新模型为未知数据中的候选物生成鉴定概率,从而评估任何数据库匹配的液相色谱-串联 MS 结果。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验