• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于分析肽气相化学的改进机器学习方法。

Improved machine learning method for analysis of gas phase chemistry of peptides.

作者信息

Gehrke Allison, Sun Shaojun, Kurgan Lukasz, Ahn Natalie, Resing Katheryn, Kafadar Karen, Cios Krzysztof

机构信息

Department of Computer Science and Engineering, University of Colorado at Denver, USA.

出版信息

BMC Bioinformatics. 2008 Dec 3;9:515. doi: 10.1186/1471-2105-9-515.

DOI:10.1186/1471-2105-9-515
PMID:19055745
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2612015/
Abstract

BACKGROUND

Accurate peptide identification is important to high-throughput proteomics analyses that use mass spectrometry. Search programs compare fragmentation spectra (MS/MS) of peptides from complex digests with theoretically derived spectra from a database of protein sequences. Improved discrimination is achieved with theoretical spectra that are based on simulating gas phase chemistry of the peptides, but the limited understanding of those processes affects the accuracy of predictions from theoretical spectra.

RESULTS

We employed a robust data mining strategy using new feature annotation functions of MAE software, which revealed under-prediction of the frequency of occurrence in fragmentation of the second peptide bond. We applied methods of exploratory data analysis to pre-process the information in the MS/MS spectra, including data normalization and attribute selection, to reduce the attributes to a smaller, less correlated set for machine learning studies. We then compared our rule building machine learning program, DataSqueezer, with commonly used association rules and decision tree algorithms. All used machine learning algorithms produced similar results that were consistent with expected properties for a second gas phase mechanism at the second peptide bond.

CONCLUSION

The results provide compelling evidence that we have identified underlying chemical properties in the data that suggest the existence of an additional gas phase mechanism for the second peptide bond. Thus, the methods described in this study provide a valuable approach for analyses of this kind in the future.

摘要

背景

准确的肽段鉴定对于使用质谱的高通量蛋白质组学分析至关重要。搜索程序将复杂消化产物中肽段的碎片谱(MS/MS)与蛋白质序列数据库中理论推导的谱进行比较。基于模拟肽段气相化学的理论谱能实现更好的区分,但对这些过程的有限理解影响了理论谱预测的准确性。

结果

我们采用了一种强大的数据挖掘策略,利用MAE软件的新特征注释功能,该功能揭示了第二个肽键断裂中出现频率的预测不足。我们应用探索性数据分析方法对MS/MS谱中的信息进行预处理,包括数据归一化和属性选择,以将属性减少到更小、相关性更低的集合用于机器学习研究。然后,我们将我们的规则构建机器学习程序DataSqueezer与常用的关联规则和决策树算法进行比较。所有使用的机器学习算法都产生了相似的结果,这些结果与第二个肽键处第二种气相机制的预期特性一致。

结论

结果提供了令人信服的证据,表明我们已经在数据中识别出潜在的化学性质,这表明第二个肽键存在额外的气相机制。因此,本研究中描述的方法为未来此类分析提供了一种有价值的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4185/2612015/2a548b32e6bb/1471-2105-9-515-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4185/2612015/6f65e9ad55b6/1471-2105-9-515-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4185/2612015/4352e1cdfdef/1471-2105-9-515-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4185/2612015/d096fe87f4b0/1471-2105-9-515-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4185/2612015/6636071202aa/1471-2105-9-515-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4185/2612015/68baeb420db1/1471-2105-9-515-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4185/2612015/cd04a67de7cf/1471-2105-9-515-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4185/2612015/2a548b32e6bb/1471-2105-9-515-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4185/2612015/6f65e9ad55b6/1471-2105-9-515-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4185/2612015/4352e1cdfdef/1471-2105-9-515-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4185/2612015/d096fe87f4b0/1471-2105-9-515-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4185/2612015/6636071202aa/1471-2105-9-515-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4185/2612015/68baeb420db1/1471-2105-9-515-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4185/2612015/cd04a67de7cf/1471-2105-9-515-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4185/2612015/2a548b32e6bb/1471-2105-9-515-7.jpg

相似文献

1
Improved machine learning method for analysis of gas phase chemistry of peptides.用于分析肽气相化学的改进机器学习方法。
BMC Bioinformatics. 2008 Dec 3;9:515. doi: 10.1186/1471-2105-9-515.
2
A machine learning approach to explore the spectra intensity pattern of peptides using tandem mass spectrometry data.一种利用串联质谱数据探索肽段光谱强度模式的机器学习方法。
BMC Bioinformatics. 2008 Jul 30;9:325. doi: 10.1186/1471-2105-9-325.
3
Improved validation of peptide MS/MS assignments using spectral intensity prediction.使用光谱强度预测改进肽段串联质谱(MS/MS)鉴定的验证
Mol Cell Proteomics. 2007 Jan;6(1):1-17. doi: 10.1074/mcp.M600320-MCP200. Epub 2006 Oct 2.
4
MUMAL2: Improving sensitivity in shotgun proteomics using cost sensitive artificial neural networks and a threshold selector algorithm.MUMAL2:使用成本敏感型人工神经网络和阈值选择算法提高鸟枪法蛋白质组学的灵敏度
BMC Bioinformatics. 2016 Dec 15;17(Suppl 18):472. doi: 10.1186/s12859-016-1341-x.
5
A machine learning approach to predicting peptide fragmentation spectra.一种用于预测肽段碎裂谱的机器学习方法。
Pac Symp Biocomput. 2006:219-30.
6
Improving Peptide-Spectrum Matching by Fragmentation Prediction Using Hidden Markov Models.利用隐马尔可夫模型进行碎片预测提高肽谱匹配。
J Proteome Res. 2019 Jun 7;18(6):2385-2396. doi: 10.1021/acs.jproteome.8b00499. Epub 2019 May 22.
7
Peak intensity prediction in MALDI-TOF mass spectrometry: a machine learning study to support quantitative proteomics.基质辅助激光解吸电离飞行时间质谱中的峰强度预测:一项支持定量蛋白质组学的机器学习研究。
BMC Bioinformatics. 2008 Oct 20;9:443. doi: 10.1186/1471-2105-9-443.
8
Support vector machines for improved peptide identification from tandem mass spectrometry database search.用于从串联质谱数据库搜索中改进肽段鉴定的支持向量机
Methods Mol Biol. 2009;492:453-60. doi: 10.1007/978-1-59745-493-3_28.
9
A cross-validation scheme for machine learning algorithms in shotgun proteomics. shotgun 蛋白质组学中机器学习算法的交叉验证方案。
BMC Bioinformatics. 2012;13 Suppl 16(Suppl 16):S3. doi: 10.1186/1471-2105-13-S16-S3. Epub 2012 Nov 5.
10
Preprocessing Tandem Mass Spectra Using Genetic Programming for Peptide Identification.基于遗传编程的串联质谱预处理在肽段鉴定中的应用。
J Am Soc Mass Spectrom. 2019 Jul;30(7):1294-1307. doi: 10.1007/s13361-019-02196-5. Epub 2019 Apr 25.

本文引用的文献

1
IRMPD spectroscopy shows that AGG forms an oxazolone b2+ ion.红外多光子解离光谱表明,AGG形成了一种恶唑酮b2+离子。
J Am Chem Soc. 2008 Dec 31;130(52):17644-5. doi: 10.1021/ja8067929.
2
Bifurcating fragmentation behavior of gas-phase tryptic peptide dications in collisional activation.碰撞激活中气相胰蛋白酶肽双阳离子的分叉碎裂行为
J Am Soc Mass Spectrom. 2008 Dec;19(12):1755-63. doi: 10.1016/j.jasms.2008.08.003. Epub 2008 Aug 9.
3
Improved validation of peptide MS/MS assignments using spectral intensity prediction.使用光谱强度预测改进肽段串联质谱(MS/MS)鉴定的验证
Mol Cell Proteomics. 2007 Jan;6(1):1-17. doi: 10.1074/mcp.M600320-MCP200. Epub 2006 Oct 2.
4
Peptide conformation in gas phase probed by collision-induced dissociation and its correlation to conformation in condensed phases.
J Am Soc Mass Spectrom. 2006 Jun;17(6):786-794. doi: 10.1016/j.jasms.2006.02.016. Epub 2006 Apr 3.
5
Highly scalable and robust rule learner: performance evaluation and comparison.高度可扩展且稳健的规则学习器:性能评估与比较
IEEE Trans Syst Man Cybern B Cybern. 2006 Feb;36(1):32-53. doi: 10.1109/tsmcb.2005.852983.
6
Prediction of low-energy collision-induced dissociation spectra of peptides with three or more charges.具有三个或更多电荷的肽的低能碰撞诱导解离光谱的预测
Anal Chem. 2005 Oct 1;77(19):6364-73. doi: 10.1021/ac050857k.
7
Mass spectrometry of peptides and proteins.肽和蛋白质的质谱分析。
Methods. 2005 Mar;35(3):211-22. doi: 10.1016/j.ymeth.2004.08.013. Epub 2005 Jan 20.
8
Fragmentation pathways of protonated peptides.质子化肽段的碎裂途径。
Mass Spectrom Rev. 2005 Jul-Aug;24(4):508-48. doi: 10.1002/mas.20024.
9
Prediction of low-energy collision-induced dissociation spectra of peptides.肽的低能量碰撞诱导解离光谱的预测。
Anal Chem. 2004 Jul 15;76(14):3908-22. doi: 10.1021/ac049951b.
10
Improving reproducibility and sensitivity in identifying human proteins by shotgun proteomics.通过鸟枪法蛋白质组学提高鉴定人类蛋白质的重现性和灵敏度。
Anal Chem. 2004 Jul 1;76(13):3556-68. doi: 10.1021/ac035229m.