• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于质谱实验中在线分数质量过滤和靶向前体碎片化的非线性分类。

Non-linear classification for on-the-fly fractional mass filtering and targeted precursor fragmentation in mass spectrometry experiments.

机构信息

Proteomics Center, Children's Hospital Boston, Boston, MA, USA.

出版信息

Bioinformatics. 2010 Mar 15;26(6):791-7. doi: 10.1093/bioinformatics/btq036. Epub 2010 Feb 4.

DOI:10.1093/bioinformatics/btq036
PMID:20134030
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3202308/
Abstract

MOTIVATION

Mass spectrometry (MS) has become the method of choice for protein/peptide sequence and modification analysis. The technology employs a two-step approach: ionized peptide precursor masses are detected, selected for fragmentation, and the fragment mass spectra are collected for computational analysis. Current precursor selection schemes are based on data- or information-dependent acquisition (DDA/IDA), where fragmentation mass candidates are selected by intensity and are subsequently included in a dynamic exclusion list to avoid constant refragmentation of highly abundant species. DDA/IDA methods do not exploit valuable information that is contained in the fractional mass of high-accuracy precursor mass measurements delivered by current instrumentation.

RESULTS

We extend previous contributions that suggest that fractional mass information allows targeted fragmentation of analytes of interest. We introduce a non-linear Random Forest classification and a discrete mapping approach, which can be trained to discriminate among arbitrary fractional mass patterns for an arbitrary number of classes of analytes. These methods can be used to increase fragmentation efficiency for specific subsets of analytes or to select suitable fragmentation technologies on-the-fly. We show that theoretical generalization error estimates transfer into practical application, and that their quality depends on the accuracy of prior distribution estimate of the analyte classes. The methods are applied to two real-world proteomics datasets.

AVAILABILITY

All software used in this study is available from http://software.steenlab.org/fmf

CONTACT

hanno.steen@childrens.harvard.edu

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

质谱(MS)已成为蛋白质/肽序列和修饰分析的首选方法。该技术采用两步法:检测离子化肽前体质量,选择进行碎片化,收集碎片质谱进行计算分析。目前的前体选择方案基于数据或信息依赖获取(DDA/IDA),其中通过强度选择碎片化质量候选物,并随后将其包含在动态排除列表中以避免高度丰富物种的不断重碎片化。DDA/IDA 方法没有利用当前仪器提供的高精度前体质量测量中包含的有价值信息。

结果

我们扩展了先前的贡献,表明分数质量信息允许对感兴趣的分析物进行靶向碎片化。我们引入了非线性随机森林分类和离散映射方法,这些方法可以经过训练以区分任意数量的分析物类别的任意分数质量模式。这些方法可用于提高特定分析物子集的碎片化效率,或实时选择合适的碎片化技术。我们表明,理论泛化误差估计可转化为实际应用,并且其质量取决于分析物类别的先验分布估计的准确性。该方法应用于两个真实的蛋白质组学数据集。

可用性

本研究中使用的所有软件均可从 http://software.steenlab.org/fmf 获得。

联系人

hanno.steen@childrens.harvard.edu

补充信息

补充数据可在 Bioinformatics 在线获得。

相似文献

1
Non-linear classification for on-the-fly fractional mass filtering and targeted precursor fragmentation in mass spectrometry experiments.用于质谱实验中在线分数质量过滤和靶向前体碎片化的非线性分类。
Bioinformatics. 2010 Mar 15;26(6):791-7. doi: 10.1093/bioinformatics/btq036. Epub 2010 Feb 4.
2
Multiplexed and data-independent tandem mass spectrometry for global proteome profiling.多重和数据非依赖的串联质谱法用于全局蛋白质组分析。
Mass Spectrom Rev. 2014 Nov-Dec;33(6):452-70. doi: 10.1002/mas.21400. Epub 2013 Nov 26.
3
libfbi: a C++ implementation for fast box intersection and application to sparse mass spectrometry data.libfbi:用于快速框交集的 C++ 实现及其在稀疏质谱数据中的应用。
Bioinformatics. 2011 Apr 15;27(8):1166-7. doi: 10.1093/bioinformatics/btr084. Epub 2011 Feb 16.
4
Qupe--a Rich Internet Application to take a step forward in the analysis of mass spectrometry-based quantitative proteomics experiments.Qupe--一种在基于质谱的定量蛋白质组学实验分析中向前迈进的富互联网应用程序。
Bioinformatics. 2009 Dec 1;25(23):3128-34. doi: 10.1093/bioinformatics/btp568. Epub 2009 Oct 6.
5
MSAcquisitionSimulator: data-dependent acquisition simulator for LC-MS shotgun proteomics.MS采集模拟器:用于液相色谱-质谱鸟枪法蛋白质组学的数据依赖型采集模拟器。
Bioinformatics. 2016 Apr 15;32(8):1269-71. doi: 10.1093/bioinformatics/btv745. Epub 2015 Dec 17.
6
MS2Planner: improved fragmentation spectra coverage in untargeted mass spectrometry by iterative optimized data acquisition.MS2Planner:通过迭代优化的数据采集提高非靶向质谱中碎片谱的覆盖度。
Bioinformatics. 2021 Jul 12;37(Suppl_1):i231-i236. doi: 10.1093/bioinformatics/btab279.
7
Improving protein and proteome coverage through data-independent multiplexed peptide fragmentation.通过数据非依赖型多重肽片段化提高蛋白质和蛋白质组覆盖度。
J Proteome Res. 2010 Jul 2;9(7):3621-37. doi: 10.1021/pr100144z.
8
Processing strategies and software solutions for data-independent acquisition in mass spectrometry.质谱中数据非依赖采集的处理策略与软件解决方案
Proteomics. 2015 Mar;15(5-6):964-80. doi: 10.1002/pmic.201400323. Epub 2015 Feb 2.
9
MultiAlign: a multiple LC-MS analysis tool for targeted omics analysis.MultiAlign:一种用于靶向组学分析的多重 LC-MS 分析工具。
BMC Bioinformatics. 2013 Feb 12;14:49. doi: 10.1186/1471-2105-14-49.
10
Mesh Fragmentation Improves Dissociation Efficiency in Top-down Proteomics.网格碎片化可提高自上而下蛋白质组学中的解离效率。
J Am Soc Mass Spectrom. 2021 Jun 2;32(6):1319-1325. doi: 10.1021/jasms.0c00462. Epub 2021 Mar 23.

引用本文的文献

1
Intelligence Algorithms for Protein Classification by Mass Spectrometry.基于质谱的蛋白质分类智能算法。
Biomed Res Int. 2018 Nov 11;2018:2862458. doi: 10.1155/2018/2862458. eCollection 2018.
2
Untargeted, spectral library-free analysis of data-independent acquisition proteomics data generated using Orbitrap mass spectrometers.使用轨道阱质谱仪生成的数据非依赖型采集蛋白质组学数据的非靶向、无谱库分析。
Proteomics. 2016 Aug;16(15-16):2257-71. doi: 10.1002/pmic.201500526. Epub 2016 Jul 22.
3
Towards automated discrimination of lipids versus peptides from full scan mass spectra.基于全扫描质谱对脂质与肽段进行自动鉴别
EuPA Open Proteom. 2014 Sep 1;4:87-100. doi: 10.1016/j.euprot.2014.05.002.
4
Feature selection and classification of leukocytes using random forest.使用随机森林对白细胞进行特征选择和分类。
Med Biol Eng Comput. 2014 Dec;52(12):1041-52. doi: 10.1007/s11517-014-1200-8. Epub 2014 Oct 5.
5
A classifier based on accurate mass measurements to aid large scale, unbiased glycoproteomics.基于精确质量测量的分类器,辅助大规模、无偏的糖蛋白质组学研究。
Mol Cell Proteomics. 2013 Apr;12(4):1017-25. doi: 10.1074/mcp.M112.025494. Epub 2013 Feb 25.
6
The use of classification trees for bioinformatics.分类树在生物信息学中的应用。
Wiley Interdiscip Rev Data Min Knowl Discov. 2011 Jan;1(1):55-63. doi: 10.1002/widm.14. Epub 2011 Jan 6.
7
Bioassay-directed fractionation for discovery of bioactive neutral lipids guided by relative mass defect filtering and multiplexed collision-induced dissociation.基于相对分子质量缺陷过滤和多重碰撞诱导解离的生物活性中性脂质导向生物测定分离发现
Rapid Commun Mass Spectrom. 2010 Dec 30;24(24):3578-84. doi: 10.1002/rcm.4796.

本文引用的文献

1
Improved detection of reactive metabolites with a bromine-containing glutathione analog using mass defect and isotope pattern matching.采用含溴谷胱甘肽类似物,通过质量亏损和同位素峰形匹配提高反应代谢物的检测能力。
Rapid Commun Mass Spectrom. 2010 May 15;24(9):1241-50. doi: 10.1002/rcm.4507.
2
Decon2LS: An open-source software package for automated processing and visualization of high resolution mass spectrometry data.Decon2LS:一个用于高分辨率质谱数据自动处理和可视化的开源软件包。
BMC Bioinformatics. 2009 Mar 17;10:87. doi: 10.1186/1471-2105-10-87.
3
Fractional mass filtering as a means to assess circulating metabolites in early human clinical studies.分数质量过滤作为一种在早期人体临床研究中评估循环代谢物的手段。
Rapid Commun Mass Spectrom. 2008 Nov;22(22):3510-6. doi: 10.1002/rcm.3758.
4
NITPICK: peak identification for mass spectrometry data.NITPICK:质谱数据的峰识别
BMC Bioinformatics. 2008 Aug 28;9:355. doi: 10.1186/1471-2105-9-355.
5
Mass defect profiles of biological matrices and the general applicability of mass defect filtering for metabolite detection.生物基质的质量亏损图谱及质量亏损过滤在代谢物检测中的普遍适用性。
Rapid Commun Mass Spectrom. 2008 Jul;22(13):2082-8. doi: 10.1002/rcm.3585.
6
Application of fractional mass for the identification of peptide-oligonucleotide cross-links by mass spectrometry.分数质量在通过质谱鉴定肽-寡核苷酸交联物中的应用。
J Mass Spectrom. 2008 Aug;43(8):1081-8. doi: 10.1002/jms.1391.
7
Statistical validation of peptide identifications in large-scale proteomics using the target-decoy database search strategy and flexible mixture modeling.使用目标-诱饵数据库搜索策略和灵活混合模型对大规模蛋白质组学中的肽段鉴定进行统计验证。
J Proteome Res. 2008 Jan;7(1):286-92. doi: 10.1021/pr7006818. Epub 2007 Dec 14.
8
MSE with mass defect filtering for in vitro and in vivo metabolite identification.用于体外和体内代谢物鉴定的具有质量缺陷过滤功能的质谱碎裂模式解析
Rapid Commun Mass Spectrom. 2007;21(9):1485-96. doi: 10.1002/rcm.2996.
9
NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins.美国国立生物技术信息中心参考序列(RefSeq):一个经过整理的基因组、转录本和蛋白质的非冗余序列数据库。
Nucleic Acids Res. 2007 Jan;35(Database issue):D61-5. doi: 10.1093/nar/gkl842. Epub 2006 Nov 27.
10
Strategy for the identification of sites of phosphorylation in proteins: neutral loss triggered electron capture dissociation.蛋白质中磷酸化位点鉴定策略:中性丢失触发电子捕获解离
Anal Chem. 2006 Nov 1;78(21):7563-9. doi: 10.1021/ac061331i.