• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SCFIA:一种用于 LC/MS 的统计对应特征识别算法。

SCFIA: a statistical corresponding feature identification algorithm for LC/MS.

机构信息

Department of Electrical and Computer Engineering, the University of Texas at San Antonio, One UTSA Circle, San Antonio, TX 78249, USA.

出版信息

BMC Bioinformatics. 2011 Nov 11;12:439. doi: 10.1186/1471-2105-12-439.

DOI:10.1186/1471-2105-12-439
PMID:22078262
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3233610/
Abstract

BACKGROUND

Identifying corresponding features (LC peaks registered by identical peptides) in multiple Liquid Chromatography/Mass Spectrometry (LC-MS) datasets plays a crucial role in the analysis of complex peptide or protein mixtures. Warping functions are commonly used to correct the mean of elution time shifts among LC-MS datasets, which cannot resolve the ambiguity of corresponding feature identification since elution time shifts are random. We propose a Statistical Corresponding Feature Identification Algorithm(SCFIA) based on both elution time shifts and peak shape correlations between corresponding features. SCFIA first trains a set of statistical models, and then, all candidate corresponding features are scored by the statistical models to find the maximum likelihood solution.

RESULTS

We test SCFIA on publicly available datasets. We first compare its performance with that of warping function based methods, and the results show significant improvements. The performance of SCFIA on replicates datasets and fractionated datasets is also evaluated. In both cases, the accuracy is above 90%, which is near optimal. Finally the coverage of SCFIA is evaluated, and it is shown that SCFIA can find corresponding features in multiple datasets for over 90% peptides identified by Tandem MS.

CONCLUSIONS

SCFIA can be used for accurate corresponding feature identification in LC-MS. We have shown that peak shape correlation can be used effectively for improving the accuracy. SCFIA provides high coverage in corresponding feature identification in multiple datasets, which serves the basis for integrating multiple LC-MS measurements for accurate peptide quantification.

摘要

背景

在分析复杂的肽或蛋白质混合物时,识别多个液相色谱/质谱(LC-MS)数据集之间的对应特征(通过相同的肽注册的 LC 峰)至关重要。扭曲函数通常用于校正 LC-MS 数据集之间的洗脱时间偏移的平均值,但由于洗脱时间偏移是随机的,因此无法解决对应特征识别的模糊性。我们提出了一种基于洗脱时间偏移和对应特征之间的峰形相关性的统计对应特征识别算法(SCFIA)。SCFIA 首先训练一组统计模型,然后通过统计模型对所有候选对应特征进行评分,以找到最大似然解。

结果

我们在公开可用的数据集上测试了 SCFIA。我们首先将其性能与基于扭曲函数的方法进行比较,结果表明有显著的改进。还评估了 SCFIA 在重复数据集和分馏数据集上的性能。在这两种情况下,准确率都在 90%以上,接近最优。最后评估了 SCFIA 的覆盖范围,结果表明 SCFIA 可以在多个数据集为超过 90%的通过串联 MS 鉴定的肽找到对应特征。

结论

SCFIA 可用于 LC-MS 中准确的对应特征识别。我们已经表明,峰形相关性可有效用于提高准确性。SCFIA 在多个数据集的对应特征识别中具有较高的覆盖率,为准确的肽定量整合多个 LC-MS 测量提供了基础。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5106/3233610/776c93b73b42/1471-2105-12-439-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5106/3233610/31e31d68c662/1471-2105-12-439-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5106/3233610/a0494bfa0442/1471-2105-12-439-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5106/3233610/8fac23bde98f/1471-2105-12-439-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5106/3233610/83067d31aa51/1471-2105-12-439-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5106/3233610/d3d53eb08607/1471-2105-12-439-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5106/3233610/26d34881e686/1471-2105-12-439-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5106/3233610/4aecb7865cb0/1471-2105-12-439-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5106/3233610/776c93b73b42/1471-2105-12-439-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5106/3233610/31e31d68c662/1471-2105-12-439-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5106/3233610/a0494bfa0442/1471-2105-12-439-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5106/3233610/8fac23bde98f/1471-2105-12-439-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5106/3233610/83067d31aa51/1471-2105-12-439-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5106/3233610/d3d53eb08607/1471-2105-12-439-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5106/3233610/26d34881e686/1471-2105-12-439-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5106/3233610/4aecb7865cb0/1471-2105-12-439-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5106/3233610/776c93b73b42/1471-2105-12-439-8.jpg

相似文献

1
SCFIA: a statistical corresponding feature identification algorithm for LC/MS.SCFIA:一种用于 LC/MS 的统计对应特征识别算法。
BMC Bioinformatics. 2011 Nov 11;12:439. doi: 10.1186/1471-2105-12-439.
2
Shape-based feature matching improves protein identification via LC-MS and tandem MS.基于形状的特征匹配通过液相色谱-质谱联用和串联质谱提高蛋白质鉴定水平。
J Comput Biol. 2011 Apr;18(4):547-57. doi: 10.1089/cmb.2010.0155. Epub 2011 Mar 21.
3
PeakLink: a new peptide peak linking method in LC-MS/MS using wavelet and SVM.PeakLink:一种基于小波和支持向量机的液相色谱-串联质谱中新的肽峰连接方法。
Bioinformatics. 2014 Sep 1;30(17):2464-70. doi: 10.1093/bioinformatics/btu299. Epub 2014 May 9.
4
ICPD-a new peak detection algorithm for LC/MS.ICPD-一种用于 LC/MS 的新峰检测算法。
BMC Genomics. 2010 Dec 1;11 Suppl 3(Suppl 3):S8. doi: 10.1186/1471-2164-11-S3-S8.
5
Robust algorithm for alignment of liquid chromatography-mass spectrometry analyses in an accurate mass and time tag data analysis pipeline.用于在精确质量和时间标签数据分析流程中对液相色谱-质谱分析进行校准的稳健算法。
Anal Chem. 2006 Nov 1;78(21):7397-409. doi: 10.1021/ac052197p.
6
Accurate LC peak boundary detection for ¹⁶O/¹⁸O labeled LC-MS data.准确检测¹⁶O/¹⁸O 标记 LC-MS 数据的 LC 峰边界。
PLoS One. 2013 Oct 7;8(10):e72951. doi: 10.1371/journal.pone.0072951. eCollection 2013.
7
Improving mass and liquid chromatography based identification of proteins using bayesian scoring.使用贝叶斯评分改进基于质谱和液相色谱的蛋白质鉴定
J Proteome Res. 2005 Nov-Dec;4(6):2174-84. doi: 10.1021/pr050251c.
8
MRCQuant- an accurate LC-MS relative isotopic quantification algorithm on TOF instruments.MRCQuant- 一种基于飞行时间仪器的精确 LC-MS 相对同位素定量算法。
BMC Bioinformatics. 2011 Mar 15;12:74. doi: 10.1186/1471-2105-12-74.
9
MZDASoft: a software architecture that enables large-scale comparison of protein expression levels over multiple samples based on liquid chromatography/tandem mass spectrometry.MZDASoft:一种软件架构,可基于液相色谱/串联质谱对多个样本的蛋白质表达水平进行大规模比较。
Rapid Commun Mass Spectrom. 2015 Oct 15;29(19):1841-8. doi: 10.1002/rcm.7272.
10
MultiAlign: a multiple LC-MS analysis tool for targeted omics analysis.MultiAlign:一种用于靶向组学分析的多重 LC-MS 分析工具。
BMC Bioinformatics. 2013 Feb 12;14:49. doi: 10.1186/1471-2105-14-49.

引用本文的文献

1
A matching algorithm with isotope distribution pattern in LC-MS based on support vector machine (SVM) learning model.一种基于支持向量机(SVM)学习模型的液相色谱-质谱联用(LC-MS)中同位素分布模式匹配算法。
RSC Adv. 2019 Sep 4;9(48):27874-27882. doi: 10.1039/c9ra03789f. eCollection 2019 Sep 3.
2
Quality control of imbalanced mass spectra from isotopic labeling experiments.同位素标记实验中不平衡质谱的质量控制。
BMC Bioinformatics. 2019 Nov 6;20(1):549. doi: 10.1186/s12859-019-3170-1.
3
Quantitative Proteomic Approach for MicroRNA Target Prediction Based on O/O Labeling.

本文引用的文献

1
SIMA: simultaneous multiple alignment of LC/MS peak lists.SIMA:LC/MS 峰列表的同时多重比对。
Bioinformatics. 2011 Apr 1;27(7):987-93. doi: 10.1093/bioinformatics/btr051. Epub 2011 Feb 3.
2
Andromeda: a peptide search engine integrated into the MaxQuant environment.Andromeda:集成到 MaxQuant 环境中的肽搜索引擎。
J Proteome Res. 2011 Apr 1;10(4):1794-805. doi: 10.1021/pr101065j. Epub 2011 Feb 22.
3
Super-SILAC mix for quantitative proteomics of human tumor tissue.用于人肿瘤组织定量蛋白质组学的超级 SILAC 混合物。
基于O/O标记的用于微小RNA靶标预测的定量蛋白质组学方法
Cancer Inform. 2016 Dec 8;14(Suppl 5):163-173. doi: 10.4137/CIN.S30563. eCollection 2015.
4
MZDASoft: a software architecture that enables large-scale comparison of protein expression levels over multiple samples based on liquid chromatography/tandem mass spectrometry.MZDASoft:一种软件架构,可基于液相色谱/串联质谱对多个样本的蛋白质表达水平进行大规模比较。
Rapid Commun Mass Spectrom. 2015 Oct 15;29(19):1841-8. doi: 10.1002/rcm.7272.
5
PeakLink: a new peptide peak linking method in LC-MS/MS using wavelet and SVM.PeakLink:一种基于小波和支持向量机的液相色谱-串联质谱中新的肽峰连接方法。
Bioinformatics. 2014 Sep 1;30(17):2464-70. doi: 10.1093/bioinformatics/btu299. Epub 2014 May 9.
Nat Methods. 2010 May;7(5):383-5. doi: 10.1038/nmeth.1446. Epub 2010 Apr 4.
4
A guided tour of the Trans-Proteomic Pipeline.《跨蛋白质组学分析流程指南》
Proteomics. 2010 Mar;10(6):1150-9. doi: 10.1002/pmic.200900375.
5
MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification.MaxQuant可实现高肽段鉴定率、个体化的百万分之一级质量精度以及全蛋白质组范围的蛋白质定量。
Nat Biotechnol. 2008 Dec;26(12):1367-72. doi: 10.1038/nbt.1511. Epub 2008 Nov 30.
6
Critical assessment of alignment procedures for LC-MS proteomics and metabolomics measurements.液相色谱-质谱联用蛋白质组学和代谢组学测量校准程序的批判性评估。
BMC Bioinformatics. 2008 Sep 15;9:375. doi: 10.1186/1471-2105-9-375.
7
OpenMS - an open-source software framework for mass spectrometry.OpenMS——一个用于质谱分析的开源软件框架。
BMC Bioinformatics. 2008 Mar 26;9:163. doi: 10.1186/1471-2105-9-163.
8
Alignment of LC-MS images, with applications to biomarker discovery and protein identification.液相色谱-质谱成像的比对及其在生物标志物发现和蛋白质鉴定中的应用。
Proteomics. 2008 Feb;8(4):650-72. doi: 10.1002/pmic.200700791.
9
Chromatographic alignment of LC-MS and LC-MS/MS datasets by genetic algorithm feature extraction.通过遗传算法特征提取实现液相色谱-质谱联用(LC-MS)和液相色谱-串联质谱联用(LC-MS/MS)数据集的色谱对齐。
J Am Soc Mass Spectrom. 2007 Oct;18(10):1835-43. doi: 10.1016/j.jasms.2007.07.018. Epub 2007 Jul 26.
10
Quantitative mass spectrometry in proteomics: a critical review.蛋白质组学中的定量质谱分析:批判性综述。
Anal Bioanal Chem. 2007 Oct;389(4):1017-31. doi: 10.1007/s00216-007-1486-6. Epub 2007 Aug 1.