• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SIMA:LC/MS 峰列表的同时多重比对。

SIMA: simultaneous multiple alignment of LC/MS peak lists.

机构信息

Interdisciplinary Center for Scientific Computing, University of Heidelberg, Heidelberg, Germany.

出版信息

Bioinformatics. 2011 Apr 1;27(7):987-93. doi: 10.1093/bioinformatics/btr051. Epub 2011 Feb 3.

DOI:10.1093/bioinformatics/btr051
PMID:21296750
Abstract

MOTIVATION

Alignment of multiple liquid chromatography/mass spectrometry (LC/MS) experiments is a necessity today, which arises from the need for biological and technical repeats. Due to limits in sampling frequency and poor reproducibility of retention times, current LC systems suffer from missing observations and non-linear distortions of the retention times across runs. Existing approaches for peak correspondence estimation focus almost exclusively on solving the pairwise alignment problem, yielding straightforward but suboptimal results for multiple alignment problems.

RESULTS

We propose SIMA, a novel automated procedure for alignment of peak lists from multiple LC/MS runs. SIMA combines hierarchical pairwise correspondence estimation with simultaneous alignment and global retention time correction. It employs a tailored multidimensional kernel function and a procedure based on maximum likelihood estimation to find the retention time distortion function that best fits the observed data. SIMA does not require a dedicated reference spectrum, is robust with regard to outliers, needs only two intuitive parameters and naturally incorporates incomplete correspondence information. In a comparison with seven alternative methods on four different datasets, we show that SIMA yields competitive and superior performance on real-world data.

AVAILABILITY

A C++ implementation of the SIMA algorithm is available from http://hci.iwr.uni-heidelberg.de/MIP/Software.

摘要

动机

当今,由于需要进行生物学和技术重复,因此必须对多个液相色谱/质谱(LC/MS)实验进行对齐。由于采样频率的限制和保留时间的重现性差,当前的 LC 系统存在观测值缺失和保留时间在运行过程中非线性失真的问题。现有的峰对应估计方法几乎完全专注于解决两两对齐问题,对于多次对齐问题,得到的结果直接但不是最优的。

结果

我们提出了 SIMA,这是一种用于对齐多个 LC/MS 运行的峰列表的新型自动化程序。SIMA 将层次化的成对对应估计与同时对齐和全局保留时间校正相结合。它采用了专门的多维核函数和基于最大似然估计的过程来找到最适合观察数据的保留时间失真函数。SIMA 不需要专用的参考光谱,对离群值具有鲁棒性,只需要两个直观的参数,并且自然地包含不完整的对应信息。在对四个不同数据集的七种替代方法进行比较时,我们表明 SIMA 在真实数据上具有竞争力和优越的性能。

可用性

SIMA 算法的 C++实现可从 http://hci.iwr.uni-heidelberg.de/MIP/Software 获得。

相似文献

1
SIMA: simultaneous multiple alignment of LC/MS peak lists.SIMA:LC/MS 峰列表的同时多重比对。
Bioinformatics. 2011 Apr 1;27(7):987-93. doi: 10.1093/bioinformatics/btr051. Epub 2011 Feb 3.
2
Retention time alignment algorithms for LC/MS data must consider non-linear shifts.用于液相色谱/质谱数据的保留时间校准算法必须考虑非线性偏移。
Bioinformatics. 2009 Mar 15;25(6):758-64. doi: 10.1093/bioinformatics/btp052. Epub 2009 Jan 28.
3
LC-MS alignment in theory and practice: a comprehensive algorithmic review.液相色谱-质谱联用的理论与实践中的比对:全面的算法综述
Brief Bioinform. 2015 Jan;16(1):104-17. doi: 10.1093/bib/bbt080. Epub 2013 Nov 21.
4
A geometric approach for the alignment of liquid chromatography-mass spectrometry data.一种用于液相色谱-质谱数据比对的几何方法。
Bioinformatics. 2007 Jul 1;23(13):i273-81. doi: 10.1093/bioinformatics/btm209.
5
Incorporating peak grouping information for alignment of multiple liquid chromatography-mass spectrometry datasets.整合峰分组信息以对齐多个液相色谱-质谱数据集。
Bioinformatics. 2015 Jun 15;31(12):1999-2006. doi: 10.1093/bioinformatics/btv072. Epub 2015 Feb 2.
6
Shape-based feature matching improves protein identification via LC-MS and tandem MS.基于形状的特征匹配通过液相色谱-质谱联用和串联质谱提高蛋白质鉴定水平。
J Comput Biol. 2011 Apr;18(4):547-57. doi: 10.1089/cmb.2010.0155. Epub 2011 Mar 21.
7
Graph-based peak alignment algorithms for multiple liquid chromatography-mass spectrometry datasets.基于图的多液相色谱-质谱数据集的峰对齐算法。
Bioinformatics. 2013 Oct 1;29(19):2469-76. doi: 10.1093/bioinformatics/btt435. Epub 2013 Jul 30.
8
MassUntangler: a novel alignment tool for label-free liquid chromatography-mass spectrometry proteomic data.MassUntangler:一种用于无标记液相色谱-质谱蛋白质组学数据的新型对齐工具。
J Chromatogr A. 2011 Dec 9;1218(49):8859-68. doi: 10.1016/j.chroma.2011.06.062. Epub 2011 Jun 22.
9
Robust algorithm for alignment of liquid chromatography-mass spectrometry analyses in an accurate mass and time tag data analysis pipeline.用于在精确质量和时间标签数据分析流程中对液相色谱-质谱分析进行校准的稳健算法。
Anal Chem. 2006 Nov 1;78(21):7397-409. doi: 10.1021/ac052197p.
10
Multi-profile Bayesian alignment model for LC-MS data analysis with integration of internal standards.多谱图贝叶斯对齐模型,用于结合内标进行 LC-MS 数据分析。
Bioinformatics. 2013 Nov 1;29(21):2774-80. doi: 10.1093/bioinformatics/btt461. Epub 2013 Sep 6.

引用本文的文献

1
Multiomics reveals glutathione metabolism as a driver of bimodality during stem cell aging.多组学揭示谷胱甘肽代谢是干细胞衰老过程中双峰性的驱动因素。
Cell Metab. 2023 Mar 7;35(3):472-486.e6. doi: 10.1016/j.cmet.2023.02.001. Epub 2023 Feb 27.
2
Alignstein: Optimal transport for improved LC-MS retention time alignment.Alignstein:用于改进 LC-MS 保留时间对齐的最优传输。
Gigascience. 2022 Nov 3;11. doi: 10.1093/gigascience/giac101.
3
A matching algorithm with isotope distribution pattern in LC-MS based on support vector machine (SVM) learning model.
一种基于支持向量机(SVM)学习模型的液相色谱-质谱联用(LC-MS)中同位素分布模式匹配算法。
RSC Adv. 2019 Sep 4;9(48):27874-27882. doi: 10.1039/c9ra03789f. eCollection 2019 Sep 3.
4
Finding Correspondence between Metabolomic Features in Untargeted Liquid Chromatography-Mass Spectrometry Metabolomics Datasets.在非靶向液相色谱-质谱代谢组学数据集之间寻找代谢特征的对应关系。
Anal Chem. 2022 Apr 12;94(14):5493-5503. doi: 10.1021/acs.analchem.1c03592. Epub 2022 Mar 31.
5
Comparative mass spectrometry-based metabolomics strategies for the investigation of microbial secondary metabolites.基于比较质谱的代谢组学策略用于微生物次级代谢产物的研究。
Nat Prod Rep. 2017 Jan 4;34(1):6-24. doi: 10.1039/c6np00048g.
6
MZDASoft: a software architecture that enables large-scale comparison of protein expression levels over multiple samples based on liquid chromatography/tandem mass spectrometry.MZDASoft:一种软件架构,可基于液相色谱/串联质谱对多个样本的蛋白质表达水平进行大规模比较。
Rapid Commun Mass Spectrom. 2015 Oct 15;29(19):1841-8. doi: 10.1002/rcm.7272.
7
Multivariate Analysis in Metabolomics.代谢组学中的多变量分析
Curr Metabolomics. 2013;1(1):92-107. doi: 10.2174/2213235X11301010092.
8
Incorporating peak grouping information for alignment of multiple liquid chromatography-mass spectrometry datasets.整合峰分组信息以对齐多个液相色谱-质谱数据集。
Bioinformatics. 2015 Jun 15;31(12):1999-2006. doi: 10.1093/bioinformatics/btv072. Epub 2015 Feb 2.
9
Shared functions of plant and mammalian StAR-related lipid transfer (START) domains in modulating transcription factor activity.植物和哺乳动物中与类固醇生成急性调节蛋白(StAR)相关的脂质转移(START)结构域在调节转录因子活性方面的共同功能。
BMC Biol. 2014 Aug 27;12:70. doi: 10.1186/s12915-014-0070-8.
10
LC-MS profiling of N-Glycans derived from human serum samples for biomarker discovery in hepatocellular carcinoma.用于肝细胞癌生物标志物发现的人血清样本中N-聚糖的液相色谱-质谱分析
J Proteome Res. 2014 Nov 7;13(11):4859-68. doi: 10.1021/pr500460k. Epub 2014 Aug 8.