A linear programming model for protein inference problem in shotgun proteomics.

Affiliation

School of Software, Dalian University of Technology, Dalian 116621, China.

Publication information

Bioinformatics. 2012 Nov 15;28(22):2956-62. doi: 10.1093/bioinformatics/bts540. Epub 2012 Sep 6.

DOI:10.1093/bioinformatics/bts540
PMID:22954624
Abstract

MOTIVATION

Assembling peptides identified from tandem mass spectra into a list of proteins, referred to as protein inference, is an important issue in shotgun proteomics. The objective of protein inference is to find a subset of proteins that are truly present in the sample. Although many methods have been proposed for protein inference, several issues such as peptide degeneracy still remain unsolved.

RESULTS

In this article, we present a linear programming model for protein inference. In this model, we use a transformation of the joint probability that each peptide/protein pair is present in the sample as the variable. Both the peptide probability and the protein probability can then be expressed as a formula in terms of a linear combination of these variables. Based on this simple fact, the protein inference problem is formulated as an optimization problem: minimize the number of proteins with non-zero probabilities, subject to the constraint that the difference between the calculated peptide probability and the peptide probability generated by peptide identification algorithms is less than some threshold. This model addresses the peptide degeneracy issue by rigorously forcing some joint probability variables involving degenerate peptides to be zero. The corresponding inference algorithm is named ProteinLP. We test the performance of ProteinLP on six datasets. Experimental results show that our method is competitive with state-of-the-art protein inference algorithms.
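To make the "simple fact" above concrete, here is a minimal Python sketch of how a transformed joint probability can turn peptide and protein probabilities into functions of linear sums. It is not the ProteinLP implementation. The transformation `z = -ln(1 - q)`, the independence (noisy-OR) assumption behind it, the toy probabilities, and all names (`joint`, `observed`, `EPSILON`) are assumptions made for illustration; the abstract does not spell out the exact transformation or constraint form.

```python
import math

# Hypothetical toy data: joint[(peptide, protein)] is the joint probability
# that the pair is present in the sample. All numbers are invented.
joint = {
    ("pep1", "protA"): 0.90,
    ("pep2", "protA"): 0.60,
    ("pep2", "protB"): 0.00,  # pep2 is degenerate; its protB variable is forced to zero
    ("pep3", "protC"): 0.80,
}

# Hypothetical peptide probabilities reported by a peptide identification tool.
observed = {"pep1": 0.90, "pep2": 0.62, "pep3": 0.80}
EPSILON = 0.05  # assumed tolerance on the peptide-probability mismatch


def to_variable(q):
    """Assumed transformation: z = -ln(1 - q), so z == 0 exactly when q == 0."""
    return -math.log(1.0 - q)


def from_linear_sum(total):
    """Map a sum of transformed variables back to a probability."""
    return 1.0 - math.exp(-total)


z = {pair: to_variable(q) for pair, q in joint.items()}


def peptide_probability(peptide):
    # Linear combination over all proteins paired with this peptide.
    return from_linear_sum(sum(v for (p, _), v in z.items() if p == peptide))


def protein_probability(protein):
    # Linear combination over all peptides paired with this protein.
    return from_linear_sum(sum(v for (_, r), v in z.items() if r == protein))


proteins = sorted({r for _, r in joint})

# Objective value of this assignment: proteins with non-zero probability.
nonzero_proteins = [r for r in proteins if protein_probability(r) > 0.0]

# Constraint check: calculated vs. observed peptide probability within EPSILON.
constraints_ok = all(
    abs(peptide_probability(p) - observed[p]) <= EPSILON for p in observed
)

if __name__ == "__main__":
    print("non-zero proteins:", nonzero_proteins)
    print("constraints satisfied:", constraints_ok)
```

The sketch only evaluates one candidate assignment of the variables. An actual solver would search over the `z` values to minimize the number of non-zero proteins, and that search is what the linear program in the paper addresses. The check here compares probabilities directly for readability, whereas a linear program would need the constraint written in terms of the transformed variables.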

AVAILABILITY

The source code of our algorithm is available at: https://sourceforge.net/projects/prolp/.

CONTACT

zyhe@dlut.edu.cn.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics Online.

Similar articles

1
A linear programming model for protein inference problem in shotgun proteomics.
Bioinformatics. 2012 Nov 15;28(22):2956-62. doi: 10.1093/bioinformatics/bts540. Epub 2012 Sep 6.
2
ProteinLasso: A Lasso regression approach to protein inference problem in shotgun proteomics.
Comput Biol Chem. 2013 Apr;43:46-54. doi: 10.1016/j.compbiolchem.2012.12.008. Epub 2013 Jan 12.
3
In-depth analysis of protein inference algorithms using multiple search engines and well-defined metrics.
J Proteomics. 2017 Jan 6;150:170-182. doi: 10.1016/j.jprot.2016.08.002. Epub 2016 Aug 4.
4
A combinatorial perspective of the protein inference problem.
IEEE/ACM Trans Comput Biol Bioinform. 2013 Nov-Dec;10(6):1542-7. doi: 10.1109/TCBB.2013.110.
5
MUMAL2: Improving sensitivity in shotgun proteomics using cost sensitive artificial neural networks and a threshold selector algorithm.
BMC Bioinformatics. 2016 Dec 15;17(Suppl 18):472. doi: 10.1186/s12859-016-1341-x.
6
Decoy-free protein-level false discovery rate estimation.
Bioinformatics. 2014 Mar 1;30(5):675-81. doi: 10.1093/bioinformatics/btt431. Epub 2013 Aug 6.
7
Protein inference: A protein quantification perspective.
Comput Biol Chem. 2016 Aug;63:21-29. doi: 10.1016/j.compbiolchem.2016.02.006. Epub 2016 Feb 13.
8
ProtyQuant: Comparing label-free shotgun proteomics datasets using accumulated peptide probabilities.
J Proteomics. 2021 Jan 6;230:103985. doi: 10.1016/j.jprot.2020.103985. Epub 2020 Sep 19.
9
Protein inference: a review.
Brief Bioinform. 2012 Sep;13(5):586-614. doi: 10.1093/bib/bbs004. Epub 2012 Feb 28.
10
BagReg: Protein inference through machine learning.
Comput Biol Chem. 2015 Aug;57:12-20. doi: 10.1016/j.compbiolchem.2015.02.009. Epub 2015 Feb 7.

Cited by

1
MetaLP: An integrative linear programming method for protein inference in metaproteomics.
PLoS Comput Biol. 2022 Oct 21;18(10):e1010603. doi: 10.1371/journal.pcbi.1010603. eCollection 2022 Oct.
2
An analysis of proteogenomics and how and when transcriptome-informed reduction of protein databases can enhance eukaryotic proteomics.
Genome Biol. 2022 Jun 20;23(1):132. doi: 10.1186/s13059-022-02701-2.
3
DeepPep: Deep proteome inference from peptide profiles.
PLoS Comput Biol. 2017 Sep 5;13(9):e1005661. doi: 10.1371/journal.pcbi.1005661. eCollection 2017 Sep.
4
PGCA: An algorithm to link protein groups created from MS/MS data.
PLoS One. 2017 May 31;12(5):e0177569. doi: 10.1371/journal.pone.0177569. eCollection 2017.
5
Concerning the accuracy of Fido and parameter choice.
Bioinformatics. 2013 Feb 1;29(3):412. doi: 10.1093/bioinformatics/bts687. Epub 2012 Nov 28.