Suppr
超能文献

ProteinLasso：一种在鸟枪法蛋白质组学中进行蛋白质推断问题的套索回归方法。

ProteinLasso: A Lasso regression approach to protein inference problem in shotgun proteomics.

机构信息

School of Software, Dalian University of Technology, China.

出版信息

Comput Biol Chem. 2013 Apr;43:46-54. doi: 10.1016/j.compbiolchem.2012.12.008. Epub 2013 Jan 12.

DOI:10.1016/j.compbiolchem.2012.12.008

PMID:23385215

Abstract

Protein inference is an important issue in proteomics research. Its main objective is to select a proper subset of candidate proteins that best explain the observed peptides. Although many methods have been proposed for solving this problem, several issues such as peptide degeneracy and one-hit wonders still remain unsolved. Therefore, the accurate identification of proteins that are truly present in the sample continues to be a challenging task. Based on the concept of peptide detectability, we formulate the protein inference problem as a constrained Lasso regression problem, which can be solved very efficiently through a coordinate descent procedure. The new inference algorithm is named as ProteinLasso, which explores an ensemble learning strategy to address the sparsity parameter selection problem in Lasso model. We test the performance of ProteinLasso on three datasets. As shown in the experimental results, ProteinLasso outperforms those state-of-the-art protein inference algorithms in terms of both identification accuracy and running efficiency. In addition, we show that ProteinLasso is stable under different parameter specifications. The source code of our algorithm is available at: http://sourceforge.net/projects/proteinlasso.

摘要

蛋白质推断是蛋白质组学研究中的一个重要问题。其主要目的是选择一个合适的候选蛋白质子集，以最好地解释观察到的肽。尽管已经提出了许多方法来解决这个问题，但肽简并性和一次性奇迹等几个问题仍然没有得到解决。因此，准确识别真正存在于样品中的蛋白质仍然是一项具有挑战性的任务。基于肽可检测性的概念，我们将蛋白质推断问题表述为一个受约束的套索回归问题，可以通过坐标下降过程非常有效地解决。新的推断算法命名为 ProteinLasso，它探索了一种集成学习策略来解决套索模型中稀疏参数选择问题。我们在三个数据集上测试了 ProteinLasso 的性能。实验结果表明，ProteinLasso 在识别准确性和运行效率方面均优于那些最先进的蛋白质推断算法。此外，我们表明 ProteinLasso 在不同的参数规范下是稳定的。我们的算法的源代码可在：http://sourceforge.net/projects/proteinlasso.

相似文献

ProteinLasso: A Lasso regression approach to protein inference problem in shotgun proteomics.

Comput Biol Chem. 2013 Apr;43:46-54. doi: 10.1016/j.compbiolchem.2012.12.008. Epub 2013 Jan 12.

A linear programming model for protein inference problem in shotgun proteomics.

Bioinformatics. 2012 Nov 15;28(22):2956-62. doi: 10.1093/bioinformatics/bts540. Epub 2012 Sep 6.

Advancement in protein inference from shotgun proteomics using peptide detectability.

Pac Symp Biocomput. 2007:409-20.

BagReg: Protein inference through machine learning.

Comput Biol Chem. 2015 Aug;57:12-20. doi: 10.1016/j.compbiolchem.2015.02.009. Epub 2015 Feb 7.

Protein inference: a review.

Brief Bioinform. 2012 Sep;13(5):586-614. doi: 10.1093/bib/bbs004. Epub 2012 Feb 28.

A combinatorial perspective of the protein inference problem.

IEEE/ACM Trans Comput Biol Bioinform. 2013 Nov-Dec;10(6):1542-7. doi: 10.1109/TCBB.2013.110.

Unifying protein inference and peptide identification with feedback to update consistency between peptides.

Proteomics. 2013 Jan;13(2):239-47. doi: 10.1002/pmic.201200338. Epub 2012 Dec 17.

Improved prediction of peptide detectability for targeted proteomics using a rank-based algorithm and organism-specific data.

J Proteomics. 2014 Aug 28;108:269-83. doi: 10.1016/j.jprot.2014.05.011. Epub 2014 May 27.

Decoy-free protein-level false discovery rate estimation.

Bioinformatics. 2014 Mar 1;30(5):675-81. doi: 10.1093/bioinformatics/btt431. Epub 2013 Aug 6.

Peptide sequence tag-based blind identification of post-translational modifications with point process model.

Bioinformatics. 2006 Jul 15;22(14):e307-13. doi: 10.1093/bioinformatics/btl226.

引用本文的文献

Evaluating nomogram models for predicting survival outcomes in gastric gastrointestinal stromal tumors with SEER database analysis.

Sci Rep. 2024 May 20;14(1):11494. doi: 10.1038/s41598-024-62353-z.

Implementing Artificial Intelligence and Digital Health in Resource-Limited Settings? Top 10 Lessons We Learned in Congenital Heart Defects and Cardiology.

OMICS. 2020 May;24(5):264-277. doi: 10.1089/omi.2019.0142. Epub 2019 Oct 8.

Algorithms for Fitting the Constrained Lasso.

J Comput Graph Stat. 2018;27(4):861-871. doi: 10.1080/10618600.2018.1473777. Epub 2018 Aug 7.

DeepPep: Deep proteome inference from peptide profiles.

PLoS Comput Biol. 2017 Sep 5;13(9):e1005661. doi: 10.1371/journal.pcbi.1005661. eCollection 2017 Sep.

Protein abundances can distinguish between naturally-occurring and laboratory strains of Yersinia pestis, the causative agent of plague.

PLoS One. 2017 Aug 30;12(8):e0183478. doi: 10.1371/journal.pone.0183478. eCollection 2017.

PGCA: An algorithm to link protein groups created from MS/MS data.

PLoS One. 2017 May 31;12(5):e0177569. doi: 10.1371/journal.pone.0177569. eCollection 2017.

Selection of key sequence-based features for prediction of essential genes in 31 diverse bacterial species.

PLoS One. 2017 Mar 30;12(3):e0174638. doi: 10.1371/journal.pone.0174638. eCollection 2017.

Statistical approach to protein quantification.

Mol Cell Proteomics. 2014 Feb;13(2):666-77. doi: 10.1074/mcp.M112.025445. Epub 2013 Nov 19.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

ProteinLasso：一种在鸟枪法蛋白质组学中进行蛋白质推断问题的套索回归方法。

ProteinLasso: A Lasso regression approach to protein inference problem in shotgun proteomics.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译