• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用独特序列标签进行全蛋白质组范围的蛋白质及其修饰的鉴定,减少模糊性并提高错误发现率。

Proteome-wide identification of proteins and their modifications with decreased ambiguities and improved false discovery rates using unique sequence tags.

作者信息

Shen Yufeng, Tolić Nikola, Hixson Kim K, Purvine Samuel O, Pasa-Tolić Ljiljana, Qian Wei-Jun, Adkins Joshua N, Moore Ronald J, Smith Richard D

机构信息

Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, USA.

出版信息

Anal Chem. 2008 Mar 15;80(6):1871-82. doi: 10.1021/ac702328x. Epub 2008 Feb 14.

DOI:10.1021/ac702328x
PMID:18271604
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2600587/
Abstract

Identifying proteins and their modification states and with known levels of confidence remains as a significant challenge for proteomics. Random or decoy peptide databases are increasingly being used to estimate the false discovery rate (FDR), e.g., from liquid chromatography-tandem mass spectrometry (LC-MS/MS) analyses of tryptic digests. We show that this approach can significantly underestimate the FDR and describe an approach for more confident protein identifications that uses unique partial sequences derived from a combination of database searching and amino acid residue sequencing using high-accuracy MS/MS data. Applied to a Saccharomyces cerevisiae tryptic digest, the approach provided 3 132 confident peptide identifications ( approximately 5% modified in some fashion), covering 575 proteins with an estimated zero FDR. The conventional approach provided 3 359 peptide identifications and 656 proteins with 0.3% FDR based upon a decoy database analysis. However, the present approach revealed approximately 5% of the 3 359 identifications to be incorrect and many more as potentially ambiguous (e.g., due to not considering certain amino acid substitutions and modifications). In addition, 677 peptides and 39 proteins were identified that had been missed by conventional analysis, including nontryptic peptides, peptides with a variety of expected/unexpected chemical modifications, known/unknown post-translational modifications, single nucleotide polymorphisms or gene encoding errors, and multiple modifications of individual peptides.

摘要

识别蛋白质及其修饰状态并确定其可信度水平,仍然是蛋白质组学面临的重大挑战。随机或诱饵肽数据库越来越多地用于估计错误发现率(FDR),例如,从胰蛋白酶消化产物的液相色谱 - 串联质谱(LC-MS/MS)分析中进行估计。我们表明,这种方法可能会显著低估FDR,并描述了一种用于更可靠蛋白质鉴定的方法,该方法使用从数据库搜索和使用高精度MS/MS数据的氨基酸残基测序相结合中获得的独特部分序列。应用于酿酒酵母胰蛋白酶消化产物时,该方法提供了3132个可靠的肽段鉴定结果(约5%以某种方式被修饰),覆盖了575种蛋白质,估计错误发现率为零。基于诱饵数据库分析,传统方法提供了3359个肽段鉴定结果和656种蛋白质,错误发现率为0.3%。然而,目前的方法表明,3359个鉴定结果中约5%是错误的,还有更多结果可能存在歧义(例如,由于未考虑某些氨基酸替换和修饰)。此外,还鉴定出了677个肽段和39种蛋白质,这些是传统分析中遗漏的,包括非胰蛋白酶肽段、具有各种预期/意外化学修饰的肽段、已知/未知的翻译后修饰、单核苷酸多态性或基因编码错误,以及单个肽段的多种修饰。

相似文献

1
Proteome-wide identification of proteins and their modifications with decreased ambiguities and improved false discovery rates using unique sequence tags.使用独特序列标签进行全蛋白质组范围的蛋白质及其修饰的鉴定,减少模糊性并提高错误发现率。
Anal Chem. 2008 Mar 15;80(6):1871-82. doi: 10.1021/ac702328x. Epub 2008 Feb 14.
2
A peptide-retrieval strategy enables significant improvement of quantitative performance without compromising confidence of identification.肽段检索策略可在不影响鉴定置信度的情况下显著提高定量性能。
J Proteomics. 2017 Jan 30;152:276-282. doi: 10.1016/j.jprot.2016.11.020. Epub 2016 Nov 27.
3
Large Scale Mass Spectrometry-based Identifications of Enzyme-mediated Protein Methylation Are Subject to High False Discovery Rates.基于大规模质谱法对酶介导的蛋白质甲基化进行鉴定时,假阳性率很高。
Mol Cell Proteomics. 2016 Mar;15(3):989-1006. doi: 10.1074/mcp.M115.055384. Epub 2015 Dec 23.
4
Bioinformatics analysis of a Saccharomyces cerevisiae N-terminal proteome provides evidence of alternative translation initiation and post-translational N-terminal acetylation.真核生物酿酒酵母 N 端蛋白组的生物信息学分析提供了翻译起始的替代和翻译后 N 端乙酰化的证据。
J Proteome Res. 2011 Aug 5;10(8):3578-89. doi: 10.1021/pr2002325. Epub 2011 Jun 20.
5
Reverse and Random Decoy Methods for False Discovery Rate Estimation in High Mass Accuracy Peptide Spectral Library Searches.反转和随机诱饵方法在高质量精度肽谱库搜索中的假发现率估计。
J Proteome Res. 2018 Feb 2;17(2):846-857. doi: 10.1021/acs.jproteome.7b00614. Epub 2018 Jan 11.
6
High-throughput database search and large-scale negative polarity liquid chromatography-tandem mass spectrometry with ultraviolet photodissociation for complex proteomic samples.高通量数据库搜索和大规模负相液相色谱-串联质谱联用与紫外光解用于复杂蛋白质组样品。
Mol Cell Proteomics. 2013 Sep;12(9):2604-14. doi: 10.1074/mcp.O113.028258. Epub 2013 May 21.
7
De novo sequencing of unique sequence tags for discovery of post-translational modifications of proteins.用于发现蛋白质翻译后修饰的独特序列标签的从头测序。
Anal Chem. 2008 Oct 15;80(20):7742-54. doi: 10.1021/ac801123p. Epub 2008 Sep 11.
8
Evaluation of multidimensional chromatography coupled with tandem mass spectrometry (LC/LC-MS/MS) for large-scale protein analysis: the yeast proteome.多维色谱联用串联质谱法(LC/LC-MS/MS)用于大规模蛋白质分析的评估:酵母蛋白质组
J Proteome Res. 2003 Jan-Feb;2(1):43-50. doi: 10.1021/pr025556v.
9
Analysis of the acidic proteome with negative electron-transfer dissociation mass spectrometry.采用负离子电喷雾解析质谱法分析酸性蛋白质组。
Anal Chem. 2012 Mar 20;84(6):2875-82. doi: 10.1021/ac203430u. Epub 2012 Mar 1.
10
Combining De Novo Peptide Sequencing Algorithms, A Synergistic Approach to Boost Both Identifications and Confidence in Bottom-up Proteomics.结合从头肽序列算法,协同方法可提高下向蛋白质组学的鉴定数量和置信度。
J Proteome Res. 2017 Sep 1;16(9):3209-3218. doi: 10.1021/acs.jproteome.7b00198. Epub 2017 Aug 22.

引用本文的文献

1
π-PrimeNovo: an accurate and efficient non-autoregressive deep learning model for de novo peptide sequencing.π-PrimeNovo:一种用于从头肽测序的准确高效的非自回归深度学习模型。
Nat Commun. 2025 Jan 2;16(1):267. doi: 10.1038/s41467-024-55021-3.
2
Identification of Arginylated Proteins by Mass Spectrometry.通过质谱法鉴定精氨酰化蛋白质
Methods Mol Biol. 2023;2620:139-152. doi: 10.1007/978-1-0716-2942-0_19.
3
Peptidomics and Capillary Electrophoresis.肽组学与毛细管电泳。
Adv Exp Med Biol. 2021;1336:87-104. doi: 10.1007/978-3-030-77252-9_5.
4
Biochemical analysis of protein arginylation.蛋白质精氨酰化的生化分析
Methods Enzymol. 2019;626:89-113. doi: 10.1016/bs.mie.2019.07.028. Epub 2019 Aug 1.
5
Current peptidomics: applications, purification, identification, quantification, and functional analysis.当前的肽组学:应用、纯化、鉴定、定量及功能分析。
Proteomics. 2015 Mar;15(5-6):1026-38. doi: 10.1002/pmic.201400310. Epub 2015 Jan 21.
6
Overcoming species boundaries in peptide identification with Bayesian information criterion-driven error-tolerant peptide search (BICEPS).贝叶斯信息准则驱动的容错肽搜索(BICEPS)克服肽鉴定中的种属界限。
Mol Cell Proteomics. 2012 Jul;11(7):M111.014167. doi: 10.1074/mcp.M111.014167. Epub 2012 Apr 6.
7
Improving collision induced dissociation (CID), high energy collision dissociation (HCD), and electron transfer dissociation (ETD) fourier transform MS/MS degradome-peptidome identifications using high accuracy mass information.利用高精度质量信息提高碰撞诱导解离(CID)、高能碰撞解离(HCD)和电子转移解离(ETD)傅里叶变换 MS/MS 降解组-肽组鉴定。
J Proteome Res. 2012 Feb 3;11(2):668-77. doi: 10.1021/pr200597j. Epub 2011 Dec 1.
8
Experimental annotation of post-translational features and translated coding regions in the pathogen Salmonella Typhimurium.在病原体鼠伤寒沙门氏菌中对翻译后修饰特征和翻译编码区进行实验注释。
BMC Genomics. 2011 Aug 25;12:433. doi: 10.1186/1471-2164-12-433.
9
Effectiveness of CID, HCD, and ETD with FT MS/MS for degradomic-peptidomic analysis: comparison of peptide identification methods.CID、HCD 和 ETD 与 FT MS/MS 联用在降解组-肽组分析中的效果:肽鉴定方法比较。
J Proteome Res. 2011 Sep 2;10(9):3929-43. doi: 10.1021/pr200052c. Epub 2011 Aug 15.
10
A high-confidence human plasma proteome reference set with estimated concentrations in PeptideAtlas.高可信度的人类血浆蛋白质组参考集,PeptideAtlas 估计其肽段浓度。
Mol Cell Proteomics. 2011 Sep;10(9):M110.006353. doi: 10.1074/mcp.M110.006353. Epub 2011 Jun 1.

本文引用的文献

1
An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database.一种将肽的串联质谱数据与蛋白质数据库中氨基酸序列相关联的方法。
J Am Soc Mass Spectrom. 1994 Nov;5(11):976-89. doi: 10.1016/1044-0305(94)80016-2.
2
Clinical proteomics: A need to define the field and to begin to set adequate standards.临床蛋白质组学:需要定义该领域并开始制定适当的标准。
Proteomics Clin Appl. 2007 Feb;1(2):148-56. doi: 10.1002/prca.200600771. Epub 2007 Jan 22.
3
The minimum information about a proteomics experiment (MIAPE).蛋白质组学实验的最小信息(MIAPE)。
Nat Biotechnol. 2007 Aug;25(8):887-93. doi: 10.1038/nbt1329.
4
Virtual polymorphism: finding divergent peptide matches in mass spectrometry data.虚拟多态性:在质谱数据中寻找不同的肽段匹配
Anal Chem. 2007 Jul 1;79(13):5030-9. doi: 10.1021/ac0703496. Epub 2007 May 24.
5
Estimating the statistical significance of peptide identifications from shotgun proteomics experiments.评估鸟枪法蛋白质组学实验中肽段鉴定结果的统计学显著性。
J Proteome Res. 2007 May;6(5):1758-67. doi: 10.1021/pr0605320. Epub 2007 Mar 31.
6
Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry.用于提高质谱法大规模蛋白质鉴定可信度的靶标-诱饵搜索策略。
Nat Methods. 2007 Mar;4(3):207-14. doi: 10.1038/nmeth1019.
7
De novo peptide sequencing and identification with precision mass spectrometry.利用精确质谱进行从头肽测序和鉴定。
J Proteome Res. 2007 Jan;6(1):114-23. doi: 10.1021/pr060271u.
8
On the proper use of mass accuracy in proteomics.论蛋白质组学中质量精度的正确使用
Mol Cell Proteomics. 2007 Mar;6(3):377-81. doi: 10.1074/mcp.M600380-MCP200. Epub 2006 Dec 12.
9
Global landscape of protein complexes in the yeast Saccharomyces cerevisiae.酿酒酵母中蛋白质复合物的全球格局。
Nature. 2006 Mar 30;440(7084):637-43. doi: 10.1038/nature04670. Epub 2006 Mar 22.
10
ModifiComb, a new proteomic tool for mapping substoichiometric post-translational modifications, finding novel types of modifications, and fingerprinting complex protein mixtures.ModifiComb,一种用于绘制亚化学计量翻译后修饰图谱、发现新型修饰以及对复杂蛋白质混合物进行指纹识别的新型蛋白质组学工具。
Mol Cell Proteomics. 2006 May;5(5):935-48. doi: 10.1074/mcp.T500034-MCP200. Epub 2006 Jan 25.