• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用云计算对临床蛋白质组肿瘤分析联盟(CPTAC)数据集进行复检,发现大量翻译后修饰和蛋白质序列变体。

Reinspection of a Clinical Proteomics Tumor Analysis Consortium (CPTAC) Dataset with Cloud Computing Reveals Abundant Post-Translational Modifications and Protein Sequence Variants.

作者信息

Prakash Amol, Taylor Lorne, Varkey Manu, Hoxie Nate, Mohammed Yassene, Goo Young Ah, Peterman Scott, Moghekar Abhay, Yuan Yuting, Glaros Trevor, Steele Joel R, Faridi Pouya, Parihari Shashwati, Srivastava Sanjeeva, Otto Joseph J, Nyalwidhe Julius O, Semmes O John, Moran Michael F, Madugundu Anil, Mun Dong Gi, Pandey Akhilesh, Mahoney Keira E, Shabanowitz Jeffrey, Saxena Satya, Orsburn Benjamin C

机构信息

Optys Tech Corporation, Shrewsbury, MA 01545, USA.

McGill University Health Center, Montreal, QC H4A 3J1, Canada.

出版信息

Cancers (Basel). 2021 Oct 9;13(20):5034. doi: 10.3390/cancers13205034.

DOI:10.3390/cancers13205034
PMID:34680183
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8534219/
Abstract

The Clinical Proteomic Tumor Analysis Consortium (CPTAC) has provided some of the most in-depth analyses of the phenotypes of human tumors ever constructed. Today, the majority of proteomic data analysis is still performed using software housed on desktop computers which limits the number of sequence variants and post-translational modifications that can be considered. The original CPTAC studies limited the search for PTMs to only samples that were chemically enriched for those modified peptides. Similarly, the only sequence variants considered were those with strong evidence at the exon or transcript level. In this multi-institutional collaborative reanalysis, we utilized unbiased protein databases containing millions of human sequence variants in conjunction with hundreds of common post-translational modifications. Using these tools, we identified tens of thousands of high-confidence PTMs and sequence variants. We identified 4132 phosphorylated peptides in nonenriched samples, 93% of which were confirmed in the samples which were chemically enriched for phosphopeptides. In addition, our results also cover 90% of the high-confidence variants reported by the original proteogenomics study, without the need for sample specific next-generation sequencing. Finally, we report fivefold more somatic and germline variants that have an independent evidence at the peptide level, including mutations in ERRB2 and BCAS1. In this reanalysis of CPTAC proteomic data with cloud computing, we present an openly available and searchable web resource of the highest-coverage proteomic profiling of human tumors described to date.

摘要

临床蛋白质组肿瘤分析联盟(CPTAC)对人类肿瘤表型进行了一些有史以来最深入的分析。如今,大多数蛋白质组数据分析仍使用台式计算机上的软件进行,这限制了可考虑的序列变异和翻译后修饰的数量。CPTAC的原始研究将翻译后修饰的搜索仅限于那些化学富集修饰肽的样本。同样,唯一考虑的序列变异是那些在外显子或转录水平有确凿证据的变异。在这项多机构合作的重新分析中,我们利用了包含数百万个人类序列变异以及数百种常见翻译后修饰的无偏蛋白质数据库。使用这些工具,我们鉴定出了数以万计的高可信度翻译后修饰和序列变异。我们在未富集的样本中鉴定出4132个磷酸化肽,其中93%在化学富集磷酸肽的样本中得到证实。此外,我们的结果还涵盖了原始蛋白质基因组学研究报告的90%的高可信度变异,而无需进行样本特异性的下一代测序。最后,我们报告了在肽水平有独立证据的体细胞和种系变异数量增加了五倍,包括ERBB2和BCAS1中的突变。在这项利用云计算对CPTAC蛋白质组数据进行的重新分析中,我们展示了一个公开可用且可搜索的网络资源,它是迄今为止所描述的人类肿瘤最高覆盖率蛋白质组分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c846/8534219/6025f1877409/cancers-13-05034-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c846/8534219/2977b0ec69ae/cancers-13-05034-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c846/8534219/0e21be118cb5/cancers-13-05034-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c846/8534219/27b10e832806/cancers-13-05034-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c846/8534219/4b5310b9d79c/cancers-13-05034-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c846/8534219/6025f1877409/cancers-13-05034-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c846/8534219/2977b0ec69ae/cancers-13-05034-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c846/8534219/0e21be118cb5/cancers-13-05034-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c846/8534219/27b10e832806/cancers-13-05034-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c846/8534219/4b5310b9d79c/cancers-13-05034-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c846/8534219/6025f1877409/cancers-13-05034-g005.jpg

相似文献

1
Reinspection of a Clinical Proteomics Tumor Analysis Consortium (CPTAC) Dataset with Cloud Computing Reveals Abundant Post-Translational Modifications and Protein Sequence Variants.利用云计算对临床蛋白质组肿瘤分析联盟(CPTAC)数据集进行复检,发现大量翻译后修饰和蛋白质序列变体。
Cancers (Basel). 2021 Oct 9;13(20):5034. doi: 10.3390/cancers13205034.
2
Human Proteomic Variation Revealed by Combining RNA-Seq Proteogenomics and Global Post-Translational Modification (G-PTM) Search Strategy.通过整合RNA测序蛋白质基因组学和全球翻译后修饰(G-PTM)搜索策略揭示的人类蛋白质组变异
J Proteome Res. 2016 Mar 4;15(3):800-8. doi: 10.1021/acs.jproteome.5b00817. Epub 2016 Jan 12.
3
A User-Friendly Visualization Tool for Multi-Omics Data.用于多组学数据的用户友好型可视化工具。
Proteomics. 2020 Nov;20(21-22):e2000136. doi: 10.1002/pmic.202000136. Epub 2020 Sep 3.
4
The CPTAC Data Portal: A Resource for Cancer Proteomics Research.CPTAC数据门户:癌症蛋白质组学研究资源
J Proteome Res. 2015 Jun 5;14(6):2707-13. doi: 10.1021/pr501254j. Epub 2015 May 4.
5
A Description of the Clinical Proteomic Tumor Analysis Consortium (CPTAC) Common Data Analysis Pipeline.临床蛋白质组肿瘤分析联盟(CPTAC)通用数据分析流程说明。
J Proteome Res. 2016 Mar 4;15(3):1023-32. doi: 10.1021/acs.jproteome.5b01091. Epub 2016 Feb 25.
6
Detection and verification of 2.3 million cancer mutations in NCI60 cancer cell lines with a cloud search engine.利用云搜索引擎检测和验证 NCI60 癌细胞系中的 230 万个癌症突变。
J Proteomics. 2019 Oct 30;209:103488. doi: 10.1016/j.jprot.2019.103488. Epub 2019 Aug 21.
7
Proteogenomic Analysis of Breast Cancer Transcriptomic and Proteomic Data, Using De Novo Transcript Assembly: Genome-Wide Identification of Novel Peptides and Clinical Implications.基于从头转录组组装的乳腺癌转录组学和蛋白质组学数据的蛋白质基因组分析:新型肽的全基因组鉴定及其临床意义。
Mol Cell Proteomics. 2022 Apr;21(4):100220. doi: 10.1016/j.mcpro.2022.100220. Epub 2022 Feb 26.
8
A proteogenomics data-driven knowledge base of human cancer.一个基于人类癌症的蛋白质基因组学数据驱动的知识库。
Cell Syst. 2023 Sep 20;14(9):777-787.e5. doi: 10.1016/j.cels.2023.07.007. Epub 2023 Aug 23.
9
Proteogenomic data and resources for pan-cancer analysis.泛癌分析的蛋白质基因组学数据和资源。
Cancer Cell. 2023 Aug 14;41(8):1397-1406. doi: 10.1016/j.ccell.2023.06.009.
10
Simplified and Unified Access to Cancer Proteogenomic Data.简化和统一的癌症蛋白质基因组学数据访问。
J Proteome Res. 2021 Apr 2;20(4):1902-1910. doi: 10.1021/acs.jproteome.0c00919. Epub 2021 Feb 9.

引用本文的文献

1
Proteomic Analysis Revealed the Potential Role of MAGE-D2 in the Therapeutic Targeting of Triple-Negative Breast Cancer.蛋白质组学分析揭示了 MAGE-D2 在三阴性乳腺癌治疗靶向中的潜在作用。
Mol Cell Proteomics. 2024 Jan;23(1):100703. doi: 10.1016/j.mcpro.2023.100703. Epub 2023 Dec 20.
2
Proteogenomic data and resources for pan-cancer analysis.泛癌分析的蛋白质基因组学数据和资源。
Cancer Cell. 2023 Aug 14;41(8):1397-1406. doi: 10.1016/j.ccell.2023.06.009.
3
Effectively utilizing publicly available databases for cancer target evaluation.

本文引用的文献

1
Toward best practice in cancer mutation detection with whole-genome and whole-exome sequencing.实现全基因组和全外显子组测序中癌症基因突变检测的最佳实践。
Nat Biotechnol. 2021 Sep;39(9):1141-1150. doi: 10.1038/s41587-021-00994-5. Epub 2021 Sep 9.
2
Rapid genotype imputation from sequence with reference panels.基于参考面板的序列快速基因型推断。
Nat Genet. 2021 Jul;53(7):1104-1111. doi: 10.1038/s41588-021-00877-0. Epub 2021 Jun 3.
3
PANOPLY: a cloud-based platform for automated and reproducible proteogenomic data analysis.PANOPLY:一个基于云的平台,用于自动化和可重复的蛋白质基因组数据分析。
有效利用公开可用数据库进行癌症靶点评估。
NAR Cancer. 2023 Jul 14;5(3):zcad035. doi: 10.1093/narcan/zcad035. eCollection 2023 Sep.
4
A Novel Splice Variant of Inhibits β-Arrestin 2 to Promote the Proliferation and Migration of Glioblastoma Cells, and This Effect Was Blocked by Maackiain.一种新型剪接变体抑制β-抑制蛋白2以促进胶质母细胞瘤细胞的增殖和迁移,而该效应被山奈酚阻断。
Cancers (Basel). 2022 Aug 11;14(16):3890. doi: 10.3390/cancers14163890.
Nat Methods. 2021 Jun;18(6):580-582. doi: 10.1038/s41592-021-01176-6.
4
Single-cell proteomic and transcriptomic analysis of macrophage heterogeneity using SCoPE2.单细胞蛋白质组学和转录组学分析使用 SCoPE2 分析巨噬细胞异质性。
Genome Biol. 2021 Jan 27;22(1):50. doi: 10.1186/s13059-021-02267-5.
5
State of the Field in Multi-Omics Research: From Computational Needs to Data Mining and Sharing.多组学研究领域现状:从计算需求到数据挖掘与共享
Front Genet. 2020 Dec 10;11:610798. doi: 10.3389/fgene.2020.610798. eCollection 2020.
6
Proteogenomic Landscape of Breast Cancer Tumorigenesis and Targeted Therapy.乳腺癌发生和靶向治疗的蛋白质基因组全景分析
Cell. 2020 Nov 25;183(5):1436-1456.e31. doi: 10.1016/j.cell.2020.10.036. Epub 2020 Nov 18.
7
Small Molecule KRAS Inhibitors: The Future for Targeted Pancreatic Cancer Therapy?小分子KRAS抑制剂:靶向胰腺癌治疗的未来?
Cancers (Basel). 2020 May 24;12(5):1341. doi: 10.3390/cancers12051341.
8
The premise of personalized immunotherapy for cancer dormancy.癌症休眠期个体化免疫治疗的前提。
Oncogene. 2020 May;39(22):4323-4330. doi: 10.1038/s41388-020-1295-3. Epub 2020 Apr 22.
9
Targeting the MAPK Pathway in KRAS-Driven Tumors.针对 KRAS 驱动肿瘤的 MAPK 通路。
Cancer Cell. 2020 Apr 13;37(4):543-550. doi: 10.1016/j.ccell.2020.03.013.
10
A Compact Quadrupole-Orbitrap Mass Spectrometer with FAIMS Interface Improves Proteome Coverage in Short LC Gradients.紧凑型四极杆轨道阱质谱仪与 FAIMS 接口联用,在短 LC 梯度下提高蛋白质组覆盖度。
Mol Cell Proteomics. 2020 Apr;19(4):716-729. doi: 10.1074/mcp.TIR119.001906. Epub 2020 Feb 12.