• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于检测差异表达蛋白质的贝叶斯算法及其在乳腺癌研究中的应用。

A Bayesian algorithm for detecting differentially expressed proteins and its application in breast cancer research.

作者信息

Santra Tapesh, Delatola Eleni Ioanna

机构信息

Systems Biology Ireland, University College Dublin, Belfield, Dublin-4, Ireland.

出版信息

Sci Rep. 2016 Jul 22;6:30159. doi: 10.1038/srep30159.

DOI:10.1038/srep30159
PMID:27444576
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4957118/
Abstract

Presence of considerable noise and missing data points make analysis of mass-spectrometry (MS) based proteomic data a challenging task. The missing values in MS data are caused by the inability of MS machines to reliably detect proteins whose abundances fall below the detection limit. We developed a Bayesian algorithm that exploits this knowledge and uses missing data points as a complementary source of information to the observed protein intensities in order to find differentially expressed proteins by analysing MS based proteomic data. We compared its accuracy with many other methods using several simulated datasets. It consistently outperformed other methods. We then used it to analyse proteomic screens of a breast cancer (BC) patient cohort. It revealed large differences between the proteomic landscapes of triple negative and Luminal A, which are the most and least aggressive types of BC. Unexpectedly, majority of these differences could be attributed to the direct transcriptional activity of only seven transcription factors some of which are known to be inactive in triple negative BC. We also identified two new proteins which significantly correlated with the survival of BC patients, and therefore may have potential diagnostic/prognostic values.

摘要

大量噪声和缺失数据点的存在使得基于质谱(MS)的蛋白质组学数据分析成为一项具有挑战性的任务。MS数据中的缺失值是由MS机器无法可靠检测丰度低于检测限的蛋白质所致。我们开发了一种贝叶斯算法,该算法利用这一知识,并将缺失数据点用作观测到的蛋白质强度的补充信息源,以便通过分析基于MS的蛋白质组学数据来寻找差异表达的蛋白质。我们使用几个模拟数据集将其准确性与许多其他方法进行了比较。它始终优于其他方法。然后我们用它来分析一组乳腺癌(BC)患者的蛋白质组学筛查结果。结果显示,三阴型和管腔A型这两种侵袭性最强和最弱的BC类型之间的蛋白质组图谱存在很大差异。出乎意料的是,这些差异中的大多数可归因于仅7种转录因子的直接转录活性,其中一些转录因子已知在三阴型BC中无活性。我们还鉴定出两种与BC患者生存率显著相关的新蛋白质,因此可能具有潜在的诊断/预后价值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60ca/4957118/ef7975c3d30c/srep30159-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60ca/4957118/83ea38f52eb3/srep30159-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60ca/4957118/e6fdbd9b2c69/srep30159-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60ca/4957118/35be5e583a01/srep30159-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60ca/4957118/871100929081/srep30159-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60ca/4957118/ef7975c3d30c/srep30159-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60ca/4957118/83ea38f52eb3/srep30159-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60ca/4957118/e6fdbd9b2c69/srep30159-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60ca/4957118/35be5e583a01/srep30159-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60ca/4957118/871100929081/srep30159-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60ca/4957118/ef7975c3d30c/srep30159-f5.jpg

相似文献

1
A Bayesian algorithm for detecting differentially expressed proteins and its application in breast cancer research.一种用于检测差异表达蛋白质的贝叶斯算法及其在乳腺癌研究中的应用。
Sci Rep. 2016 Jul 22;6:30159. doi: 10.1038/srep30159.
2
Advancement of mass spectrometry-based proteomics technologies to explore triple negative breast cancer.基于质谱的蛋白质组学技术在探索三阴性乳腺癌方面的进展。
Mol Biosyst. 2016 Dec 20;13(1):42-55. doi: 10.1039/c6mb00639f.
3
Glycoproteomic comparison of clinical triple-negative and luminal breast tumors.临床三阴性和管腔型乳腺肿瘤的糖蛋白质组学比较
J Proteome Res. 2015 Mar 6;14(3):1376-88. doi: 10.1021/pr500987r. Epub 2015 Feb 16.
4
Shotgun proteomics of archival triple-negative breast cancer samples.存档三阴性乳腺癌样本的 shotgun 蛋白质组学分析。
Proteomics Clin Appl. 2013 Apr;7(3-4):283-91. doi: 10.1002/prca.201200048. Epub 2013 Mar 6.
5
The proteomic landscape of triple-negative breast cancer.三阴性乳腺癌的蛋白质组学概况
Cell Rep. 2015 Apr 28;11(4):630-44. doi: 10.1016/j.celrep.2015.03.050. Epub 2015 Apr 16.
6
A multi-model statistical approach for proteomic spectral count quantitation.一种用于蛋白质组学光谱计数定量的多模型统计方法。
J Proteomics. 2016 Jul 20;144:23-32. doi: 10.1016/j.jprot.2016.05.032. Epub 2016 May 31.
7
Molecular characterization of triple negative breast cancer formaldehyde-fixed paraffin-embedded samples by data-independent acquisition proteomics.采用数据非依赖采集蛋白质组学方法对三阴性乳腺癌福尔马林固定石蜡包埋样本进行分子特征分析。
Proteomics. 2022 Feb;22(3):e2100110. doi: 10.1002/pmic.202100110. Epub 2021 Oct 27.
8
Prediction of adjuvant chemotherapy response in triple negative breast cancer with discovery and targeted proteomics.利用发现型和靶向蛋白质组学预测三阴性乳腺癌的辅助化疗反应
PLoS One. 2017 Jun 8;12(6):e0178296. doi: 10.1371/journal.pone.0178296. eCollection 2017.
9
In-depth method assessments of differentially expressed protein detection for shotgun proteomics data with missing values.深度方法评估用于具有缺失值的鸟枪法蛋白质组学数据的差异表达蛋白检测。
Sci Rep. 2017 Jun 13;7(1):3367. doi: 10.1038/s41598-017-03650-8.
10
Subtyping of breast cancer using reverse phase protein arrays.利用反相蛋白质阵列对乳腺癌进行亚型分类。
Expert Rev Proteomics. 2014 Dec;11(6):757-70. doi: 10.1586/14789450.2014.971113.

引用本文的文献

1
Refate identifies chemical compounds to target trans-regulatory networks for cellular conversion.Refate可识别用于靶向细胞转化的反式调控网络的化合物。
bioRxiv. 2025 Jul 12:2025.07.09.664003. doi: 10.1101/2025.07.09.664003.
2
Machine learning algorithms being an auxiliary tool to predict the overall survival of patients with renal cell carcinoma using the SEER database.机器学习算法作为一种辅助工具,利用监测、流行病学与最终结果(SEER)数据库预测肾细胞癌患者的总生存期。
Transl Androl Urol. 2024 Jan 31;13(1):53-63. doi: 10.21037/tau-23-319. Epub 2024 Jan 23.
3
Challenges and Opportunities for Bayesian Statistics in Proteomics.

本文引用的文献

1
Plug-and-play analysis of the human phosphoproteome by targeted high-resolution mass spectrometry.通过靶向高分辨率质谱对人类磷酸化蛋白质组进行即插即用分析。
Nat Methods. 2016 May;13(5):431-4. doi: 10.1038/nmeth.3811. Epub 2016 Mar 28.
2
Minimal sample requirement for highly multiplexed protein quantification in cell lines and tissues by PCT-SWATH mass spectrometry.通过PCT-SWATH质谱法对细胞系和组织中高度多重蛋白质定量的最低样本要求
Proteomics. 2015 Nov;15(21):3711-21. doi: 10.1002/pmic.201500161. Epub 2015 Sep 10.
3
Distinguishing Low-Risk Luminal A Breast Cancer Subtypes with Ki-67 and p53 Is More Predictive of Long-Term Survival.
贝叶斯统计学在蛋白质组学中的挑战与机遇。
J Proteome Res. 2022 Apr 1;21(4):849-864. doi: 10.1021/acs.jproteome.1c00859. Epub 2022 Mar 8.
4
Comprehensive tissue-specific gene set enrichment analysis and transcription factor analysis of breast cancer by integrating 14 gene expression datasets.通过整合14个基因表达数据集对乳腺癌进行全面的组织特异性基因集富集分析和转录因子分析。
Oncotarget. 2017 Jan 24;8(4):6775-6786. doi: 10.18632/oncotarget.14286.
5
Vascular Endothelial Growth Factor (VEGF) Promotes Assembly of the p130Cas Interactome to Drive Endothelial Chemotactic Signaling and Angiogenesis.血管内皮生长因子(VEGF)促进p130Cas相互作用组的组装,以驱动内皮趋化信号传导和血管生成。
Mol Cell Proteomics. 2017 Feb;16(2):168-180. doi: 10.1074/mcp.M116.064428. Epub 2016 Dec 22.
利用Ki-67和p53区分低风险腔面A型乳腺癌亚型对长期生存更具预测性。
PLoS One. 2015 Aug 4;10(8):e0124658. doi: 10.1371/journal.pone.0124658. eCollection 2015.
4
Differential protein expression and peak selection in mass spectrometry data by binary discriminant analysis.基于二元判别分析的质谱数据差异蛋白质表达和峰选择。
Bioinformatics. 2015 Oct 1;31(19):3156-62. doi: 10.1093/bioinformatics/btv334. Epub 2015 May 28.
5
The CPTAC Data Portal: A Resource for Cancer Proteomics Research.CPTAC数据门户:癌症蛋白质组学研究资源
J Proteome Res. 2015 Jun 5;14(6):2707-13. doi: 10.1021/pr501254j. Epub 2015 May 4.
6
Review, evaluation, and discussion of the challenges of missing value imputation for mass spectrometry-based label-free global proteomics.基于质谱的无标记全局蛋白质组学中缺失值插补挑战的综述、评估与讨论。
J Proteome Res. 2015 May 1;14(5):1993-2001. doi: 10.1021/pr501138h. Epub 2015 Apr 22.
7
Two-group comparisons of zero-inflated intensity values: the choice of test statistic matters.零膨胀强度值的两组比较:检验统计量的选择很重要。
Bioinformatics. 2015 Jul 15;31(14):2310-7. doi: 10.1093/bioinformatics/btv154. Epub 2015 Mar 18.
8
ATF4 Gene Network Mediates Cellular Response to the Anticancer PAD Inhibitor YW3-56 in Triple-Negative Breast Cancer Cells.ATF4基因网络介导三阴性乳腺癌细胞对抗癌PAD抑制剂YW3-56的细胞反应。
Mol Cancer Ther. 2015 Apr;14(4):877-88. doi: 10.1158/1535-7163.MCT-14-1093-T. Epub 2015 Jan 22.
9
Label-free proteomic analysis of breast cancer molecular subtypes.乳腺癌分子亚型的无标记蛋白质组学分析
J Proteome Res. 2014 Nov 7;13(11):4752-72. doi: 10.1021/pr500676x. Epub 2014 Sep 24.
10
Linking signaling pathways to transcriptional programs in breast cancer.将乳腺癌中的信号通路与转录程序相联系。
Genome Res. 2014 Nov;24(11):1869-80. doi: 10.1101/gr.173039.114. Epub 2014 Sep 2.