• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用已知蛋白质功能关系提高基因表达分析的可重复性:良好关联蛋白的概念。

Enhancing reproducibility of gene expression analysis with known protein functional relationships: The concept of well-associated protein.

机构信息

Momenta Pharmaceuticals, 301 Binney Street, Cambridge, Massachusetts, United States of America.

出版信息

PLoS Comput Biol. 2020 Feb 14;16(2):e1007684. doi: 10.1371/journal.pcbi.1007684. eCollection 2020 Feb.

DOI:10.1371/journal.pcbi.1007684
PMID:32058996
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7046299/
Abstract

Identification of differentially expressed genes (DEGs) is well recognized to be variable across independent replications of genome-wide transcriptional studies. These are often employed to characterize disease state early in the process of discovery and prioritize novel targets aimed at addressing unmet medical need. Increasing reproducibility of biological findings from these studies could potentially positively impact the success rate of new clinical interventions. This work demonstrates that statistically sound combination of gene expression data with prior knowledge about biology in the form of large protein interaction networks can yield quantitatively more reproducible observations from studies characterizing human disease. The novel concept of Well-Associated Proteins (WAPs) introduced herein-gene products significantly associated on protein interaction networks with the differences in transcript levels between control and disease-does not require choosing a differential expression threshold and can be computed efficiently enough to enable false discovery rate estimation via permutation. Reproducibility of WAPs is shown to be on average superior to that of DEGs under easily-quantifiable conditions suggesting that they can yield a significantly more robust description of disease. Enhanced reproducibility of WAPs versus DEGs is first demonstrated with four independent data sets focused on systemic sclerosis. This finding is then validated over thousands of pairs of data sets obtained by random partitions of large studies in several other diseases. Conditions that individual data sets must satisfy to yield robust WAP scores are examined. Reproducible identification of WAPs can potentially benefit drug target selection and precision medicine studies.

摘要

鉴定差异表达基因(DEGs)在全基因组转录组研究的独立重复中是多变的,这一点已得到广泛认可。这些研究通常用于在发现过程的早期描述疾病状态,并优先选择针对未满足医疗需求的新目标。如果这些研究中的生物学发现具有更高的可重复性,那么新的临床干预措施的成功率可能会得到积极的影响。本工作表明,将基因表达数据与以大规模蛋白质相互作用网络形式呈现的生物学先验知识进行合理组合,可以从描述人类疾病的研究中得出更具定量可重复性的观察结果。本文提出的新概念——Well-Associated Proteins(WAPs)——即与转录水平差异显著相关的蛋白质相互作用网络上的基因产物,不需要选择差异表达阈值,并且可以高效计算,以便通过置换进行错误发现率估计。在易于量化的条件下,WAPs 的重现性平均优于 DEGs,表明它们可以对疾病进行更稳健的描述。我们首先用四个专注于系统性硬化症的独立数据集证明了 WAPs 与 DEGs 的重现性之间的增强。然后,在其他几种疾病的大型研究的随机分区中获得了数千对数据集,对这一发现进行了验证。检查了单个数据集必须满足哪些条件才能产生稳健的 WAP 分数。WAPs 的可重现鉴定可能会有益于药物靶点选择和精准医学研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b83c/7046299/9481f27c2828/pcbi.1007684.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b83c/7046299/532b0fa7793e/pcbi.1007684.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b83c/7046299/11ad4b839c22/pcbi.1007684.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b83c/7046299/7deef2f6a98d/pcbi.1007684.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b83c/7046299/06f892b239ef/pcbi.1007684.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b83c/7046299/f5df37023fa0/pcbi.1007684.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b83c/7046299/dc0c53a5a878/pcbi.1007684.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b83c/7046299/b36e0874f033/pcbi.1007684.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b83c/7046299/9481f27c2828/pcbi.1007684.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b83c/7046299/532b0fa7793e/pcbi.1007684.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b83c/7046299/11ad4b839c22/pcbi.1007684.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b83c/7046299/7deef2f6a98d/pcbi.1007684.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b83c/7046299/06f892b239ef/pcbi.1007684.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b83c/7046299/f5df37023fa0/pcbi.1007684.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b83c/7046299/dc0c53a5a878/pcbi.1007684.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b83c/7046299/b36e0874f033/pcbi.1007684.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b83c/7046299/9481f27c2828/pcbi.1007684.g008.jpg

相似文献

1
Enhancing reproducibility of gene expression analysis with known protein functional relationships: The concept of well-associated protein.利用已知蛋白质功能关系提高基因表达分析的可重复性:良好关联蛋白的概念。
PLoS Comput Biol. 2020 Feb 14;16(2):e1007684. doi: 10.1371/journal.pcbi.1007684. eCollection 2020 Feb.
2
A novel method to prioritize RNAseq data for post-hoc analysis based on absolute changes in transcript abundance.一种基于转录本丰度绝对变化对RNAseq数据进行事后分析优先级排序的新方法。
Stat Appl Genet Mol Biol. 2015 Jun;14(3):227-41. doi: 10.1515/sagmb-2014-0018.
3
Comparison of seven methods for producing Affymetrix expression scores based on False Discovery Rates in disease profiling data.基于疾病谱数据中错误发现率的七种生成Affymetrix表达分数方法的比较。
BMC Bioinformatics. 2005 Feb 10;6:26. doi: 10.1186/1471-2105-6-26.
4
Reproducibility enhancement and differential expression of non predefined functional gene sets in human genome.人类基因组中未预先定义的功能基因集的可重复性增强及差异表达
BMC Genomics. 2014 Dec 24;15(1):1181. doi: 10.1186/1471-2164-15-1181.
5
Integrated analysis of the gene expression profile and DNA methylation profile of obese patients with type 2 diabetes.2 型糖尿病肥胖患者的基因表达谱和 DNA 甲基化谱的综合分析。
Mol Med Rep. 2018 Jun;17(6):7636-7644. doi: 10.3892/mmr.2018.8804. Epub 2018 Mar 28.
6
Integrating Biological Networks for Drug Target Prediction and Prioritization.整合生物网络用于药物靶点预测及优先级排序
Methods Mol Biol. 2019;1903:203-218. doi: 10.1007/978-1-4939-8955-3_12.
7
Bioinformatics analysis of fibroblasts exposed to TGF‑β at the early proliferation phase of wound repair.成纤维细胞在伤口修复早期增殖阶段暴露于 TGF-β 后的生物信息学分析。
Mol Med Rep. 2017 Dec;16(6):8146-8154. doi: 10.3892/mmr.2017.7619. Epub 2017 Sep 26.
8
Construction of Parkinson's disease marker-based weighted protein-protein interaction network for prioritization of co-expressed genes.构建帕金森病标志物加权蛋白质-蛋白质相互作用网络,对共表达基因进行优先级排序。
Gene. 2019 May 20;697:67-77. doi: 10.1016/j.gene.2019.02.026. Epub 2019 Feb 16.
9
Identification of potential gene targets in systemic vasculitis using DNA microarray analysis.使用DNA微阵列分析鉴定系统性血管炎中的潜在基因靶点。
Mol Med Rep. 2017 Jun;15(6):3665-3673. doi: 10.3892/mmr.2017.6455. Epub 2017 Apr 11.
10
Network-based association analysis to infer new disease-gene relationships using large-scale protein interactions.基于网络的关联分析,利用大规模蛋白质相互作用推断新的疾病-基因关系。
PLoS One. 2018 Jun 27;13(6):e0199435. doi: 10.1371/journal.pone.0199435. eCollection 2018.

引用本文的文献

1
Multiomic study of skin, peripheral blood, and serum: is serum proteome a reflection of disease process at the end-organ level in systemic sclerosis?皮肤、外周血和血清的多组学研究:血清蛋白质组学是否反映了系统性硬化症终末器官水平的疾病过程?
Arthritis Res Ther. 2021 Oct 15;23(1):259. doi: 10.1186/s13075-021-02633-5.

本文引用的文献

1
Next-generation characterization of the Cancer Cell Line Encyclopedia.下一代癌症细胞系百科全书的特征描述。
Nature. 2019 May;569(7757):503-508. doi: 10.1038/s41586-019-1186-3. Epub 2019 May 8.
2
Genes with High Network Connectivity Are Enriched for Disease Heritability.高网络连通性的基因富集了疾病遗传性。
Am J Hum Genet. 2019 May 2;104(5):896-913. doi: 10.1016/j.ajhg.2019.03.020.
3
Identification of pathways associated with chemosensitivity through network embedding.通过网络嵌入识别与化疗敏感性相关的途径。
PLoS Comput Biol. 2019 Mar 20;15(3):e1006864. doi: 10.1371/journal.pcbi.1006864. eCollection 2019 Mar.
4
A Comprehensive Survey of Tools and Software for Active Subnetwork Identification.用于活跃子网识别的工具和软件综合调查。
Front Genet. 2019 Mar 5;10:155. doi: 10.3389/fgene.2019.00155. eCollection 2019.
5
Predictability of human differential gene expression.人类差异基因表达的可预测性。
Proc Natl Acad Sci U S A. 2019 Mar 26;116(13):6491-6500. doi: 10.1073/pnas.1802973116. Epub 2019 Mar 7.
6
Pathway enrichment analysis and visualization of omics data using g:Profiler, GSEA, Cytoscape and EnrichmentMap.使用 g:Profiler、GSEA、Cytoscape 和 EnrichmentMap 进行组学数据的通路富集分析和可视化。
Nat Protoc. 2019 Feb;14(2):482-517. doi: 10.1038/s41596-018-0103-9.
7
Gene set analysis methods: a systematic comparison.基因集分析方法:系统比较
BioData Min. 2018 May 31;11:8. doi: 10.1186/s13040-018-0166-8. eCollection 2018.
8
Systematic Evaluation of Molecular Networks for Discovery of Disease Genes.系统评价分子网络发现疾病基因。
Cell Syst. 2018 Apr 25;6(4):484-495.e5. doi: 10.1016/j.cels.2018.03.001. Epub 2018 Mar 28.
9
Integrative omics for health and disease.整体医学组学与健康和疾病。
Nat Rev Genet. 2018 May;19(5):299-310. doi: 10.1038/nrg.2018.4. Epub 2018 Feb 26.
10
The Reactome Pathway Knowledgebase.Reactome 通路知识库。
Nucleic Acids Res. 2018 Jan 4;46(D1):D649-D655. doi: 10.1093/nar/gkx1132.