• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

降低重叠基因的权重可以提高基因集分析的效果。

Down-weighting overlapping genes improves gene set analysis.

机构信息

Perinatology Research Branch, NICHD/NIH/DHHS, Bethesda, Maryland, USA.

出版信息

BMC Bioinformatics. 2012 Jun 19;13:136. doi: 10.1186/1471-2105-13-136.

DOI:10.1186/1471-2105-13-136
PMID:22713124
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3443069/
Abstract

BACKGROUND

The identification of gene sets that are significantly impacted in a given condition based on microarray data is a crucial step in current life science research. Most gene set analysis methods treat genes equally, regardless how specific they are to a given gene set.

RESULTS

In this work we propose a new gene set analysis method that computes a gene set score as the mean of absolute values of weighted moderated gene t-scores. The gene weights are designed to emphasize the genes appearing in few gene sets, versus genes that appear in many gene sets. We demonstrate the usefulness of the method when analyzing gene sets that correspond to the KEGG pathways, and hence we called our method Pathway Analysis with Down-weighting of Overlapping Genes (PADOG). Unlike most gene set analysis methods which are validated through the analysis of 2-3 data sets followed by a human interpretation of the results, the validation employed here uses 24 different data sets and a completely objective assessment scheme that makes minimal assumptions and eliminates the need for possibly biased human assessments of the analysis results.

CONCLUSIONS

PADOG significantly improves gene set ranking and boosts sensitivity of analysis using information already available in the gene expression profiles and the collection of gene sets to be analyzed. The advantages of PADOG over other existing approaches are shown to be stable to changes in the database of gene sets to be analyzed. PADOG was implemented as an R package available at: http://bioinformaticsprb.med.wayne.edu/PADOG/or http://www.bioconductor.org.

摘要

背景

根据微阵列数据识别给定条件下受显著影响的基因集是当前生命科学研究的关键步骤。大多数基因集分析方法平等对待基因,而不管它们对特定基因集的特异性如何。

结果

在这项工作中,我们提出了一种新的基因集分析方法,该方法将基因集得分计算为加权调节基因 t 分数的绝对值的平均值。基因权重旨在强调出现在少数基因集中的基因,而不是出现在许多基因集中的基因。我们通过分析对应于 KEGG 途径的基因集来证明该方法的有效性,因此我们将我们的方法称为具有重叠基因降权的途径分析(Pathway Analysis with Down-weighting of Overlapping Genes,PADOG)。与大多数基因集分析方法通过分析 2-3 个数据集并对结果进行人工解释来验证不同,这里的验证使用了 24 个不同的数据集和一个完全客观的评估方案,该方案进行了最小的假设,并且无需对分析结果进行可能有偏差的人工评估。

结论

PADOG 显著提高了基因集的排序,并通过利用基因表达谱和要分析的基因集集合中已经存在的信息来提高分析的敏感性。与其他现有方法相比,PADOG 的优势在分析的基因集数据库发生变化时仍然稳定。PADOG 已作为一个 R 包实现,可在以下网址获得:http://bioinformaticsprb.med.wayne.edu/PADOG/或 http://www.bioconductor.org。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c27c/3443069/2165f9d12b9a/1471-2105-13-136-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c27c/3443069/5086e30a1ef8/1471-2105-13-136-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c27c/3443069/e34aeb97bd37/1471-2105-13-136-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c27c/3443069/0f5648550a50/1471-2105-13-136-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c27c/3443069/f9f374271ee7/1471-2105-13-136-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c27c/3443069/2165f9d12b9a/1471-2105-13-136-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c27c/3443069/5086e30a1ef8/1471-2105-13-136-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c27c/3443069/e34aeb97bd37/1471-2105-13-136-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c27c/3443069/0f5648550a50/1471-2105-13-136-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c27c/3443069/f9f374271ee7/1471-2105-13-136-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c27c/3443069/2165f9d12b9a/1471-2105-13-136-5.jpg

相似文献

1
Down-weighting overlapping genes improves gene set analysis.降低重叠基因的权重可以提高基因集分析的效果。
BMC Bioinformatics. 2012 Jun 19;13:136. doi: 10.1186/1471-2105-13-136.
2
Differential regulation enrichment analysis via the integration of transcriptional regulatory network and gene expression data.通过整合转录调控网络和基因表达数据进行差异调控富集分析。
Bioinformatics. 2015 Feb 15;31(4):563-71. doi: 10.1093/bioinformatics/btu672. Epub 2014 Oct 15.
3
Importing ArrayExpress datasets into R/Bioconductor.将 ArrayExpress 数据集导入 R/Bioconductor。
Bioinformatics. 2009 Aug 15;25(16):2092-4. doi: 10.1093/bioinformatics/btp354. Epub 2009 Jun 8.
4
Comparative study of gene set enrichment methods.基因集富集方法的比较研究。
BMC Bioinformatics. 2009 Sep 2;10:275. doi: 10.1186/1471-2105-10-275.
5
Estimating the similarity of alternative Affymetrix probe sets using transcriptional networks.利用转录网络评估替代Affymetrix探针集的相似性。
BMC Res Notes. 2013 Mar 21;6:107. doi: 10.1186/1756-0500-6-107.
6
Interactively optimizing signal-to-noise ratios in expression profiling: project-specific algorithm selection and detection p-value weighting in Affymetrix microarrays.在表达谱分析中交互式优化信噪比:Affymetrix微阵列中特定项目的算法选择和检测p值加权
Bioinformatics. 2004 Nov 1;20(16):2534-44. doi: 10.1093/bioinformatics/bth280. Epub 2004 Apr 29.
7
Calculation of reliable transcript levels of annotated genes on the basis of multiple probe-sets in Affymetrix microarrays.基于Affymetrix微阵列中多个探针集计算注释基因的可靠转录水平。
Acta Biochim Pol. 2009;56(2):271-7. Epub 2009 May 12.
8
Discovering monotonic stemness marker genes from time-series stem cell microarray data.从时间序列干细胞微阵列数据中发现单调干性标记基因。
BMC Genomics. 2015;16 Suppl 2(Suppl 2):S2. doi: 10.1186/1471-2164-16-S2-S2. Epub 2015 Jan 21.
9
A statistical framework for differential network analysis from microarray data.一种基于微阵列数据的差异网络分析的统计框架。
BMC Bioinformatics. 2010 Feb 19;11:95. doi: 10.1186/1471-2105-11-95.
10
Gene expression analysis in clear cell renal cell carcinoma using gene set enrichment analysis for biostatistical management.基于基因集富集分析的 clear cell 肾细胞癌基因表达分析用于生物统计学管理。
BJU Int. 2011 Jul;108(2 Pt 2):E29-35. doi: 10.1111/j.1464-410X.2010.09794.x. Epub 2011 Mar 16.

引用本文的文献

1
SomaModules: A Pathway Enrichment Approach Tailored to SomaScan Data.体细胞模块:一种针对SomaScan数据量身定制的通路富集方法。
J Proteome Res. 2025 Sep 5;24(9):4391-4402. doi: 10.1021/acs.jproteome.4c01114. Epub 2025 Aug 11.
2
SomaModules: a pathway enrichment approach tailored to SomaScan data.体细胞模块:一种针对SomaScan数据量身定制的通路富集方法。
bioRxiv. 2025 Aug 2:2025.07.30.667673. doi: 10.1101/2025.07.30.667673.
3
MRAS: Master Regulator Analysis of Alternative Splicing.MRAS:可变剪接的主调控因子分析

本文引用的文献

1
PGC-1α, a potential therapeutic target for early intervention in Parkinson's disease.PGC-1α,帕金森病早期干预的潜在治疗靶点。
Sci Transl Med. 2010 Oct 6;2(52):52ra73. doi: 10.1126/scitranslmed.3001059.
2
Gene expression profiling reveals novel biomarkers in nonsmall cell lung cancer.基因表达谱分析揭示非小细胞肺癌的新型生物标志物。
Int J Cancer. 2011 Jul 15;129(2):355-64. doi: 10.1002/ijc.25704. Epub 2010 Nov 28.
3
Gene set analysis exploiting the topology of a pathway.利用通路拓扑结构的基因集分析。
Adv Sci (Weinh). 2025 Jun;12(21):e2414493. doi: 10.1002/advs.202414493. Epub 2025 May 5.
4
Proteomic, metabolomic and lipidomic profiles in community acquired pneumonia for differentiating viral and bacterial infections.社区获得性肺炎中用于区分病毒和细菌感染的蛋白质组学、代谢组学和脂质组学特征。
Sci Rep. 2025 Jan 14;15(1):1922. doi: 10.1038/s41598-025-85229-2.
5
Prediction of late-onset preeclampsia using plasma proteomics: a longitudinal multi-cohort study.利用血浆蛋白质组学预测晚发型子痫前期:一项纵向多队列研究
Sci Rep. 2024 Dec 28;14(1):30813. doi: 10.1038/s41598-024-81277-2.
6
Translation of genome-wide association study: from genomic signals to biological insights.全基因组关联研究的翻译:从基因组信号到生物学见解。
Front Genet. 2024 Oct 3;15:1375481. doi: 10.3389/fgene.2024.1375481. eCollection 2024.
7
A best-match approach for gene set analyses in embedding spaces.一种在嵌入空间中进行基因集分析的最佳匹配方法。
Genome Res. 2024 Oct 11;34(9):1421-1433. doi: 10.1101/gr.279141.124.
8
Unbiased discovery of cancer pathways and therapeutics using Pathway Ensemble Tool and Benchmark.使用通路集成工具和基准进行癌症通路和治疗方法的无偏发现
Nat Commun. 2024 Aug 24;15(1):7288. doi: 10.1038/s41467-024-51859-9.
9
Ant colony optimization for the identification of dysregulated gene subnetworks from expression data.基于蚁群算法的表达数据中失调基因子网络识别
BMC Bioinformatics. 2024 Aug 1;25(1):254. doi: 10.1186/s12859-024-05871-x.
10
CCPA: cloud-based, self-learning modules for consensus pathway analysis using GO, KEGG and Reactome.CCPA:基于云的、使用 GO、KEGG 和 Reactome 的自学习共识途径分析模块。
Brief Bioinform. 2024 Jul 23;25(Supplement_1). doi: 10.1093/bib/bbae222.
BMC Syst Biol. 2010 Sep 1;4:121. doi: 10.1186/1752-0509-4-121.
4
Gene expression-based classification of non-small cell lung carcinomas and survival prediction.基于基因表达的非小细胞肺癌分类和生存预测。
PLoS One. 2010 Apr 22;5(4):e10312. doi: 10.1371/journal.pone.0010312.
5
A 'metastasis-prone' signature for early-stage mismatch-repair proficient sporadic colorectal cancer patients and its implications for possible therapeutics.早期错配修复功能良好的散发性结直肠癌患者的“转移倾向”特征及其可能的治疗意义。
Clin Exp Metastasis. 2010 Feb;27(2):83-90. doi: 10.1007/s10585-010-9305-4. Epub 2010 Feb 9.
6
FKBP51 affects cancer cell response to chemotherapy by negatively regulating Akt.FKBP51通过负向调节Akt影响癌细胞对化疗的反应。
Cancer Cell. 2009 Sep 8;16(3):259-66. doi: 10.1016/j.ccr.2009.07.016.
7
Choosing the right path: enhancement of biologically relevant sets of genes or proteins using pathway structure.选择正确的途径:利用途径结构增强生物学相关的基因或蛋白质组。
Genome Biol. 2009;10(4):R44. doi: 10.1186/gb-2009-10-4-r44. Epub 2009 Apr 24.
8
Combined gene expression analysis of whole-tissue and microdissected pancreatic ductal adenocarcinoma identifies genes specifically overexpressed in tumor epithelia.全组织及显微切割胰腺导管腺癌的联合基因表达分析鉴定出在肿瘤上皮中特异性过表达的基因。
Hepatogastroenterology. 2008 Nov-Dec;55(88):2016-27.
9
Regulation of endocytosis via the oxygen-sensing pathway.通过氧感应途径对胞吞作用的调节。
Nat Med. 2009 Mar;15(3):319-24. doi: 10.1038/nm.1922. Epub 2009 Mar 1.
10
A general modular framework for gene set enrichment analysis.一种用于基因集富集分析的通用模块化框架。
BMC Bioinformatics. 2009 Feb 3;10:47. doi: 10.1186/1471-2105-10-47.