• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

FUNNEL-GSEA:时间序列基因集富集分析中的功能弹性网络回归。

FUNNEL-GSEA: FUNctioNal ELastic-net regression in time-course gene set enrichment analysis.

机构信息

Department of Biostatistics and Computational Biology.

Department of Microbiology and Immunology, University of Rochester, Rochester, NY 14642, USA.

出版信息

Bioinformatics. 2017 Jul 1;33(13):1944-1952. doi: 10.1093/bioinformatics/btx104.

DOI:10.1093/bioinformatics/btx104
PMID:28334094
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5939227/
Abstract

MOTIVATION

Gene set enrichment analyses (GSEAs) are widely used in genomic research to identify underlying biological mechanisms (defined by the gene sets), such as Gene Ontology terms and molecular pathways. There are two caveats in the currently available methods: (i) they are typically designed for group comparisons or regression analyses, which do not utilize temporal information efficiently in time-series of transcriptomics measurements; and (ii) genes overlapping in multiple molecular pathways are considered multiple times in hypothesis testing.

RESULTS

We propose an inferential framework for GSEA based on functional data analysis, which utilizes the temporal information based on functional principal component analysis, and disentangles the effects of overlapping genes by a functional extension of the elastic-net regression. Furthermore, the hypothesis testing for the gene sets is performed by an extension of Mann-Whitney U test which is based on weighted rank sums computed from correlated observations. By using both simulated datasets and a large-scale time-course gene expression data on human influenza infection, we demonstrate that our method has uniformly better receiver operating characteristic curves, and identifies more pathways relevant to immune-response to human influenza infection than the competing approaches.

AVAILABILITY AND IMPLEMENTATION

The methods are implemented in R package FUNNEL, freely and publicly available at: https://github.com/yunzhang813/FUNNEL-GSEA-R-Package .

CONTACT

xing_qiu@urmc.rochester.edu or juilee_thakar@urmc.rochester.edu.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

基因集富集分析(GSEA)广泛应用于基因组研究中,以识别潜在的生物学机制(由基因集定义),如基因本体论术语和分子途径。目前可用方法存在两个问题:(i)它们通常是为组比较或回归分析设计的,在转录组测量的时间序列中不能有效地利用时间信息;(ii)在多个分子途径中重叠的基因在假设检验中被多次考虑。

结果

我们提出了一种基于功能数据分析的 GSEA 推断框架,该框架利用基于功能主成分分析的时间信息,并通过弹性网络回归的功能扩展来分解重叠基因的影响。此外,通过基于相关观测的加权秩和计算的 Mann-Whitney U 检验的扩展,对基因集进行假设检验。通过使用模拟数据集和大规模的人类流感感染时间过程基因表达数据,我们证明我们的方法具有均匀更好的接收器操作特征曲线,并且比竞争方法识别出更多与人类流感感染免疫反应相关的途径。

可用性和实施

该方法在 R 包 FUNNEL 中实现,可在以下网址免费公开获得:https://github.com/yunzhang813/FUNNEL-GSEA-R-Package。

联系人

xing_qiu@urmc.rochester.edu 或 juilee_thakar@urmc.rochester.edu。

补充信息

补充数据可在生物信息学在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/5939227/f453cdd5d21c/btx104f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/5939227/b9484fbaba74/btx104f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/5939227/617398de90db/btx104f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/5939227/a7c7831a66e4/btx104f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/5939227/ef89a2233237/btx104f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/5939227/f453cdd5d21c/btx104f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/5939227/b9484fbaba74/btx104f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/5939227/617398de90db/btx104f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/5939227/a7c7831a66e4/btx104f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/5939227/ef89a2233237/btx104f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/5939227/f453cdd5d21c/btx104f5.jpg

相似文献

1
FUNNEL-GSEA: FUNctioNal ELastic-net regression in time-course gene set enrichment analysis.FUNNEL-GSEA:时间序列基因集富集分析中的功能弹性网络回归。
Bioinformatics. 2017 Jul 1;33(13):1944-1952. doi: 10.1093/bioinformatics/btx104.
2
Combining multiple tools outperforms individual methods in gene set enrichment analyses.在基因集富集分析中,结合多种工具比单独使用方法表现更优。
Bioinformatics. 2017 Feb 1;33(3):414-424. doi: 10.1093/bioinformatics/btw623.
3
GSEApy: a comprehensive package for performing gene set enrichment analysis in Python.GSEApy:一个用于在 Python 中进行基因集富集分析的综合软件包。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac757.
4
GSEA-InContext: identifying novel and common patterns in expression experiments.GSEA-InContext:在表达实验中识别新颖和常见的模式。
Bioinformatics. 2018 Jul 1;34(13):i555-i564. doi: 10.1093/bioinformatics/bty271.
5
GO2MSIG, an automated GO based multi-species gene set generator for gene set enrichment analysis.GO2MSIG,一种基于 GO 的自动化多物种基因集生成器,用于基因集富集分析。
BMC Bioinformatics. 2014 May 17;15:146. doi: 10.1186/1471-2105-15-146.
6
Differential Gene Set Enrichment Analysis: a statistical approach to quantify the relative enrichment of two gene sets.差异基因集富集分析:一种统计方法,用于量化两个基因集的相对富集程度。
Bioinformatics. 2021 Jan 29;36(21):5247-5254. doi: 10.1093/bioinformatics/btaa658.
7
Identification and visualization of differential isoform expression in RNA-seq time series.RNA-seq 时间序列中差异异构体表达的鉴定和可视化。
Bioinformatics. 2018 Feb 1;34(3):524-526. doi: 10.1093/bioinformatics/btx578.
8
PathwaySplice: an R package for unbiased pathway analysis of alternative splicing in RNA-Seq data.PathwaySplice:一个用于 RNA-Seq 数据中可变剪接的无偏通路分析的 R 包。
Bioinformatics. 2018 Sep 15;34(18):3220-3222. doi: 10.1093/bioinformatics/bty317.
9
blitzGSEA: efficient computation of gene set enrichment analysis through gamma distribution approximation.blitzGSEA:通过伽马分布逼近实现基因集富集分析的高效计算。
Bioinformatics. 2022 Apr 12;38(8):2356-2357. doi: 10.1093/bioinformatics/btac076.
10
pulseR: Versatile computational analysis of RNA turnover from metabolic labeling experiments.pulseR:基于代谢标记实验的 RNA 周转的多功能计算分析。
Bioinformatics. 2017 Oct 15;33(20):3305-3307. doi: 10.1093/bioinformatics/btx368.

引用本文的文献

1
Alzheimer's disease: an integrative bioinformatics and machine learning analysis reveals glutamine metabolism-associated gene biomarkers.阿尔茨海默病:一项综合生物信息学和机器学习分析揭示了谷氨酰胺代谢相关基因生物标志物。
BMC Pharmacol Toxicol. 2025 Jan 28;26(1):19. doi: 10.1186/s40360-025-00852-z.
2
Purine metabolism-related genes and immunization in thyroid eye disease were validated using bioinformatics and machine learning.利用生物信息学和机器学习验证了与甲状腺眼病免疫相关的嘌呤代谢基因。
Sci Rep. 2023 Oct 26;13(1):18391. doi: 10.1038/s41598-023-45048-9.
3
SMYD3 associates with the NuRD (MTA1/2) complex to regulate transcription and promote proliferation and invasiveness in hepatocellular carcinoma cells.

本文引用的文献

1
PathCellNet: Cell-type specific pathogen-response network explorer.PathCellNet:细胞类型特异性病原体反应网络浏览器。
J Immunol Methods. 2016 Dec;439:15-22. doi: 10.1016/j.jim.2016.09.005. Epub 2016 Sep 20.
2
Diversity in Compartmental Dynamics of Gene Regulatory Networks: The Immune Response in Primary Influenza A Infection in Mice.基因调控网络区室动力学的多样性:小鼠甲型流感病毒初次感染中的免疫反应
PLoS One. 2015 Sep 28;10(9):e0138110. doi: 10.1371/journal.pone.0138110. eCollection 2015.
3
Human Dendritic Cell Response Signatures Distinguish 1918, Pandemic, and Seasonal H1N1 Influenza Viruses.
SMYD3 与 NuRD(MTA1/2)复合物结合,调节转录,促进肝癌细胞的增殖和侵袭。
BMC Biol. 2022 Dec 27;20(1):294. doi: 10.1186/s12915-022-01499-6.
4
FastMix: a versatile data integration pipeline for cell type-specific biomarker inference.FastMix:一种通用的数据集成管道,用于细胞类型特异性生物标志物推断。
Bioinformatics. 2022 Oct 14;38(20):4735-4744. doi: 10.1093/bioinformatics/btac585.
5
QPRT Acts as an Independent Prognostic Factor in Invasive Breast Cancer.QPRT是浸润性乳腺癌的独立预后因素。
J Oncol. 2022 Feb 24;2022:6548644. doi: 10.1155/2022/6548644. eCollection 2022.
6
Expression and clinical significance of in glioma.[具体物质或指标]在胶质瘤中的表达及临床意义。(原文中缺少具体要表达的内容)
Int J Clin Exp Pathol. 2021 Sep 15;14(9):938-955. eCollection 2021.
7
Enriched Random Forest for High Dimensional Genomic Data.高维基因组数据的富集随机森林。
IEEE/ACM Trans Comput Biol Bioinform. 2022 Sep-Oct;19(5):2817-2828. doi: 10.1109/TCBB.2021.3089417. Epub 2022 Oct 10.
8
A Novel Six-Gene Signature for Prognosis Prediction in Ovarian Cancer.一种用于卵巢癌预后预测的新型六基因特征
Front Genet. 2020 Oct 15;11:1006. doi: 10.3389/fgene.2020.01006. eCollection 2020.
9
High expression of is an unfavorable prognostic biomarker in T4 gastric cancer patients.在 T4 期胃癌患者中,高表达是一个不利的预后生物标志物。
World J Gastroenterol. 2019 Aug 21;25(31):4452-4467. doi: 10.3748/wjg.v25.i31.4452.
10
Epigenetic Profiles Reveal That ADCYAP1 Serves as Key Molecule in Gestational Diabetes Mellitus.表观遗传谱表明 ADCYAP1 是妊娠期糖尿病的关键分子。
Comput Math Methods Med. 2019 Aug 14;2019:6936175. doi: 10.1155/2019/6936175. eCollection 2019.
人类树突状细胞反应特征可区分1918年、大流行和季节性H1N1流感病毒。
J Virol. 2015 Oct;89(20):10190-205. doi: 10.1128/JVI.01523-15. Epub 2015 Jul 29.
4
Time-Course Gene Set Analysis for Longitudinal Gene Expression Data.纵向基因表达数据的时间进程基因集分析
PLoS Comput Biol. 2015 Jun 25;11(6):e1004310. doi: 10.1371/journal.pcbi.1004310. eCollection 2015 Jun.
5
Acute myocardial infarction activates distinct inflammation and proliferation pathways in circulating monocytes, prior to recruitment, and identified through conserved transcriptional responses in mice and humans.急性心肌梗死在循环单核细胞募集之前就激活了不同的炎症和增殖途径,并通过小鼠和人类中保守的转录反应得以确定。
Eur Heart J. 2015 Aug 1;36(29):1923-34. doi: 10.1093/eurheartj/ehv195. Epub 2015 May 16.
6
limma powers differential expression analyses for RNA-sequencing and microarray studies.limma为RNA测序和微阵列研究提供差异表达分析的动力。
Nucleic Acids Res. 2015 Apr 20;43(7):e47. doi: 10.1093/nar/gkv007. Epub 2015 Jan 20.
7
Aging-dependent alterations in gene expression and a mitochondrial signature of responsiveness to human influenza vaccination.基因表达的衰老依赖性改变以及对人流感疫苗反应的线粒体特征。
Aging (Albany NY). 2015 Jan;7(1):38-52. doi: 10.18632/aging.100720.
8
Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2.使用DESeq2对RNA测序数据的倍数变化和离散度进行适度估计。
Genome Biol. 2014;15(12):550. doi: 10.1186/s13059-014-0550-8.
9
Variable Selection in Generalized Functional Linear Models.广义泛函线性模型中的变量选择
Stat. 2013;2(1):86-103. doi: 10.1002/sta4.20.
10
Evaluation of bias-variance trade-off for commonly used post-summarizing normalization procedures in large-scale gene expression studies.大规模基因表达研究中常用的汇总后归一化程序的偏差-方差权衡评估。
PLoS One. 2014 Jun 18;9(6):e99380. doi: 10.1371/journal.pone.0099380. eCollection 2014.