• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

比较已编辑基因集与基于转录组学的基因特征在检测免疫细胞中通路激活的应用。

A comparison of curated gene sets versus transcriptomics-derived gene signatures for detecting pathway activation in immune cells.

机构信息

Hannover Medical School, Biomedical Research in Endstage and Obstructive Lung Disease Hannover (BREATH), German Center for Lung Research, Carl-Neuberg-Straße, Hannover, 30625, Germany.

Institute of Technical Chemistry, Leibniz University of Hannover, Callinstraße 5, Hannover, 30167, Germany.

出版信息

BMC Bioinformatics. 2020 Jan 28;21(1):28. doi: 10.1186/s12859-020-3366-4.

DOI:10.1186/s12859-020-3366-4
PMID:31992182
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6986093/
Abstract

BACKGROUND

Despite the significant contribution of transcriptomics to the fields of biological and biomedical research, interpreting long lists of significantly differentially expressed genes remains a challenging step in the analysis process. Gene set enrichment analysis is a standard approach for summarizing differentially expressed genes into pathways or other gene groupings. Here, we explore an alternative approach to utilizing gene sets from curated databases. We examine the method of deriving custom gene sets which may be relevant to a given experiment using reference data sets from previous transcriptomics studies. We call these data-derived gene sets, "gene signatures" for the biological process tested in the previous study. We focus on the feasibility of this approach in analyzing immune-related processes, which are complicated in their nature but play an important role in the medical research.

RESULTS

We evaluate several statistical approaches to detecting the activity of a gene signature in a target data set. We compare the performance of the data-derived gene signature approach with comparable GO term gene sets across all of the statistical tests. A total of 61 differential expression comparisons generated from 26 transcriptome experiments were included in the analysis. These experiments covered eight immunological processes in eight types of leukocytes. The data-derived signatures were used to detect the presence of immunological processes in the test data with modest accuracy (AUC = 0.67). The performance for GO and literature based gene sets was worse (AUC = 0.59). Both approaches were plagued by poor specificity.

CONCLUSIONS

When investigators seek to test specific hypotheses, the data-derived signature approach can perform as well, if not better than standard gene-set based approaches for immunological signatures. Furthermore, the data-derived signatures can be generated in the cases that well-defined gene sets are lacking from pathway databases and also offer the opportunity for defining signatures in a cell-type specific manner. However, neither the data-derived signatures nor standard gene-sets can be demonstrated to reliably provide negative predictions for negative cases. We conclude that the data-derived signature approach is a useful and sometimes necessary tool, but analysts should be weary of false positives.

摘要

背景

尽管转录组学在生物和生物医学研究领域做出了重要贡献,但解释大量显著差异表达基因仍然是分析过程中的一个挑战。基因集富集分析是一种将差异表达基因总结为途径或其他基因分组的标准方法。在这里,我们探索了一种利用来自已验证数据库的基因集的替代方法。我们研究了一种从先前转录组学研究的参考数据集推导出与特定实验相关的自定义基因集的方法。我们将这些从数据中推导出的基因集称为先前研究中测试的生物学过程的“基因特征”。我们专注于这种方法在分析免疫相关过程中的可行性,这些过程在性质上很复杂,但在医学研究中起着重要作用。

结果

我们评估了几种用于在目标数据集检测基因特征活性的统计方法。我们比较了数据派生基因特征方法与所有统计检验中可比的 GO 术语基因集的性能。从 26 个转录组实验中总共生成了 61 个差异表达比较,这些实验涵盖了八种白细胞中的八种免疫过程。使用数据派生的特征以适度的准确性(AUC = 0.67)来检测测试数据中免疫过程的存在。GO 和基于文献的基因集的性能更差(AUC = 0.59)。这两种方法都存在特异性差的问题。

结论

当研究人员寻求测试特定假设时,数据派生的特征方法可以与基于标准基因集的方法一样,甚至更好地用于免疫特征。此外,在缺乏途径数据库中定义明确的基因集的情况下,可以生成数据派生的特征,并且还提供了以细胞类型特异性方式定义特征的机会。然而,无论是数据派生的特征还是标准基因集都不能可靠地为阴性病例提供可靠的阴性预测。我们得出的结论是,数据派生的特征方法是一种有用的且有时是必要的工具,但分析人员应该警惕假阳性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8770/6986093/f8285ada5209/12859_2020_3366_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8770/6986093/eed4f3eb83da/12859_2020_3366_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8770/6986093/e76231856d8d/12859_2020_3366_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8770/6986093/4f5abd7ab459/12859_2020_3366_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8770/6986093/f8285ada5209/12859_2020_3366_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8770/6986093/eed4f3eb83da/12859_2020_3366_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8770/6986093/e76231856d8d/12859_2020_3366_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8770/6986093/4f5abd7ab459/12859_2020_3366_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8770/6986093/f8285ada5209/12859_2020_3366_Fig4_HTML.jpg

相似文献

1
A comparison of curated gene sets versus transcriptomics-derived gene signatures for detecting pathway activation in immune cells.比较已编辑基因集与基于转录组学的基因特征在检测免疫细胞中通路激活的应用。
BMC Bioinformatics. 2020 Jan 28;21(1):28. doi: 10.1186/s12859-020-3366-4.
2
ADAGE signature analysis: differential expression analysis with data-defined gene sets.ADAGE特征分析:使用数据定义的基因集进行差异表达分析。
BMC Bioinformatics. 2017 Nov 22;18(1):512. doi: 10.1186/s12859-017-1905-4.
3
Muscle Gene Sets: a versatile methodological aid to functional genomics in the neuromuscular field.肌肉基因集:神经肌肉领域功能基因组学的多功能方法学辅助工具。
Skelet Muscle. 2019 May 3;9(1):10. doi: 10.1186/s13395-019-0196-z.
4
GeneTopics--interpretation of gene sets via literature-driven topic models.基因主题——通过文献驱动的主题模型对基因集进行解读。
BMC Syst Biol. 2013;7 Suppl 5(Suppl 5):S10. doi: 10.1186/1752-0509-7-S5-S10. Epub 2013 Dec 9.
5
Facilitating systems-level analyses of all-cause and Covid-mediated sepsis through SeptiSearch, a manually-curated compendium of dysregulated gene sets.通过 SeptiSearch 促进全因和新冠介导的败血症的系统水平分析,这是一个经过人工整理的失调基因集汇编。
Front Immunol. 2023 May 26;14:1135859. doi: 10.3389/fimmu.2023.1135859. eCollection 2023.
6
Gene expression analysis in clear cell renal cell carcinoma using gene set enrichment analysis for biostatistical management.基于基因集富集分析的 clear cell 肾细胞癌基因表达分析用于生物统计学管理。
BJU Int. 2011 Jul;108(2 Pt 2):E29-35. doi: 10.1111/j.1464-410X.2010.09794.x. Epub 2011 Mar 16.
7
DiSignAtlas: an atlas of human and mouse disease signatures based on bulk and single-cell transcriptomics.DiSignAtlas:基于批量和单细胞转录组学的人类和小鼠疾病特征图谱。
Nucleic Acids Res. 2024 Jan 5;52(D1):D1236-D1245. doi: 10.1093/nar/gkad961.
8
Comparison of gene set scoring methods for reproducible evaluation of tuberculosis gene signatures.比较基因集评分方法,以实现结核病基因特征的可重现性评估。
BMC Infect Dis. 2024 Jun 20;24(1):610. doi: 10.1186/s12879-024-09457-z.
9
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
10
Identifying and optimizing human endometrial gene expression signatures for endometrial dating.识别并优化用于子宫内膜分期的人类子宫内膜基因表达特征。
Hum Reprod. 2022 Jan 28;37(2):284-296. doi: 10.1093/humrep/deab262.

引用本文的文献

1
Evaluating adaptive stress response gene signatures using transcriptomics.利用转录组学评估适应性应激反应基因特征
Comput Toxicol. 2021 Nov 1;20:1-9. doi: 10.1016/j.comtox.2021.100179.
2
Facilitating systems-level analyses of all-cause and Covid-mediated sepsis through SeptiSearch, a manually-curated compendium of dysregulated gene sets.通过 SeptiSearch 促进全因和新冠介导的败血症的系统水平分析,这是一个经过人工整理的失调基因集汇编。
Front Immunol. 2023 May 26;14:1135859. doi: 10.3389/fimmu.2023.1135859. eCollection 2023.
3
Latent Variables Capture Pathway-Level Points of Departure in High-Throughput Toxicogenomic Data.

本文引用的文献

1
Gene expression profile of human T cells following a single stimulation of peripheral blood mononuclear cells with anti-CD3 antibodies.人外周血单个核细胞经抗 CD3 抗体单次刺激后 T 细胞的基因表达谱。
BMC Genomics. 2019 Jul 19;20(1):593. doi: 10.1186/s12864-019-5967-8.
2
Human Eosinophils Express a Distinct Gene Expression Program in Response to IL-3 Compared with Common β-Chain Cytokines IL-5 and GM-CSF.人类嗜酸性粒细胞在响应 IL-3 时表达与常见的β-链细胞因子 IL-5 和 GM-CSF 不同的基因表达谱。
J Immunol. 2019 Jul 15;203(2):329-337. doi: 10.4049/jimmunol.1801668. Epub 2019 Jun 7.
3
Radiogenomic Analysis of F-18-Fluorodeoxyglucose Positron Emission Tomography and Gene Expression Data Elucidates the Epidemiological Complexity of Colorectal Cancer Landscape.
潜在变量捕获高通量毒理基因组学数据中的途径起始点。
Chem Res Toxicol. 2022 Apr 18;35(4):670-683. doi: 10.1021/acs.chemrestox.1c00444. Epub 2022 Mar 25.
4
Derivation and Application of Molecular Signatures to Prostate Cancer: Opportunities and Challenges.分子特征在前列腺癌中的推导与应用:机遇与挑战
Cancers (Basel). 2021 Jan 28;13(3):495. doi: 10.3390/cancers13030495.
F-18-氟脱氧葡萄糖正电子发射断层扫描与基因表达数据的放射基因组学分析揭示了结直肠癌格局的流行病学复杂性。
Comput Struct Biotechnol J. 2019 Jan 25;17:177-185. doi: 10.1016/j.csbj.2019.01.007. eCollection 2019.
4
Quantitative Proteomics Reveals the Dynamic Protein Landscape during Initiation of Human Th17 Cell Polarization.定量蛋白质组学揭示人类Th17细胞极化起始过程中的动态蛋白质图谱。
iScience. 2019 Jan 25;11:334-355. doi: 10.1016/j.isci.2018.12.020. Epub 2018 Dec 26.
5
Transcript- and protein-level analyses of the response of human eosinophils to glucocorticoids.人类嗜酸性粒细胞对糖皮质激素反应的转录组和蛋白质组分析。
Sci Data. 2018 Dec 4;5:180275. doi: 10.1038/sdata.2018.275.
6
PANTHER version 14: more genomes, a new PANTHER GO-slim and improvements in enrichment analysis tools.PANTHER 版本 14:更多基因组、一个新的 PANTHER GO-slim 和富集分析工具的改进。
Nucleic Acids Res. 2019 Jan 8;47(D1):D419-D426. doi: 10.1093/nar/gky1038.
7
ArrayExpress update - from bulk to single-cell expression data.ArrayExpress 更新——从批量到单细胞表达数据。
Nucleic Acids Res. 2019 Jan 8;47(D1):D711-D715. doi: 10.1093/nar/gky964.
8
Impact of Aging and HIV Infection on the Function of the C-Type Lectin Receptor MINCLE in Monocytes.衰老和 HIV 感染对单核细胞 C 型凝集素受体 MINCLE 功能的影响。
J Gerontol A Biol Sci Med Sci. 2019 May 16;74(6):794-801. doi: 10.1093/gerona/gly209.
9
Germline-activating mutations in compromise B cell development and function.胚系激活突变会影响 B 细胞的发育和功能。
J Exp Med. 2018 Aug 6;215(8):2073-2095. doi: 10.1084/jem.20180010. Epub 2018 Jul 17.
10
Glucocorticoid-induced eosinopenia in humans can be linked to early transcriptional events.人类中糖皮质激素诱导的嗜酸性粒细胞减少可能与早期转录事件有关。
Allergy. 2018 Oct;73(10):2076-2079. doi: 10.1111/all.13497. Epub 2018 Jul 17.