克服空间分辨转录组脑图谱数据分析中基因类别富集的假阳性。

Overcoming false-positive gene-category enrichment in the analysis of spatially resolved transcriptomic brain atlas data.

机构信息

School of Physics, The University of Sydney, Camperdown, NSW, Australia.

The Turner Institute for Brain and Mental Health, School of Psychological Sciences and Monash Biomedical Imaging, Monash University, Clayton, VIC, Australia.

出版信息

Nat Commun. 2021 May 11;12(1):2669. doi: 10.1038/s41467-021-22862-1.

DOI:10.1038/s41467-021-22862-1

PMID:33976144

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8113439/

Abstract

Transcriptomic atlases have improved our understanding of the correlations between gene-expression patterns and spatially varying properties of brain structure and function. Gene-category enrichment analysis (GCEA) is a common method to identify functional gene categories that drive these associations, using gene-to-category annotation systems like the Gene Ontology (GO). Here, we show that applying standard GCEA methodology to spatial transcriptomic data is affected by substantial false-positive bias, with GO categories displaying an over 500-fold average inflation of false-positive associations with random neural phenotypes in mouse and human. The estimated false-positive rate of a GO category is associated with its rate of being reported as significantly enriched in the literature, suggesting that published reports are affected by this false-positive bias. We show that within-category gene-gene coexpression and spatial autocorrelation are key drivers of the false-positive bias and introduce flexible ensemble-based null models that can account for these effects, made available as a software toolbox.

摘要

转录组图谱提高了我们对基因表达模式与大脑结构和功能的空间变化特性之间相关性的理解。基因类别富集分析（GCEA）是一种常用的方法，用于识别驱动这些关联的功能基因类别，使用基因到类别注释系统，如基因本体论（GO）。在这里，我们表明，将标准 GCEA 方法应用于空间转录组数据会受到大量假阳性偏差的影响，GO 类别与随机神经表型的假阳性关联的平均膨胀超过 500 倍，在小鼠和人类中。GO 类别的估计假阳性率与其在文献中被报道为显著富集的频率相关，这表明已发表的报告受到这种假阳性偏差的影响。我们表明，类别内基因-基因共表达和空间自相关是假阳性偏差的关键驱动因素，并引入了灵活的基于集成的零模型，可以解释这些影响，作为软件工具箱提供。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a1be/8113439/1f41f1c4225d/41467_2021_22862_Fig1_HTML.jpg

相似文献

Overcoming false-positive gene-category enrichment in the analysis of spatially resolved transcriptomic brain atlas data.克服空间分辨转录组脑图谱数据分析中基因类别富集的假阳性。

Nat Commun. 2021 May 11;12(1):2669. doi: 10.1038/s41467-021-22862-1.

WormCat: An Online Tool for Annotation and Visualization of Genome-Scale Data.WormCat：一种用于基因组规模数据注释和可视化的在线工具。

Genetics. 2020 Feb;214(2):279-294. doi: 10.1534/genetics.119.302919. Epub 2019 Dec 6.

GO FEAT: a rapid web-based functional annotation tool for genomic and transcriptomic data.GO FEAT：一个快速的基于网络的基因组和转录组数据功能注释工具。

Sci Rep. 2018 Jan 29;8(1):1794. doi: 10.1038/s41598-018-20211-9.

Empirical Bayes estimation of posterior probabilities of enrichment: a comparative study of five estimators of the local false discovery rate.经验贝叶斯估计富集后验概率：局部错误发现率五个估计量的比较研究。

BMC Bioinformatics. 2013 Mar 6;14:87. doi: 10.1186/1471-2105-14-87.

Identification of protein features encoded by alternative exons using Exon Ontology.使用外显子本体论鉴定由可变外显子编码的蛋白质特征。

Genome Res. 2017 Jun;27(6):1087-1097. doi: 10.1101/gr.212696.116. Epub 2017 Apr 18.

Gene Ontology Enrichment Improves Performances of Functional Similarity of Genes.基因本体论富集提高了基因功能相似性的性能。

Sci Rep. 2018 Aug 14;8(1):12100. doi: 10.1038/s41598-018-30455-0.

Novel comparison of evaluation metrics for gene ontology classifiers reveals drastic performance differences.新型基因本体论分类器评估指标比较揭示出显著的性能差异。

PLoS Comput Biol. 2019 Nov 4;15(11):e1007419. doi: 10.1371/journal.pcbi.1007419. eCollection 2019 Nov.

Large-scale inference of gene function through phylogenetic annotation of Gene Ontology terms: case study of the apoptosis and autophagy cellular processes.通过基因本体术语的系统发育注释对基因功能进行大规模推断：细胞凋亡和自噬细胞过程的案例研究

Database (Oxford). 2016 Dec 26;2016. doi: 10.1093/database/baw155. Print 2016.

A Gene Ontology Tutorial in Python.Python 中的基因本体论教程。

Methods Mol Biol. 2017;1446:221-229. doi: 10.1007/978-1-4939-3743-1_16.

Gene Ontology: Pitfalls, Biases, and Remedies.基因本体论：陷阱、偏差与补救措施

Methods Mol Biol. 2017;1446:189-205. doi: 10.1007/978-1-4939-3743-1_14.

引用本文的文献

A Critical Evaluation of Background Gene Omission in Imaging Transcriptomics.成像转录组学中背景基因遗漏的批判性评估

Biol Psychiatry Glob Open Sci. 2025 Jul 18;5(6):100568. doi: 10.1016/j.bpsgos.2025.100568. eCollection 2025 Nov.

Gene transcription, neurotransmitter, and neurocognition signatures of brain structural-functional coupling variability.脑结构-功能耦合变异性的基因转录、神经递质和神经认知特征

Nat Commun. 2025 Aug 15;16(1):7623. doi: 10.1038/s41467-025-63000-5.

Network spreading and local biological vulnerability in amyotrophic lateral sclerosis.肌萎缩侧索硬化症中的网络传播与局部生物易损性

Commun Biol. 2025 Aug 4;8(1):1153. doi: 10.1038/s42003-025-08561-3.

Mapping cerebral blood perfusion and its links to multi-scale brain organization across the human lifespan.绘制全人类生命周期内的脑血流灌注及其与多尺度脑组织的联系。

PLoS Biol. 2025 Jul 29;23(7):e3003277. doi: 10.1371/journal.pbio.3003277. eCollection 2025 Jul.

Genetic foundations of interindividual neurophysiological variability.个体间神经生理变异性的遗传基础。

Sci Adv. 2025 Jul 25;11(30):eads7544. doi: 10.1126/sciadv.ads7544. Epub 2025 Jul 23.

Transcriptomic decoding of surface-based imaging phenotypes and its application to pharmacotranscriptomics.基于表面成像表型的转录组学解码及其在药物转录组学中的应用。

Nat Commun. 2025 Jul 22;16(1):6727. doi: 10.1038/s41467-025-61927-3.

Neuroimaging and biological markers of different paretic hand outcomes after stroke.中风后不同偏瘫手功能结局的神经影像学和生物学标志物

J Neuroeng Rehabil. 2025 Jul 5;22(1):150. doi: 10.1186/s12984-025-01682-0.

Resting-State Brain Activity Changes and Their Genetic Correlates in Mild Traumatic Brain Injury.轻度创伤性脑损伤静息态脑活动变化及其遗传相关性

Hum Brain Mapp. 2025 Jun 1;46(8):e70259. doi: 10.1002/hbm.70259.

Neurogenetic phenotypes of learning-dependent plasticity for improved perceptual decisions.用于改善感知决策的学习依赖性可塑性的神经遗传表型。

Commun Biol. 2025 May 21;8(1):779. doi: 10.1038/s42003-025-08212-7.

Molecular mechanisms explaining sex-specific functional connectivity changes in chronic insomnia disorder.解释慢性失眠障碍中性别特异性功能连接变化的分子机制。

BMC Med. 2025 May 6;23(1):261. doi: 10.1186/s12916-025-04089-9.

本文引用的文献

Cellular correlates of cortical thinning throughout the lifespan.全生命周期大脑皮层变薄的细胞相关性。

Sci Rep. 2020 Dec 11;10(1):21803. doi: 10.1038/s41598-020-78471-3.

Benchmarking of cell type deconvolution pipelines for transcriptomics data.基于转录组数据的细胞类型去卷积分析流水线的基准测试

Nat Commun. 2020 Nov 6;11(1):5650. doi: 10.1038/s41467-020-19015-1.

Transcriptomic and cellular decoding of regional brain vulnerability to neurogenetic disorders.转录组学和细胞水平揭示区域大脑对神经遗传疾病的易损性

Nat Commun. 2020 Jul 3;11(1):3358. doi: 10.1038/s41467-020-17051-5.

Generative modeling of brain maps with spatial autocorrelation.基于空间自相关的脑图谱生成模型。

Neuroimage. 2020 Oct 15;220:117038. doi: 10.1016/j.neuroimage.2020.117038. Epub 2020 Jun 22.

Transcriptional and imaging-genetic association of cortical interneurons, brain function, and schizophrenia risk.皮质中间神经元的转录和影像遗传学关联、大脑功能与精神分裂症风险。

Nat Commun. 2020 Jun 8;11(1):2889. doi: 10.1038/s41467-020-16710-x.

Common neural and transcriptional correlates of inhibitory control underlie emotion regulation and memory control.抑制控制所共有的神经和转录相关性为情绪调节和记忆控制提供了基础。

Soc Cogn Affect Neurosci. 2020 Jul 1;15(5):523-536. doi: 10.1093/scan/nsaa073.

Statistical analysis of spatial expression patterns for spatially resolved transcriptomic studies.空间分辨转录组学研究中空间表达模式的统计分析。

Nat Methods. 2020 Feb;17(2):193-200. doi: 10.1038/s41592-019-0701-7. Epub 2020 Jan 27.

BAGSE: a Bayesian hierarchical model approach for gene set enrichment analysis.BAGSE：一种用于基因集富集分析的贝叶斯分层模型方法。

Bioinformatics. 2020 Mar 1;36(6):1689-1695. doi: 10.1093/bioinformatics/btz831.

Discovering Conserved Properties of Brain Organization Through Multimodal Integration and Interspecies Comparison.通过多模态整合和种间比较发现大脑组织的保守特性。

J Exp Neurosci. 2019 Jul 9;13:1179069519862047. doi: 10.1177/1179069519862047. eCollection 2019.

Effective degrees of freedom of the Pearson's correlation coefficient under autocorrelation.自相关下皮尔逊相关系数的有效自由度。

Neuroimage. 2019 Oct 1;199:609-625. doi: 10.1016/j.neuroimage.2019.05.011. Epub 2019 May 31.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

克服空间分辨转录组脑图谱数据分析中基因类别富集的假阳性。

Overcoming false-positive gene-category enrichment in the analysis of spatially resolved transcriptomic brain atlas data.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献