精细定位和QTL组织共享信息提高了因果基因鉴定的可靠性。

Fine-mapping and QTL tissue-sharing information improves the reliability of causal gene identification.

作者信息

Barbeira Alvaro N, Melia Owen J, Liang Yanyu, Bonazzola Rodrigo, Wang Gao, Wheeler Heather E, Aguet François, Ardlie Kristin G, Wen Xiaoquan, Im Hae K

机构信息

Section of Genetic Medicine, Department of Medicine, The University of Chicago, Chicago, Illinois.

Department of Human Genetics, The University of Chicago, Chicago, Illinois.

出版信息

Genet Epidemiol. 2020 Sep 10;44(8):854-67. doi: 10.1002/gepi.22346.

DOI:10.1002/gepi.22346

PMID:32964524

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7693040/

Abstract

The integration of transcriptomic studies and genome-wide association studies (GWAS) via imputed expression has seen extensive application in recent years, enabling the functional characterization and causal gene prioritization of GWAS loci. However, the techniques for imputing transcriptomic traits from DNA variation remain underdeveloped. Furthermore, associations found when linking eQTL studies to complex traits through methods like PrediXcan can lead to false positives due to linkage disequilibrium between distinct causal variants. Therefore, the best prediction performance models may not necessarily lead to more reliable causal gene discovery. With the goal of improving discoveries without increasing false positives, we develop and compare multiple transcriptomic imputation approaches using the most recent GTEx release of expression and splicing data on 17,382 RNA-sequencing samples from 948 post-mortem donors in 54 tissues. We find that informing prediction models with posterior causal probability from fine-mapping (dap-g) and borrowing information across tissues (mashr) can lead to better performance in terms of number and proportion of significant associations that are colocalized and the proportion of silver standard genes identified as indicated by precision-recall and receiver operating characteristic curves. All prediction models are made publicly available at predictdb.org.

摘要

近年来，通过推测表达将转录组学研究与全基因组关联研究（GWAS）相结合得到了广泛应用，能够对GWAS位点进行功能表征并确定因果基因的优先级。然而，从DNA变异推测转录组特征的技术仍不发达。此外，当通过PrediXcan等方法将eQTL研究与复杂性状联系起来时，由于不同因果变异之间的连锁不平衡，发现的关联可能会导致假阳性。因此，最佳预测性能模型不一定能带来更可靠的因果基因发现。为了在不增加假阳性的情况下改进发现结果，我们使用来自54个组织中948名尸检供体的17382个RNA测序样本的最新GTEx表达和剪接数据版本，开发并比较了多种转录组推测方法。我们发现，利用精细定位（dap - g）的后验因果概率为预测模型提供信息并跨组织借用信息（mashr），在共定位的显著关联的数量和比例以及精确召回率和受试者工作特征曲线所表明的被鉴定为银标准基因的比例方面，可以带来更好的性能。所有预测模型均可在predictdb.org上公开获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9bb7/7693040/4d4ce364f6c6/GEPI-44-854-g001.jpg

相似文献

Fine-mapping and QTL tissue-sharing information improves the reliability of causal gene identification.精细定位和QTL组织共享信息提高了因果基因鉴定的可靠性。

Genet Epidemiol. 2020 Sep 10;44(8):854-67. doi: 10.1002/gepi.22346.

Estimating colocalization probability from limited summary statistics.从有限的汇总统计数据中估计共定位概率。

BMC Bioinformatics. 2021 May 17;22(1):254. doi: 10.1186/s12859-021-04170-z.

Quantifying the mapping precision of genome-wide association studies using whole-genome sequencing data.利用全基因组测序数据量化全基因组关联研究的定位精度

Genome Biol. 2017 May 16;18(1):86. doi: 10.1186/s13059-017-1216-0.

TIGAR: An Improved Bayesian Tool for Transcriptomic Data Imputation Enhances Gene Mapping of Complex Traits.TIGAR：一种改进的转录组数据插补贝叶斯工具，可增强复杂性状的基因定位。

Am J Hum Genet. 2019 Aug 1;105(2):258-266. doi: 10.1016/j.ajhg.2019.05.018. Epub 2019 Jun 20.

Evaluation of PrediXcan for prioritizing GWAS associations and predicting gene expression.使用PrediXcan对全基因组关联研究（GWAS）关联进行优先级排序并预测基因表达的评估。

Pac Symp Biocomput. 2018;23:448-459.

Accuracy of genome-wide imputation of untyped markers and impacts on statistical power for association studies.未分型标记的全基因组推断准确性及其对关联研究统计效能的影响。

BMC Genet. 2009 Jun 16;10:27. doi: 10.1186/1471-2156-10-27.

Limits on the reproducibility of marker associations with southern leaf blight resistance in the maize nested association mapping population.玉米巢式关联作图群体中与南方叶枯病抗性相关标记关联的可重复性限制

BMC Genomics. 2014 Dec 5;15(1):1068. doi: 10.1186/1471-2164-15-1068.

Integrating eQTL and GWAS data characterises established and identifies novel migraine risk loci.整合 eQTL 和 GWAS 数据可阐明已确立的和识别新的偏头痛风险基因座。

Hum Genet. 2023 Aug;142(8):1113-1137. doi: 10.1007/s00439-023-02568-8. Epub 2023 May 28.

Comparison of two multi-trait association testing methods and sequence-based fine mapping of six additive QTL in Swiss Large White pigs.比较两种多性状关联测试方法和瑞士大白猪六个加性 QTL 的基于序列的精细定位。

BMC Genomics. 2023 Apr 10;24(1):192. doi: 10.1186/s12864-023-09295-4.

Prioritization of causal genes from genome-wide association studies by Bayesian data integration across loci.通过跨基因座的贝叶斯数据整合从全基因组关联研究中确定因果基因的优先级。

PLoS Comput Biol. 2025 Jan 7;21(1):e1012725. doi: 10.1371/journal.pcbi.1012725. eCollection 2025 Jan.

引用本文的文献

A Genome-Wide Association Study of Anti-Müllerian Hormone (AMH) Levels in Samoan Women.萨摩亚女性抗苗勒管激素（AMH）水平的全基因组关联研究。

Genes (Basel). 2025 Jun 30;16(7):793. doi: 10.3390/genes16070793.

Common variation in meiosis genes shapes human recombination phenotypes and aneuploidy risk.减数分裂基因的常见变异塑造了人类重组表型和非整倍体风险。

medRxiv. 2025 Apr 4:2025.04.02.25325097. doi: 10.1101/2025.04.02.25325097.

Genome-wide association meta-analyses of drug-resistant epilepsy.耐药性癫痫的全基因组关联荟萃分析。

EBioMedicine. 2025 May;115:105675. doi: 10.1016/j.ebiom.2025.105675. Epub 2025 Apr 15.

An atlas of single-cell eQTLs dissects autoimmune disease genes and identifies novel drug classes for treatment.单细胞eQTL图谱剖析自身免疫性疾病基因并确定新型治疗药物类别。

Cell Genom. 2025 Apr 9;5(4):100820. doi: 10.1016/j.xgen.2025.100820. Epub 2025 Mar 27.

Unraveling the genetic landscape of susceptibility to multiple primary cancers.解析多种原发性癌症易感性的遗传图谱。

HGG Adv. 2025 Apr 10;6(2):100413. doi: 10.1016/j.xhgg.2025.100413. Epub 2025 Feb 4.

Transferability of Single- and Cross-Tissue Transcriptome Imputation Models Across Ancestry Groups.单组织和跨组织转录组插补模型在不同祖先群体间的可转移性

Genet Epidemiol. 2025 Jan;49(1):e22611. doi: 10.1002/gepi.22611.

Multiomic integration analysis identifies atherogenic metabolites mediating between novel immune genes and cardiovascular risk.多组学整合分析确定了在新型免疫基因与心血管风险之间起作用的动脉粥样硬化代谢物。

Genome Med. 2024 Oct 24;16(1):122. doi: 10.1186/s13073-024-01397-2.

Integrative Multi-Omics Approach for Improving Causal Gene Identification.用于改进因果基因识别的整合多组学方法

Genet Epidemiol. 2025 Jan;49(1):e22601. doi: 10.1002/gepi.22601. Epub 2024 Oct 23.

Association of Genetically Predicted Skipping of COL4A4 Exon 27 with Hematuria and Albuminuria.基因预测的COL4A4外显子27跳跃与血尿和蛋白尿的关联。

J Am Soc Nephrol. 2025 Jan 1;36(1):48-59. doi: 10.1681/ASN.0000000000000480. Epub 2024 Aug 27.

DHFS-ECM: Design of a Dual Heuristic Feature Selection-based Ensemble Classification Model for the Identification of Bamboo Species from Genomic Sequences.DHFS-ECM：基于双重启发式特征选择的集成分类模型设计，用于从基因组序列中识别竹种

Curr Genomics. 2024 May 31;25(3):185-201. doi: 10.2174/0113892029268176240125055419. Epub 2024 Feb 1.

本文引用的文献

A simple new approach to variable selection in regression, with application to genetic fine mapping.一种用于回归中变量选择的简单新方法及其在基因精细定位中的应用。

J R Stat Soc Series B Stat Methodol. 2020 Dec;82(5):1273-1300. doi: 10.1111/rssb.12388. Epub 2020 Jul 10.

The GTEx Consortium atlas of genetic regulatory effects across human tissues.GTEx 联盟人类组织遗传调控效应图谱

Science. 2020 Sep 11;369(6509):1318-1330. doi: 10.1126/science.aaz1776.

Integrative transcriptome imputation reveals tissue-specific and shared biological mechanisms mediating susceptibility to complex traits.整合转录组推断揭示了介导复杂性状易感性的组织特异性和共享生物学机制。

Nat Commun. 2019 Aug 23;10(1):3834. doi: 10.1038/s41467-019-11874-7.

Exome sequencing of Finnish isolates enhances rare-variant association power.芬兰分离株外显子组测序增强罕见变异关联能力。

Nature. 2019 Aug;572(7769):323-328. doi: 10.1038/s41586-019-1457-z. Epub 2019 Jul 31.

Opportunities and challenges for transcriptome-wide association studies.全转录组关联研究的机遇与挑战。

Nat Genet. 2019 Apr;51(4):592-599. doi: 10.1038/s41588-019-0385-z. Epub 2019 Mar 29.

Gene expression imputation across multiple brain regions provides insights into schizophrenia risk.跨多个脑区的基因表达推断为精神分裂症风险提供了线索。

Nat Genet. 2019 Apr;51(4):659-674. doi: 10.1038/s41588-019-0364-4. Epub 2019 Mar 25.

A statistical framework for cross-tissue transcriptome-wide association analysis.跨组织转录组全基因组关联分析的统计框架。

Nat Genet. 2019 Mar;51(3):568-576. doi: 10.1038/s41588-019-0345-7. Epub 2019 Feb 25.

Integrating predicted transcriptome from multiple tissues improves association detection.整合来自多个组织的预测转录组可提高关联检测。

PLoS Genet. 2019 Jan 22;15(1):e1007889. doi: 10.1371/journal.pgen.1007889. eCollection 2019 Jan.

Flexible statistical methods for estimating and testing effects in genomic studies with multiple conditions.具有多种条件的基因组研究中估计和检验效应的灵活统计方法。

Nat Genet. 2019 Jan;51(1):187-195. doi: 10.1038/s41588-018-0268-8. Epub 2018 Nov 26.

OMIM.org: leveraging knowledge across phenotype-gene relationships.OMIM.org：利用表型-基因关系中的知识。

Nucleic Acids Res. 2019 Jan 8;47(D1):D1038-D1043. doi: 10.1093/nar/gky1151.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

精细定位和QTL组织共享信息提高了因果基因鉴定的可靠性。

Fine-mapping and QTL tissue-sharing information improves the reliability of causal gene identification.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献