• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用公开的汇总水平数据,通过跨多个数据集的基因型进行转录组元推断。

Meta-imputation of transcriptome from genotypes across multiple datasets by leveraging publicly available summary-level data.

机构信息

Department of Biostatistics and Center for Statistical Genetics, University of Michigan, Ann Arbor, Michigan, United States of America.

出版信息

PLoS Genet. 2022 Jan 31;18(1):e1009571. doi: 10.1371/journal.pgen.1009571. eCollection 2022 Jan.

DOI:10.1371/journal.pgen.1009571
PMID:35100255
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8830793/
Abstract

Transcriptome wide association studies (TWAS) can be used as a powerful method to identify and interpret the underlying biological mechanisms behind GWAS by mapping gene expression levels with phenotypes. In TWAS, gene expression is often imputed from individual-level genotypes of regulatory variants identified from external resources, such as Genotype-Tissue Expression (GTEx) Project. In this setting, a straightforward approach to impute expression levels of a specific tissue is to use the model trained from the same tissue type. When multiple tissues are available for the same subjects, it has been demonstrated that training imputation models from multiple tissue types improves the accuracy because of shared eQTLs between the tissues and increase in effective sample size. However, existing joint-tissue methods require access of genotype and expression data across all tissues. Moreover, they cannot leverage the abundance of various expression datasets across various tissues for non-overlapping individuals. Here, we explore the optimal way to combine imputed levels across training models from multiple tissues and datasets in a flexible manner using summary-level data. Our proposed method (SWAM) combines arbitrary number of transcriptome imputation models to linearly optimize the imputation accuracy given a target tissue. By integrating models across tissues and/or individuals, SWAM can improve the accuracy of transcriptome imputation or to improve power to TWAS while only requiring individual-level data from a single reference cohort. To evaluate the accuracy of SWAM, we combined 49 tissue-specific gene expression imputation models from the GTEx Project as well as from a large eQTL study of Depression Susceptibility Genes and Networks (DGN) Project and tested imputation accuracy in GEUVADIS lymphoblastoid cell lines samples. We also extend our meta-imputation method to meta-TWAS to leverage multiple tissues in TWAS analysis with summary-level statistics. Our results capitalize on the importance of integrating multiple tissues to unravel regulatory impacts of genetic variants on complex traits.

摘要

转录组全基因组关联研究(TWAS)可以通过将基因表达水平与表型相关联,作为一种强大的方法来识别和解释 GWAS 背后的潜在生物学机制。在 TWAS 中,基因表达通常是通过从外部资源(如基因型-组织表达(GTEx)项目)中识别的调控变体的个体水平基因型来推断的。在这种情况下,推断特定组织表达水平的一种直接方法是使用来自同一组织类型的模型进行训练。当同一个个体有多个组织可用时,已经证明从多个组织类型训练推断模型可以提高准确性,因为组织之间存在共享的 eQTL,并且有效样本量增加。然而,现有的联合组织方法需要获得所有组织的基因型和表达数据。此外,它们不能利用各种组织中不同个体的大量表达数据集来进行非重叠个体的推断。在这里,我们探索了一种灵活的方法,通过汇总数据以最优的方式结合来自多个组织和数据集的训练模型的推断水平。我们提出的方法(SWAM)结合了任意数量的转录组推断模型,以在线性方式优化给定目标组织的推断准确性。通过整合跨组织和/或个体的模型,SWAM 可以提高转录组推断的准确性,或者在仅需要单个参考队列的个体水平数据的情况下,提高 TWAS 的功效。为了评估 SWAM 的准确性,我们结合了 GTEx 项目以及抑郁易感性基因和网络(DGN)项目的大型 eQTL 研究中 49 个组织特异性基因表达推断模型,并在 GEUVADIS 淋巴母细胞系样本中测试了推断准确性。我们还将我们的元推断方法扩展到元 TWAS 中,以利用汇总统计信息在 TWAS 分析中结合多个组织。我们的结果利用了整合多个组织的重要性,以揭示遗传变异对复杂性状的调节影响。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c47/8830793/9b8b18686fab/pgen.1009571.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c47/8830793/7e74d43a0d06/pgen.1009571.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c47/8830793/8cfb28073980/pgen.1009571.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c47/8830793/49169c168c13/pgen.1009571.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c47/8830793/9b8b18686fab/pgen.1009571.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c47/8830793/7e74d43a0d06/pgen.1009571.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c47/8830793/8cfb28073980/pgen.1009571.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c47/8830793/49169c168c13/pgen.1009571.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c47/8830793/9b8b18686fab/pgen.1009571.g004.jpg

相似文献

1
Meta-imputation of transcriptome from genotypes across multiple datasets by leveraging publicly available summary-level data.利用公开的汇总水平数据,通过跨多个数据集的基因型进行转录组元推断。
PLoS Genet. 2022 Jan 31;18(1):e1009571. doi: 10.1371/journal.pgen.1009571. eCollection 2022 Jan.
2
Tissue specific regulation of transcription in endometrium and association with disease.子宫内膜中转录的组织特异性调节及其与疾病的关联。
Hum Reprod. 2020 Feb 29;35(2):377-393. doi: 10.1093/humrep/dez279.
3
Bayesian genome-wide TWAS with reference transcriptomic data of brain and blood tissues identified 141 risk genes for Alzheimer's disease dementia.基于大脑和血液组织参考转录组数据的贝叶斯全基因组 TWAS 鉴定出 141 个阿尔茨海默病痴呆风险基因。
Alzheimers Res Ther. 2024 Jun 1;16(1):120. doi: 10.1186/s13195-024-01488-7.
4
Statistical power of transcriptome-wide association studies.转录组关联研究的统计功效。
Genet Epidemiol. 2022 Dec;46(8):572-588. doi: 10.1002/gepi.22491. Epub 2022 Jun 29.
5
Leveraging expression from multiple tissues using sparse canonical correlation analysis and aggregate tests improves the power of transcriptome-wide association studies.利用稀疏典型相关分析和综合检验从多个组织中获取表达信息,可提高全转录组关联研究的效能。
PLoS Genet. 2021 Apr 8;17(4):e1008973. doi: 10.1371/journal.pgen.1008973. eCollection 2021 Apr.
6
Influence of tissue context on gene prioritization for predicted transcriptome-wide association studies.组织背景对预测的全转录组关联研究中基因优先级排序的影响。
Pac Symp Biocomput. 2019;24:296-307.
7
Building an optimal predictive model for imputing tissue-specific gene expression by combining genotype and whole-blood transcriptome data.通过结合基因型和全血转录组数据构建优化的组织特异性基因表达推断预测模型。
HGG Adv. 2023 Jul 11;4(4):100223. doi: 10.1016/j.xhgg.2023.100223. eCollection 2023 Oct 12.
8
How powerful are summary-based methods for identifying expression-trait associations under different genetic architectures?基于汇总数据的方法在不同遗传结构下识别表达性状关联的能力有多强?
Pac Symp Biocomput. 2018;23:228-239.
9
Imputing Gene Expression in Uncollected Tissues Within and Beyond GTEx.推断GTEx内部和外部未采集组织中的基因表达。
Am J Hum Genet. 2016 Apr 7;98(4):697-708. doi: 10.1016/j.ajhg.2016.02.020. Epub 2016 Mar 31.
10
TIGAR: An Improved Bayesian Tool for Transcriptomic Data Imputation Enhances Gene Mapping of Complex Traits.TIGAR:一种改进的转录组数据插补贝叶斯工具,可增强复杂性状的基因定位。
Am J Hum Genet. 2019 Aug 1;105(2):258-266. doi: 10.1016/j.ajhg.2019.05.018. Epub 2019 Jun 20.

引用本文的文献

1
Therapeutic Target Discovery for Multiple Myeloma: Identifying Druggable Genes via Mendelian Randomization.多发性骨髓瘤的治疗靶点发现:通过孟德尔随机化确定可成药基因。
Biomedicines. 2025 Apr 5;13(4):885. doi: 10.3390/biomedicines13040885.
2
SR-TWAS: leveraging multiple reference panels to improve transcriptome-wide association study power by ensemble machine learning.SR-TWAS:通过集成机器学习利用多个参考面板提高转录组全基因组关联研究的效力。
Nat Commun. 2024 Aug 5;15(1):6646. doi: 10.1038/s41467-024-50983-w.
3
OTTERS: a powerful TWAS framework leveraging summary-level reference data.

本文引用的文献

1
Transcriptome prediction performance across machine learning models and diverse ancestries.跨机器学习模型和不同血统的转录组预测性能。
HGG Adv. 2021 Apr 8;2(2). doi: 10.1016/j.xhgg.2020.100019. Epub 2021 Jan 5.
2
A unified framework for cross-population trait prediction by leveraging the genetic correlation of polygenic traits.利用多基因性状的遗传相关性进行跨人群性状预测的统一框架。
Am J Hum Genet. 2021 Apr 1;108(4):632-655. doi: 10.1016/j.ajhg.2021.03.002. Epub 2021 Mar 25.
3
Large eQTL meta-analysis reveals differing patterns between cerebral cortical and cerebellar brain regions.
水獭:一个利用汇总级别的参考数据的强大 TWAS 框架。
Nat Commun. 2023 Mar 7;14(1):1271. doi: 10.1038/s41467-023-36862-w.
大规模 eQTL 荟萃分析揭示了大脑皮质和小脑脑区之间不同的模式。
Sci Data. 2020 Oct 12;7(1):340. doi: 10.1038/s41597-020-00642-8.
4
A unified framework for joint-tissue transcriptome-wide association and Mendelian randomization analysis.联合组织转录组全基因组关联和孟德尔随机化分析的统一框架。
Nat Genet. 2020 Nov;52(11):1239-1246. doi: 10.1038/s41588-020-0706-2. Epub 2020 Oct 5.
5
: batch effect adjustment for RNA-seq count data.RNA测序计数数据的批次效应调整
NAR Genom Bioinform. 2020 Sep;2(3):lqaa078. doi: 10.1093/nargab/lqaa078. Epub 2020 Sep 21.
6
PhenomeXcan: Mapping the genome to the phenome through the transcriptome.PhenomeXcan:通过转录组将基因组映射到表型组。
Sci Adv. 2020 Sep 10;6(37). doi: 10.1126/sciadv.aba2083. Print 2020 Sep.
7
The GTEx Consortium atlas of genetic regulatory effects across human tissues.GTEx 联盟人类组织遗传调控效应图谱
Science. 2020 Sep 11;369(6509):1318-1330. doi: 10.1126/science.aaz1776.
8
PTWAS: investigating tissue-relevant causal molecular mechanisms of complex traits using probabilistic TWAS analysis.PTWAS:使用概率性 TWAS 分析研究复杂性状的组织相关因果分子机制。
Genome Biol. 2020 Sep 11;21(1):232. doi: 10.1186/s13059-020-02026-y.
9
A framework for transcriptome-wide association studies in breast cancer in diverse study populations.用于在不同研究人群的乳腺癌中转录组全基因组关联研究的框架。
Genome Biol. 2020 Feb 20;21(1):42. doi: 10.1186/s13059-020-1942-6.
10
A benchmark of batch-effect correction methods for single-cell RNA sequencing data.单细胞 RNA 测序数据批次效应校正方法的基准测试。
Genome Biol. 2020 Jan 16;21(1):12. doi: 10.1186/s13059-019-1850-9.