• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过结合基因型和全血转录组数据构建优化的组织特异性基因表达推断预测模型。

Building an optimal predictive model for imputing tissue-specific gene expression by combining genotype and whole-blood transcriptome data.

机构信息

Interdisciplinary Program in Bioengineering, Seoul National University, Seoul, Republic of Korea.

Department of Biostatistics, Columbia University, New York, NY, USA.

出版信息

HGG Adv. 2023 Jul 11;4(4):100223. doi: 10.1016/j.xhgg.2023.100223. eCollection 2023 Oct 12.

DOI:10.1016/j.xhgg.2023.100223
Abstract

Accurate imputation of tissue-specific gene expression can be a powerful tool for understanding the biological mechanisms underlying human complex traits. Existing imputation methods can be grouped into two categories according to the types of predictors used. The first category uses genotype data, while the second category uses whole-blood expression data. Both data types can be easily collected from blood, avoiding invasive tissue biopsies. In this study, we attempted to build an optimal predictive model for imputing tissue-specific gene expression by combining the genotype and whole-blood expression data. We first evaluated the imputation performance of each standalone model (using genotype data [GEN model] and using whole-blood expression data [WBE model]) using their respective data types across 47 human tissues. The WBE model outperformed the GEN model in most tissues by a large gain. Then, we developed several combined models that leverage both types of predictors to further improve imputation performance. We tried various strategies, including utilizing a merged dataset of the two data types (MERGED models) and integrating the imputation outcomes of the two standalone models (inverse variance-weighted [IVW] models). We found that one of the MERGED models noticeably outperformed the standalone models. This model involved a fixed ratio between the two regularization penalty factors for the two predictor types so that the contribution of the whole-blood transcriptome is upweighted compared with the genotype. Our study suggests that one can improve the imputation of tissue-specific gene expression by combining the genotype and whole-blood expression, but the improvement can be largely dependent on the combination strategy chosen.

摘要

准确推断组织特异性基因表达可以成为理解人类复杂性状背后生物学机制的有力工具。现有的推断方法可以根据所使用的预测因子类型分为两类。第一类使用基因型数据,而第二类使用全血表达数据。这两种数据类型都可以很容易地从血液中收集,避免了侵入性的组织活检。在这项研究中,我们试图通过结合基因型和全血表达数据来构建一个最佳的预测模型来推断组织特异性基因表达。我们首先使用各自的数据类型评估了每个独立模型(使用基因型数据[GEN 模型]和使用全血表达数据[WBE 模型])在 47 个人类组织中的推断性能。在大多数组织中,WBE 模型的表现明显优于 GEN 模型,增益很大。然后,我们开发了几种结合两种预测因子的组合模型,以进一步提高推断性能。我们尝试了各种策略,包括利用两种数据类型的合并数据集(MERGED 模型)和整合两个独立模型的推断结果(逆方差加权[IVW]模型)。我们发现,MERGED 模型之一的表现明显优于独立模型。该模型涉及两种预测因子的两个正则化惩罚因子之间的固定比例,从而与基因型相比,全血转录组的贡献得到了增强。我们的研究表明,通过结合基因型和全血表达,可以提高组织特异性基因表达的推断,但改进效果在很大程度上取决于所选择的组合策略。

相似文献

1
Building an optimal predictive model for imputing tissue-specific gene expression by combining genotype and whole-blood transcriptome data.通过结合基因型和全血转录组数据构建优化的组织特异性基因表达推断预测模型。
HGG Adv. 2023 Jul 11;4(4):100223. doi: 10.1016/j.xhgg.2023.100223. eCollection 2023 Oct 12.
2
Meta-imputation of transcriptome from genotypes across multiple datasets by leveraging publicly available summary-level data.利用公开的汇总水平数据,通过跨多个数据集的基因型进行转录组元推断。
PLoS Genet. 2022 Jan 31;18(1):e1009571. doi: 10.1371/journal.pgen.1009571. eCollection 2022 Jan.
3
Imputing Gene Expression in Uncollected Tissues Within and Beyond GTEx.推断GTEx内部和外部未采集组织中的基因表达。
Am J Hum Genet. 2016 Apr 7;98(4):697-708. doi: 10.1016/j.ajhg.2016.02.020. Epub 2016 Mar 31.
4
Influence of tissue context on gene prioritization for predicted transcriptome-wide association studies.组织背景对预测的全转录组关联研究中基因优先级排序的影响。
Pac Symp Biocomput. 2019;24:296-307.
5
Using Collaborative Mixed Models to Account for Imputation Uncertainty in Transcriptome-Wide Association Studies.使用协作混合模型来解释转录组关联研究中插补不确定性。
Methods Mol Biol. 2021;2212:93-103. doi: 10.1007/978-1-0716-0947-7_7.
6
SUMMIT-FA: a new resource for improved transcriptome imputation using functional annotations.SUMMIT-FA:利用功能注释提高转录本推断的新资源。
Hum Mol Genet. 2024 Mar 20;33(7):624-635. doi: 10.1093/hmg/ddad205.
7
Power analysis of transcriptome-wide association study: Implications for practical protocol choice.全转录组关联研究的功效分析:对实际方案选择的启示。
PLoS Genet. 2021 Feb 26;17(2):e1009405. doi: 10.1371/journal.pgen.1009405. eCollection 2021 Feb.
8
Integrative approaches for large-scale transcriptome-wide association studies.大规模全转录组关联研究的综合方法
Nat Genet. 2016 Mar;48(3):245-52. doi: 10.1038/ng.3506. Epub 2016 Feb 8.
9
MOSTWAS: Multi-Omic Strategies for Transcriptome-Wide Association Studies.多组学策略在全转录组关联研究中的应用
PLoS Genet. 2021 Mar 8;17(3):e1009398. doi: 10.1371/journal.pgen.1009398. eCollection 2021 Mar.
10
How powerful are summary-based methods for identifying expression-trait associations under different genetic architectures?基于汇总数据的方法在不同遗传结构下识别表达性状关联的能力有多强?
Pac Symp Biocomput. 2018;23:228-239.

本文引用的文献

1
TIGAR-V2: Efficient TWAS tool with nonparametric Bayesian eQTL weights of 49 tissue types from GTEx V8.TIGAR-V2:一种高效的全转录组关联研究工具,具有来自GTEx V8的49种组织类型的非参数贝叶斯表达定量性状基因座权重。
HGG Adv. 2021 Nov 4;3(1):100068. doi: 10.1016/j.xhgg.2021.100068. eCollection 2022 Jan 13.
2
Predicting tissue-specific gene expression from whole blood transcriptome.从全血转录组预测组织特异性基因表达。
Sci Adv. 2021 Apr 2;7(14). doi: 10.1126/sciadv.abd6991. Print 2021 Apr.
3
Current Achievements and Applications of Transcriptomics in Personalized Cancer Medicine.
转录组学在个性化癌症医学中的当前成就和应用。
Int J Mol Sci. 2021 Jan 31;22(3):1422. doi: 10.3390/ijms22031422.
4
Bayesian Genome-wide TWAS Method to Leverage both cis- and trans-eQTL Information through Summary Statistics.贝叶斯全基因组 TWAS 方法,通过汇总统计数据利用 cis- 和 trans-eQTL 信息。
Am J Hum Genet. 2020 Oct 1;107(4):714-726. doi: 10.1016/j.ajhg.2020.08.022. Epub 2020 Sep 21.
5
Blood-based multi-tissue gene expression inference with Bayesian ridge regression.基于贝叶斯岭回归的血液多组织基因表达推断。
Bioinformatics. 2020 Jun 1;36(12):3788-3794. doi: 10.1093/bioinformatics/btaa239.
6
Integrating predicted transcriptome from multiple tissues improves association detection.整合来自多个组织的预测转录组可提高关联检测。
PLoS Genet. 2019 Jan 22;15(1):e1007889. doi: 10.1371/journal.pgen.1007889. eCollection 2019 Jan.
7
Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics.从 GWAS 汇总统计数据推断组织特异性基因表达变异的表型后果。
Nat Commun. 2018 May 8;9(1):1825. doi: 10.1038/s41467-018-03621-1.
8
A Scalable Bayesian Method for Integrating Functional Information in Genome-wide Association Studies.一种用于全基因组关联研究中整合功能信息的可扩展贝叶斯方法。
Am J Hum Genet. 2017 Sep 7;101(3):404-416. doi: 10.1016/j.ajhg.2017.08.002. Epub 2017 Aug 24.
9
Comparison of Two Meta-Analysis Methods: Inverse-Variance-Weighted Average and Weighted Sum of Z-Scores.两种Meta分析方法的比较:逆方差加权平均数与Z分数加权和
Genomics Inform. 2016 Dec;14(4):173-180. doi: 10.5808/GI.2016.14.4.173. Epub 2016 Dec 30.
10
Gene-specific patterns of expression variation across organs and species.跨器官和物种的基因特异性表达变异模式。
Genome Biol. 2016 Jul 8;17(1):151. doi: 10.1186/s13059-016-1008-y.