Department of Statistics, Florida State University, 214 Rogers Building, 117 N. Woodward Avenue, Tallahassee, FL 32306, United States.
Department of Biostatistics, The University of Texas MD Anderson Cancer Center, 7007 Bertner Avenue, Unit 1689, Houston, TX 77030, United States.
Hum Mol Genet. 2024 Mar 20;33(7):624-635. doi: 10.1093/hmg/ddad205.
Transcriptome-wide association studies (TWAS) integrate gene expression prediction models and genome-wide association studies (GWAS) to identify gene-trait associations. The power of TWAS is determined by the sample size of GWAS and the accuracy of the expression prediction model. Here, we present a new method, the Summary-level Unified Method for Modeling Integrated Transcriptome using Functional Annotations (SUMMIT-FA), which improves gene expression prediction accuracy by leveraging functional annotation resources and a large expression quantitative trait loci (eQTL) summary-level dataset. We build gene expression prediction models in whole blood using SUMMIT-FA with the comprehensive functional database MACIE and eQTL summary-level data from the eQTLGen consortium. We apply these models to GWAS for 24 complex traits and show that SUMMIT-FA identifies significantly more gene-trait associations and improves predictive power for identifying "silver standard" genes compared to several benchmark methods. We further conduct a simulation study to demonstrate the effectiveness of SUMMIT-FA.
转录组关联研究(TWAS)将基因表达预测模型和全基因组关联研究(GWAS)整合在一起,以鉴定基因-性状关联。TWAS 的功效取决于 GWAS 的样本量和表达预测模型的准确性。在这里,我们提出了一种新的方法,即使用功能注释进行综合转录组建模的汇总级统一方法(SUMMIT-FA),该方法通过利用功能注释资源和大型表达数量性状基因座(eQTL)汇总数据集来提高基因表达预测准确性。我们使用 SUMMIT-FA 构建了使用全面功能数据库 MACIE 和来自 eQTLGen 联盟的 eQTL 汇总数据集的全血基因表达预测模型。我们将这些模型应用于 24 种复杂性状的 GWAS,并表明 SUMMIT-FA 比几种基准方法鉴定出更多的基因-性状关联,并提高了鉴定“银标准”基因的预测能力。我们还进行了一项模拟研究,以证明 SUMMIT-FA 的有效性。