Selection-adjusted inference: an application to confidence intervals for cis-eQTL effect sizes.

Affiliations

Department of Statistics, University of Michigan, 451 West Hall, 1085 South University, Ann Arbor, MI, USA.

Department of Electrical Engineering, Stanford University, 350 Serra Mall, Stanford, CA, USA.

Publication information

Biostatistics. 2021 Jan 28;22(1):181-197. doi: 10.1093/biostatistics/kxz024.

DOI: 10.1093/biostatistics/kxz024
PMID: 31301173
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7846186/
Abstract

The goal of expression quantitative trait loci (eQTL) studies is to identify the genetic variants that influence the expression levels of the genes in an organism. High-throughput technology has made such studies possible: in a given tissue sample, it enables us to quantify the expression levels of approximately 20 000 genes and to record the alleles present at millions of genetic polymorphisms. While obtaining this data is relatively cheap once a specimen is at hand, obtaining human tissue remains a costly endeavor: eQTL studies continue to be based on relatively small sample sizes, with this limitation particularly serious for tissues such as brain and liver, often the organs of most immediate medical relevance. Given the high-dimensional nature of these datasets and the large number of hypotheses tested, the scientific community adopted multiplicity adjustment procedures early on. These testing procedures primarily control the false discovery rate for the identification of genetic variants with influence on the expression levels. In contrast, a problem that has not received much attention to date is that of providing estimates of the effect sizes associated with these variants, in a way that accounts for the considerable amount of selection. Yet, given the difficulty of procuring additional samples, this challenge is of practical importance. We illustrate in this work how the recently developed conditional inference approach can be deployed to obtain confidence intervals for the eQTL effect sizes with reliable coverage. The procedure we propose is based on a randomized hierarchical strategy with a two-fold contribution: (1) it reflects the selection steps typically adopted in state-of-the-art investigations and (2) it introduces the use of randomness instead of data-splitting to maximize the use of available data. Analysis of the GTEx Liver dataset (v6) suggests that naively obtained confidence intervals would likely not cover the true values of effect sizes and that the number of local genetic polymorphisms influencing the expression level of genes might be underestimated.
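
The coverage problem described in the abstract can be illustrated with a toy simulation. The sketch below is not the paper's randomized hierarchical procedure. It assumes a single test statistic Z ~ N(mu, 1) that is reported only when it exceeds a one-sided threshold c, and it compares the naive interval z ± 1.96 with an interval obtained by conditioning on the selection event (a truncated-normal pivot). The function names, the threshold, and the effect size are all hypothetical choices made for illustration.

```python
import math
import random
from statistics import NormalDist


def sf(x):
    """Standard normal survival function, 1 - Phi(x)."""
    return 0.5 * math.erfc(x / math.sqrt(2.0))


def truncated_cdf(z, mu, c):
    """P(Z <= z | Z > c) for Z ~ N(mu, 1); requires z > c."""
    return 1.0 - sf(z - mu) / sf(c - mu)


def conditional_interval(z, c, alpha=0.05, width=30.0, iters=100):
    """Invert the truncated-normal pivot by bisection.

    truncated_cdf is decreasing in mu, so each endpoint solves
    truncated_cdf(z, mu, c) = target. The search is limited to
    [z - width, z + 10]; an endpoint outside that window is clamped.
    """
    def solve(target):
        lo, hi = z - width, z + 10.0
        for _ in range(iters):
            mid = 0.5 * (lo + hi)
            if truncated_cdf(z, mid, c) > target:
                lo = mid  # pivot still above target: increase mu
            else:
                hi = mid
        return 0.5 * (lo + hi)

    return solve(1.0 - alpha / 2.0), solve(alpha / 2.0)


def simulate(mu=0.5, c=2.0, alpha=0.05, n_draws=200_000, seed=0):
    """Coverage of naive and conditional intervals among selected draws."""
    rng = random.Random(seed)
    q = NormalDist().inv_cdf(1.0 - alpha / 2.0)
    selected = naive_hits = cond_hits = 0
    for _ in range(n_draws):
        z = rng.gauss(mu, 1.0)
        if z <= c:
            continue  # not selected, so no interval is reported
        selected += 1
        naive_hits += abs(z - mu) <= q
        # Up to clamping, mu lies in the conditional interval exactly
        # when the pivot at the true mu falls in the central 1 - alpha.
        pivot = truncated_cdf(z, mu, c)
        cond_hits += alpha / 2.0 <= pivot <= 1.0 - alpha / 2.0
    return naive_hits / selected, cond_hits / selected


if __name__ == "__main__":
    naive, cond = simulate()
    print(f"naive coverage among selected:       {naive:.3f}")
    print(f"conditional coverage among selected: {cond:.3f}")
    print("conditional interval for z=2.5, c=2.0:",
          conditional_interval(2.5, 2.0))
```

Under these assumed settings (mu = 0.5, c = 2), the naive interval covers the true effect in roughly 63% of selected cases in theory, while the conditional interval keeps its nominal 95% coverage. The conditional interval is also shifted toward smaller effects than the naive one, which reflects the correction for the winner's curse.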

Similar articles

1
Selection-adjusted inference: an application to confidence intervals for cis-eQTL effect sizes.
Biostatistics. 2021 Jan 28;22(1):181-197. doi: 10.1093/biostatistics/kxz024.
2
Power, false discovery rate and Winner's Curse in eQTL studies.
Nucleic Acids Res. 2018 Dec 14;46(22):e133. doi: 10.1093/nar/gky780.
3
HT-eQTL: integrative expression quantitative trait loci analysis in a large number of human tissues.
BMC Bioinformatics. 2018 Mar 9;19(1):95. doi: 10.1186/s12859-018-2088-3.
4
Conditional eQTL analysis reveals allelic heterogeneity of gene expression.
Hum Mol Genet. 2017 Apr 15;26(8):1444-1451. doi: 10.1093/hmg/ddx043.
5
Comparing allele specific expression and local expression quantitative trait loci and the influence of gene expression on complex trait variation in cattle.
BMC Genomics. 2018 Nov 3;19(1):793. doi: 10.1186/s12864-018-5181-0.
6
Integrative eQTL-weighted hierarchical Cox models for SNP-set based time-to-event association studies.
J Transl Med. 2021 Oct 9;19(1):418. doi: 10.1186/s12967-021-03090-z.
7
Accurate and fast multiple-testing correction in eQTL studies.
Am J Hum Genet. 2015 Jun 4;96(6):857-68. doi: 10.1016/j.ajhg.2015.04.012. Epub 2015 May 28.
8
A Comprehensive cis-eQTL Analysis Revealed Target Genes in Breast Cancer Susceptibility Loci Identified in Genome-wide Association Studies.
Am J Hum Genet. 2018 May 3;102(5):890-903. doi: 10.1016/j.ajhg.2018.03.016.
9
An independent component analysis confounding factor correction framework for identifying broad impact expression quantitative trait loci.
PLoS Comput Biol. 2017 May 15;13(5):e1005537. doi: 10.1371/journal.pcbi.1005537. eCollection 2017 May.
10
The use of genome-wide eQTL associations in lymphoblastoid cell lines to identify novel genetic pathways involved in complex traits.
PLoS One. 2011;6(7):e22070. doi: 10.1371/journal.pone.0022070. Epub 2011 Jul 15.

Cited by

1
Integrative Bayesian models using Post-selective inference: A case study in radiogenomics.
Biometrics. 2023 Sep;79(3):1801-1813. doi: 10.1111/biom.13740. Epub 2022 Aug 31.
2
Estimation of genetic variance contributed by a quantitative trait locus: correcting the bias associated with significance tests.
Genetics. 2021 Nov 5;219(3). doi: 10.1093/genetics/iyab115.
