QuASAR: quantitative allele-specific analysis of reads.

Author information

Harvey Chris T, Moyerbrailean Gregory A, Davis Gordon O, Wen Xiaoquan, Luca Francesca, Pique-Regi Roger

Affiliations

Center for Molecular Medicine and Genetics, Department of Obstetrics and Gynecology, Wayne State University, 540 E Canfield, Scott Hall, Detroit, MI 48201, USA and Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA.

Publication information

Bioinformatics. 2015 Apr 15;31(8):1235-42. doi: 10.1093/bioinformatics/btu802. Epub 2014 Dec 4.

Abstract

MOTIVATION

Expression quantitative trait loci (eQTL) studies have discovered thousands of genetic variants that regulate gene expression, enabling a better understanding of the functional role of non-coding sequences. However, eQTL studies are costly, requiring large sample sizes and genome-wide genotyping of each sample. In contrast, analysis of allele-specific expression (ASE) is becoming a popular approach to detect the effect of genetic variation on gene expression, even within a single individual. This is typically achieved by counting the number of RNA-seq reads matching each allele at heterozygous sites and testing the null hypothesis of a 1:1 allelic ratio. In principle, when genotype information is not readily available, it could be inferred from the RNA-seq reads directly. However, no existing method jointly infers genotypes and conducts ASE inference while accounting for uncertainty in the genotype calls.
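The conventional test described above (count reads per allele at a heterozygous site, then test a 1:1 ratio) can be sketched as an exact two-sided binomial test. This is a minimal illustration only, not code from QuASAR; the function name `allelic_imbalance_pvalue` and its arguments are hypothetical.

```python
from math import comb


def allelic_imbalance_pvalue(ref_reads: int, alt_reads: int, p_null: float = 0.5) -> float:
    """Exact two-sided binomial p-value for the null of a 1:1 allelic ratio.

    Hypothetical helper for illustration; assumes the site is truly
    heterozygous and ignores sequencing errors and over-dispersion.
    """
    n = ref_reads + alt_reads
    if n == 0:
        return 1.0  # no reads, no evidence against the null
    # Binomial probability of every possible reference-read count.
    pmf = [comb(n, k) * p_null**k * (1 - p_null) ** (n - k) for k in range(n + 1)]
    observed = pmf[ref_reads]
    # Sum the outcomes that are no more likely than the observed one.
    # The small tolerance guards against floating-point ties.
    p_value = sum(q for q in pmf if q <= observed * (1 + 1e-9))
    return min(1.0, p_value)


if __name__ == "__main__":
    print(allelic_imbalance_pvalue(10, 10))  # balanced counts
    print(allelic_imbalance_pvalue(20, 0))   # strongly imbalanced counts
```

A small p-value suggests allelic imbalance, but only if the site really is heterozygous; a homozygous site miscalled as heterozygous would produce the same extreme counts, which is the gap the abstract points to.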

RESULTS

We present QuASAR, quantitative allele-specific analysis of reads, a novel statistical learning method for jointly detecting heterozygous genotypes and inferring ASE. The proposed ASE inference step takes into consideration the uncertainty in the genotype calls, while including parameters that model base-call errors in sequencing and allelic over-dispersion. We validated our method with experimental data for which high-quality genotypes are available. Results for an additional dataset with multiple replicates at different sequencing depths demonstrate that QuASAR is a powerful tool for ASE analysis when genotypes are not available.
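To make the ingredients named above concrete, the following sketch shows one generic way to (a) compute genotype posteriors from read counts under a base-call error rate and (b) score allelic ratios with a beta-binomial likelihood that allows over-dispersion. It is an illustration under stated assumptions, not the QuASAR implementation: the function names, the flat genotype prior, the error rate `eps`, and the concentration parameter `m` are all hypothetical choices.

```python
from math import exp, lgamma, log


def genotype_posteriors(ref, alt, eps=0.01, priors=(1 / 3, 1 / 3, 1 / 3)):
    """Posterior over (hom-ref, het, hom-alt) given read counts.

    Assumes each read independently shows the reference allele with
    probability 1 - eps, 0.5, or eps depending on genotype (illustrative).
    """
    p_ref = (1 - eps, 0.5, eps)
    log_w = [log(pr) + ref * log(p) + alt * log(1 - p) for pr, p in zip(priors, p_ref)]
    top = max(log_w)  # subtract the maximum for numerical stability
    w = [exp(v - top) for v in log_w]
    total = sum(w)
    return [v / total for v in w]


def log_beta(a, b):
    """Natural log of the Beta function."""
    return lgamma(a) + lgamma(b) - lgamma(a + b)


def beta_binomial_loglik(ref, alt, rho, m):
    """Beta-binomial log-likelihood with mean rho and concentration m.

    The binomial coefficient is omitted because it cancels when comparing
    two values of rho on the same counts. Smaller m means more over-dispersion.
    """
    a, b = rho * m, (1 - rho) * m
    return log_beta(ref + a, alt + b) - log_beta(a, b)


if __name__ == "__main__":
    post = genotype_posteriors(15, 5)
    print("P(het | reads) =", post[1])
    # Compare an imbalanced allelic ratio against the 1:1 null.
    gain = beta_binomial_loglik(15, 5, 0.75, 20.0) - beta_binomial_loglik(15, 5, 0.5, 20.0)
    print("log-likelihood gain over 1:1 =", gain)
```

One way to carry genotype uncertainty into the ASE step, in the spirit of the abstract, would be to weight the evidence at each site by its heterozygote posterior rather than thresholding genotype calls; how QuASAR itself does this is described in the paper, not here.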

AVAILABILITY AND IMPLEMENTATION

http://github.com/piquelab/QuASAR.

CONTACT

fluca@wayne.edu or rpique@wayne.edu

SUPPLEMENTARY INFORMATION

Supplementary Material is available at Bioinformatics online.
