Suppr超能文献

通过成分测量误差回归建模纳入量化不确定性的差异转录本使用分析。

Differential transcript usage analysis incorporating quantification uncertainty via compositional measurement error regression modeling.

作者信息

Young Amber M, Van Buren Scott, Rashid Naim U

机构信息

Department of Biostatistics, University of North Carolina at Chapel Hill, 135 Dauer Drive, Chapel Hill, NC, 27599, USA.

Department of Biostatistics, University of North Carolina at Chapel Hill, 135 Dauer Drive, Chapel Hill, NC, 27599, USA and Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, 450 West Drive, Chapel Hill, NC, 27599, USA.

出版信息

Biostatistics. 2024 Apr 15;25(2):559-576. doi: 10.1093/biostatistics/kxad008.

Abstract

Differential transcript usage (DTU) occurs when the relative expression of multiple transcripts arising from the same gene changes between different conditions. Existing approaches to detect DTU often rely on computational procedures that can have speed and scalability issues as the number of samples increases. Here we propose a new method, CompDTU, that uses compositional regression to model the relative abundance proportions of each transcript that are of interest in DTU analyses. This procedure leverages fast matrix-based computations that make it ideally suited for DTU analysis with larger sample sizes. This method also allows for the testing of and adjustment for multiple categorical or continuous covariates. Additionally, many existing approaches for DTU ignore quantification uncertainty in the expression estimates for each transcript in RNA-seq data. We extend our CompDTU method to incorporate quantification uncertainty leveraging common output from RNA-seq expression quantification tool in a novel method CompDTUme. Through several power analyses, we show that CompDTU has excellent sensitivity and reduces false positive results relative to existing methods. Additionally, CompDTUme results in further improvements in performance over CompDTU with sufficient sample size for genes with high levels of quantification uncertainty, while also maintaining favorable speed and scalability. We motivate our methods using data from the Cancer Genome Atlas Breast Invasive Carcinoma data set, specifically using RNA-seq data from primary tumors for 740 patients with breast cancer. We show greatly reduced computation time from our new methods as well as the ability to detect several novel genes with significant DTU across different breast cancer subtypes.

摘要

当同一基因产生的多个转录本的相对表达在不同条件下发生变化时,就会出现差异转录本使用情况(DTU)。现有的检测DTU的方法通常依赖于计算程序,随着样本数量的增加,这些程序可能会出现速度和可扩展性问题。在这里,我们提出了一种新方法CompDTU,它使用成分回归来对DTU分析中感兴趣的每个转录本的相对丰度比例进行建模。该程序利用基于矩阵的快速计算,使其非常适合用于更大样本量的DTU分析。该方法还允许对多个分类或连续协变量进行检验和调整。此外,许多现有的DTU方法忽略了RNA测序数据中每个转录本表达估计的量化不确定性。我们扩展了我们的CompDTU方法,以利用RNA测序表达量化工具的常见输出,在一种新方法CompDTUme中纳入量化不确定性。通过几次功效分析,我们表明CompDTU具有出色的灵敏度,并且相对于现有方法减少了假阳性结果。此外,对于具有高量化不确定性水平的基因,在样本量足够的情况下,CompDTUme在性能上比CompDTU有进一步的提升,同时还保持了良好的速度和可扩展性。我们使用来自癌症基因组图谱乳腺浸润癌数据集的数据,特别是740例乳腺癌患者原发肿瘤的RNA测序数据,来推动我们的方法。我们展示了我们新方法大大减少的计算时间,以及检测不同乳腺癌亚型中几个具有显著DTU的新基因的能力。

相似文献

引用本文的文献

1
Compression of quantification uncertainty for scRNA-seq counts.单细胞RNA测序计数的量化不确定性压缩
Bioinformatics. 2021 Jul 19;37(12):1699-1707. doi: 10.1093/bioinformatics/btab001.

本文引用的文献

9
The Functional Impact of Alternative Splicing in Cancer.可变剪接在癌症中的功能影响
Cell Rep. 2017 Aug 29;20(9):2215-2226. doi: 10.1016/j.celrep.2017.08.012.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验