clrDV：一种基于偏态正态分布的 RNA-Seq 数据差异变异性检验方法。

clrDV: a differential variability test for RNA-Seq data based on the skew-normal distribution.

机构信息

Institute of Mathematical Sciences, Universiti Malaya, Kuala Lumpur, Malaysia.

Universiti Malaya Centre for Data Analytics, Universiti Malaya, Kuala Lumpur, Malaysia.

出版信息

PeerJ. 2023 Sep 29;11:e16126. doi: 10.7717/peerj.16126. eCollection 2023.

DOI:10.7717/peerj.16126

PMID:37790621

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10544356/

Abstract

BACKGROUND

Pathological conditions may result in certain genes having expression variance that differs markedly from that of the control. Finding such genes from gene expression data can provide invaluable candidates for therapeutic intervention. Under the dominant paradigm for modeling RNA-Seq gene counts using the negative binomial model, tests of differential variability are challenging to develop, owing to dependence of the variance on the mean.

METHODS

Here, we describe clrDV, a statistical method for detecting genes that show differential variability between two populations. We present the skew-normal distribution for modeling gene-wise null distribution of centered log-ratio transformation of compositional RNA-seq data.

RESULTS

Simulation results show that clrDV has false discovery rate and probability of Type II error that are on par with or superior to existing methodologies. In addition, its run time is faster than its closest competitors, and remains relatively constant for increasing sample size per group. Analysis of a large neurodegenerative disease RNA-Seq dataset using clrDV successfully recovers multiple gene candidates that have been reported to be associated with Alzheimer's disease.

摘要

背景

病理条件可能导致某些基因的表达方差与对照明显不同。从基因表达数据中找到这些基因，可以为治疗干预提供非常有价值的候选基因。在使用负二项模型对 RNA-Seq 基因计数进行建模的主流范例下，由于方差依赖于均值，因此很难开发出用于检测差异变异性的检验方法。

方法

在这里，我们描述了 clrDV，这是一种用于检测两个群体之间差异变异性的基因的统计方法。我们提出了偏态正态分布，用于对基于中心对数比变换的组成性 RNA-seq 数据的基因进行零分布建模。

结果

模拟结果表明，clrDV 的假发现率和第二类错误的概率与现有方法相当或优于现有方法。此外，它的运行时间比最接近的竞争对手快，并且随着每组样本量的增加而保持相对稳定。使用 clrDV 对大型神经退行性疾病 RNA-Seq 数据集进行分析，成功地恢复了多个已报道与阿尔茨海默病相关的基因候选物。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7a0/10544356/fc864e8b861c/peerj-11-16126-g001.jpg

相似文献

clrDV: a differential variability test for RNA-Seq data based on the skew-normal distribution.clrDV：一种基于偏态正态分布的 RNA-Seq 数据差异变异性检验方法。

PeerJ. 2023 Sep 29;11:e16126. doi: 10.7717/peerj.16126. eCollection 2023.

SIEVE: One-stop differential expression, variability, and skewness analyses using RNA-Seq data.SIEVE：使用RNA测序数据进行一站式差异表达、变异性和偏度分析。

bioRxiv. 2024 Apr 13:2024.04.09.588804. doi: 10.1101/2024.04.09.588804.

Benchmarking differential expression analysis tools for RNA-Seq: normalization-based vs. log-ratio transformation-based methods.RNA-Seq 差异表达分析工具的基准测试：基于标准化与基于对数比变换的方法。

BMC Bioinformatics. 2018 Jul 18;19(1):274. doi: 10.1186/s12859-018-2261-8.

deGPS is a powerful tool for detecting differential expression in RNA-sequencing studies.deGPS是一种用于在RNA测序研究中检测差异表达的强大工具。

BMC Genomics. 2015 Jun 13;16(1):455. doi: 10.1186/s12864-015-1676-0.

LFCseq: a nonparametric approach for differential expression analysis of RNA-seq data.LFCseq：一种用于RNA测序数据差异表达分析的非参数方法。

BMC Genomics. 2014;15 Suppl 10(Suppl 10):S7. doi: 10.1186/1471-2164-15-S10-S7. Epub 2014 Dec 12.

An evaluation of RNA-seq differential analysis methods.RNA-seq 差异分析方法评估。

PLoS One. 2022 Sep 16;17(9):e0264246. doi: 10.1371/journal.pone.0264246. eCollection 2022.

Sample size calculations for the differential expression analysis of RNA-seq data using a negative binomial regression model.使用负二项回归模型对RNA测序数据进行差异表达分析的样本量计算。

Stat Appl Genet Mol Biol. 2019 Jan 22;18(1):/j/sagmb.2019.18.issue-1/sagmb-2018-0021/sagmb-2018-0021.xml. doi: 10.1515/sagmb-2018-0021.

Benchmarking RNA-seq differential expression analysis methods using spike-in and simulation data.使用 Spike-in 和模拟数据进行 RNA-seq 差异表达分析方法的基准测试。

PLoS One. 2020 Apr 30;15(4):e0232271. doi: 10.1371/journal.pone.0232271. eCollection 2020.

A comparison of per sample global scaling and per gene normalization methods for differential expression analysis of RNA-seq data.用于RNA测序数据差异表达分析的每个样本全局缩放和每个基因归一化方法的比较。

PLoS One. 2017 May 1;12(5):e0176185. doi: 10.1371/journal.pone.0176185. eCollection 2017.

Statistical models for RNA-seq data derived from a two-condition 48-replicate experiment.源自双条件48次重复实验的RNA测序数据的统计模型。

Bioinformatics. 2015 Nov 15;31(22):3625-30. doi: 10.1093/bioinformatics/btv425. Epub 2015 Jul 23.

本文引用的文献

Disentangling the Relationship Between Chronic Kidney Disease and Cognitive Disorders.理清慢性肾病与认知障碍之间的关系。

Front Neurol. 2022 Feb 25;13:830064. doi: 10.3389/fneur.2022.830064. eCollection 2022.

Identification of differentially distributed gene expression and distinct sets of cancer-related genes identified by changes in mean and variability.通过均值和变异性变化鉴定差异分布的基因表达以及不同的癌症相关基因集。

NAR Genom Bioinform. 2022 Jan 14;4(1):lqab124. doi: 10.1093/nargab/lqab124. eCollection 2022 Mar.

Alzheimer's disease and progressive supranuclear palsy share similar transcriptomic changes in distinct brain regions.阿尔茨海默病和进行性核上性麻痹在不同脑区具有相似的转录组变化。

J Clin Invest. 2022 Jan 18;132(2). doi: 10.1172/JCI149904.

Fibrillar amyloid peptides promote platelet aggregation through the coordinated action of ITAM- and ROS-dependent pathways.纤维状淀粉样肽通过 ITAM 和 ROS 依赖途径的协调作用促进血小板聚集。

J Thromb Haemost. 2020 Nov;18(11):3029-3042. doi: 10.1111/jth.15055. Epub 2020 Sep 13.

Reduced proteasome activity in the aging brain results in ribosome stoichiometry loss and aggregation.衰老大脑中蛋白酶体活性的降低导致核糖体计量失配和聚集。

Mol Syst Biol. 2020 Jun;16(6):e9596. doi: 10.15252/msb.20209596.

A field guide for the compositional analysis of any-omics data.任何组学数据的组成分析指南。

Gigascience. 2019 Sep 1;8(9). doi: 10.1093/gigascience/giz107.

RNA sequencing: the teenage years.RNA 测序：青少年时期。

Nat Rev Genet. 2019 Nov;20(11):631-656. doi: 10.1038/s41576-019-0150-2. Epub 2019 Jul 24.

Gene expression variability: the other dimension in transcriptome analysis.基因表达变异性：转录组分析的另一个维度。

Physiol Genomics. 2019 May 1;51(5):145-158. doi: 10.1152/physiolgenomics.00128.2018. Epub 2019 Mar 15.

Microglial Progranulin: Involvement in Alzheimer's Disease and Neurodegenerative Diseases.小胶质细胞颗粒蛋白：在阿尔茨海默病和神经退行性疾病中的作用。

Cells. 2019 Mar 11;8(3):230. doi: 10.3390/cells8030230.

A Novel Perspective Linkage Between Kidney Function and Alzheimer's Disease.肾功能与阿尔茨海默病之间的一种新视角联系

Front Cell Neurosci. 2018 Oct 29;12:384. doi: 10.3389/fncel.2018.00384. eCollection 2018.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

clrDV：一种基于偏态正态分布的 RNA-Seq 数据差异变异性检验方法。

clrDV: a differential variability test for RNA-Seq data based on the skew-normal distribution.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

背景

方法

结果

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献