• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基因表达变异性以及使用MDSeq对大规模RNA测序研究进行分析。

Gene expression variability and the analysis of large-scale RNA-seq studies with the MDSeq.

作者信息

Ran Di, Daye Z John

机构信息

Mel and Enid Zuckerman College of Public Health, The University of Arizona, Tucson, AZ 85724, USA.

Independent Researcher, Raleigh, NC 27612, USA.

出版信息

Nucleic Acids Res. 2017 Jul 27;45(13):e127. doi: 10.1093/nar/gkx456.

DOI:10.1093/nar/gkx456
PMID:28535263
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5737414/
Abstract

Rapidly decreasing cost of next-generation sequencing has led to the recent availability of large-scale RNA-seq data, that empowers the analysis of gene expression variability, in addition to gene expression means. In this paper, we present the MDSeq, based on the coefficient of dispersion, to provide robust and computationally efficient analysis of both gene expression means and variability on RNA-seq counts. The MDSeq utilizes a novel reparametrization of the negative binomial to provide flexible generalized linear models (GLMs) on both the mean and dispersion. We address challenges of analyzing large-scale RNA-seq data via several new developments to provide a comprehensive toolset that models technical excess zeros, identifies outliers efficiently, and evaluates differential expressions at biologically interesting levels. We evaluated performances of the MDSeq using simulated data when the ground truths are known. Results suggest that the MDSeq often outperforms current methods for the analysis of gene expression mean and variability. Moreover, the MDSeq is applied in two real RNA-seq studies, in which we identified functionally relevant genes and gene pathways. Specifically, the analysis of gene expression variability with the MDSeq on the GTEx human brain tissue data has identified pathways associated with common neurodegenerative disorders when gene expression means were conserved.

摘要

下一代测序成本的迅速下降使得大规模RNA测序数据近期得以获取,这不仅能分析基因表达均值,还能对基因表达变异性进行分析。在本文中,我们提出了基于离散系数的MDSeq,以对RNA测序计数中的基因表达均值和变异性进行稳健且计算高效的分析。MDSeq利用负二项式的一种新型重新参数化方法,在均值和离散度上提供灵活的广义线性模型(GLM)。我们通过几个新进展应对分析大规模RNA测序数据的挑战,以提供一个全面的工具集,该工具集能对技术上的过多零值进行建模、有效识别异常值,并在生物学上有意义的水平评估差异表达。当已知真实情况时,我们使用模拟数据评估了MDSeq的性能。结果表明,在分析基因表达均值和变异性方面,MDSeq常常优于当前方法。此外,MDSeq应用于两项真实的RNA测序研究,我们在其中鉴定出了功能相关的基因和基因通路。具体而言,在GTEx人类脑组织数据上使用MDSeq分析基因表达变异性时,当基因表达均值保持不变时,已鉴定出与常见神经退行性疾病相关的通路。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b0a/5737414/4638f32f65f2/gkx456fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b0a/5737414/88be2bf15fce/gkx456fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b0a/5737414/5c7cca75c193/gkx456fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b0a/5737414/78bb2fae922c/gkx456fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b0a/5737414/180d2fe52194/gkx456fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b0a/5737414/36f49ea985b2/gkx456fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b0a/5737414/4638f32f65f2/gkx456fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b0a/5737414/88be2bf15fce/gkx456fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b0a/5737414/5c7cca75c193/gkx456fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b0a/5737414/78bb2fae922c/gkx456fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b0a/5737414/180d2fe52194/gkx456fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b0a/5737414/36f49ea985b2/gkx456fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b0a/5737414/4638f32f65f2/gkx456fig6.jpg

相似文献

1
Gene expression variability and the analysis of large-scale RNA-seq studies with the MDSeq.基因表达变异性以及使用MDSeq对大规模RNA测序研究进行分析。
Nucleic Acids Res. 2017 Jul 27;45(13):e127. doi: 10.1093/nar/gkx456.
2
RNA-Seq Data Analysis in Galaxy.RNA-Seq 数据分析在 Galaxy 中。
Methods Mol Biol. 2021;2284:367-392. doi: 10.1007/978-1-0716-1307-8_20.
3
Comprehensive evaluation of differential gene expression analysis methods for RNA-seq data.RNA测序数据差异基因表达分析方法的综合评估
Genome Biol. 2013;14(9):R95. doi: 10.1186/gb-2013-14-9-r95.
4
Detection of high variability in gene expression from single-cell RNA-seq profiling.从单细胞RNA测序分析中检测基因表达的高变异性。
BMC Genomics. 2016 Aug 22;17 Suppl 7(Suppl 7):508. doi: 10.1186/s12864-016-2897-6.
5
Classifying next-generation sequencing data using a zero-inflated Poisson model.使用零膨胀泊松模型对下一代测序数据进行分类。
Bioinformatics. 2018 Apr 15;34(8):1329-1335. doi: 10.1093/bioinformatics/btx768.
6
What if we ignore the random effects when analyzing RNA-seq data in a multifactor experiment.在多因素实验中分析RNA测序数据时,如果我们忽略随机效应会怎样?
Stat Appl Genet Mol Biol. 2016 Apr;15(2):87-105. doi: 10.1515/sagmb-2015-0011.
7
DAFS: a data-adaptive flag method for RNA-sequencing data to differentiate genes with low and high expression.DAFS:一种用于 RNA-seq 数据的自适应标记方法,用于区分低表达和高表达基因。
BMC Bioinformatics. 2014 Mar 31;15:92. doi: 10.1186/1471-2105-15-92.
8
Statistical Modeling of High Dimensional Counts.高维计数的统计建模。
Methods Mol Biol. 2021;2284:97-134. doi: 10.1007/978-1-0716-1307-8_7.
9
Single-Cell RNA Sequencing Analysis: A Step-by-Step Overview.单细胞 RNA 测序分析:分步概述。
Methods Mol Biol. 2021;2284:343-365. doi: 10.1007/978-1-0716-1307-8_19.
10
A flexible count data model to fit the wide diversity of expression profiles arising from extensively replicated RNA-seq experiments.一种灵活的计数数据模型,可适用于广泛复制的 RNA-seq 实验所产生的广泛多样化的表达谱。
BMC Bioinformatics. 2013 Aug 21;14:254. doi: 10.1186/1471-2105-14-254.

引用本文的文献

1
Robust and Adaptive Non-Parametric Tests for Detecting General Distributional Shifts in Gene Expression.用于检测基因表达中一般分布变化的稳健自适应非参数检验
bioRxiv. 2025 Mar 11:2025.03.06.641952. doi: 10.1101/2025.03.06.641952.
2
Exploring cell-to-cell variability and functional insights through differentially variable gene analysis.通过差异可变基因分析探索细胞间变异性和功能见解。
NPJ Syst Biol Appl. 2025 Mar 20;11(1):29. doi: 10.1038/s41540-025-00507-z.
3
Luminal epithelial cells integrate variable responses to aging into stereotypical changes that underlie breast cancer susceptibility.

本文引用的文献

1
Systematic characterization of lncRNAs' cell-to-cell expression heterogeneity in glioblastoma cells.胶质母细胞瘤细胞中lncRNAs细胞间表达异质性的系统表征
Oncotarget. 2016 Apr 5;7(14):18403-14. doi: 10.18632/oncotarget.7580.
2
Solar Simulated Ultraviolet Radiation Induces Global Histone Hypoacetylation in Human Keratinocytes.模拟太阳紫外线辐射诱导人角质形成细胞中整体组蛋白低乙酰化
PLoS One. 2016 Feb 26;11(2):e0150175. doi: 10.1371/journal.pone.0150175. eCollection 2016.
3
Assays to Study Consequences of Cytoplasmic Intermediate Filament Mutations: The Case of Epidermal Keratins.
管腔上皮细胞将对衰老的多种反应整合为构成乳腺癌易感性基础的典型变化。
Elife. 2024 Nov 15;13:e95720. doi: 10.7554/eLife.95720.
4
Gene coexpression networks reveal a broad role for lncRNAs in inflammatory bowel disease.基因共表达网络揭示了长链非编码 RNA 在炎症性肠病中的广泛作用。
JCI Insight. 2024 Feb 8;9(3):e168988. doi: 10.1172/jci.insight.168988.
5
Measuring cell-to-cell expression variability in single-cell RNA-sequencing data: a comparative analysis and applications to B cell aging.测量单细胞 RNA 测序数据中的细胞间表达变异性:比较分析及其在 B 细胞衰老中的应用。
Genome Biol. 2023 Oct 20;24(1):238. doi: 10.1186/s13059-023-03036-2.
6
clrDV: a differential variability test for RNA-Seq data based on the skew-normal distribution.clrDV:一种基于偏态正态分布的 RNA-Seq 数据差异变异性检验方法。
PeerJ. 2023 Sep 29;11:e16126. doi: 10.7717/peerj.16126. eCollection 2023.
7
Quantifying transcriptome diversity: a review.量化转录组多样性:综述。
Brief Funct Genomics. 2024 Mar 20;23(2):83-94. doi: 10.1093/bfgp/elad019.
8
Modeling group heteroscedasticity in single-cell RNA-seq pseudo-bulk data.对单细胞 RNA-seq 拟时间序列数据中的模型组异方差进行建模。
Genome Biol. 2023 May 5;24(1):107. doi: 10.1186/s13059-023-02949-2.
9
Detection of genes with differential expression dispersion unravels the role of autophagy in cancer progression.检测具有差异表达分散的基因揭示了自噬在癌症进展中的作用。
PLoS Comput Biol. 2023 Mar 9;19(3):e1010342. doi: 10.1371/journal.pcbi.1010342. eCollection 2023 Mar.
10
Captivity induces large and population-dependent brain transcriptomic changes in wild-caught cane toads (Rhinella marina). captivity 诱导野生捕获的甘蔗蟾蜍(Rhinella marina)的大脑转录组发生大规模且依赖于种群的变化。
Mol Ecol. 2022 Oct;31(19):4949-4961. doi: 10.1111/mec.16633. Epub 2022 Aug 4.
研究细胞质中间丝突变后果的实验:以表皮角蛋白为例。
Methods Enzymol. 2016;568:219-53. doi: 10.1016/bs.mie.2015.09.030. Epub 2016 Jan 6.
4
Increased Variability of Genomic Transcription in Schizophrenia.精神分裂症中基因组转录变异性增加
Sci Rep. 2015 Dec 10;5:17995. doi: 10.1038/srep17995.
5
Design and Analysis of Single-Cell Sequencing Experiments.单细胞测序实验的设计与分析。
Cell. 2015 Nov 5;163(4):799-810. doi: 10.1016/j.cell.2015.10.039.
6
Single-cell RNA-seq reveals changes in cell cycle and differentiation programs upon aging of hematopoietic stem cells.单细胞RNA测序揭示了造血干细胞衰老过程中细胞周期和分化程序的变化。
Genome Res. 2015 Dec;25(12):1860-72. doi: 10.1101/gr.192237.115. Epub 2015 Oct 1.
7
Pathogen Cell-to-Cell Variability Drives Heterogeneity in Host Immune Responses.病原体细胞间变异性驱动宿主免疫反应的异质性。
Cell. 2015 Sep 10;162(6):1309-21. doi: 10.1016/j.cell.2015.08.027. Epub 2015 Sep 3.
8
Variability of Gene Expression Identifies Transcriptional Regulators of Early Human Embryonic Development.基因表达的变异性确定了人类早期胚胎发育的转录调节因子。
PLoS Genet. 2015 Aug 19;11(8):e1005428. doi: 10.1371/journal.pgen.1005428. eCollection 2015 Aug.
9
Single-cell mRNA sequencing identifies subclonal heterogeneity in anti-cancer drug responses of lung adenocarcinoma cells.单细胞mRNA测序揭示了肺腺癌细胞抗癌药物反应中的亚克隆异质性。
Genome Biol. 2015 Jun 19;16(1):127. doi: 10.1186/s13059-015-0692-3.
10
Deep sequencing reveals cell-type-specific patterns of single-cell transcriptome variation.深度测序揭示了单细胞转录组变异的细胞类型特异性模式。
Genome Biol. 2015 Jun 9;16(1):122. doi: 10.1186/s13059-015-0683-4.