• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

两个负二项分布差异的理论框架及其在测序数据比较分析中的应用。

Theoretical framework for the difference of two negative binomial distributions and its application in comparative analysis of sequencing data.

机构信息

Department of Biological and Biomedical Sciences, Rowan University, Glassboro, New Jersey 08028, USA.

Moorestown High School, Moorestown, New Jersey 08057, USA.

出版信息

Genome Res. 2024 Oct 29;34(10):1636-1650. doi: 10.1101/gr.278843.123.

DOI:10.1101/gr.278843.123
PMID:39406498
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11529838/
Abstract

High-throughput sequencing (HTS) technologies have been instrumental in investigating biological questions at the bulk and single-cell levels. Comparative analysis of two HTS data sets often relies on testing the statistical significance for the difference of two negative binomial distributions (DOTNB). Although negative binomial distributions are well studied, the theoretical results for DOTNB remain largely unexplored. Here, we derive basic analytical results for DOTNB and examine its asymptotic properties. As a state-of-the-art application of DOTNB, we introduce DEGage, a computational method for detecting differentially expressed genes (DEGs) in scRNA-seq data. DEGage calculates the mean of the sample-wise differences of gene expression levels as the test statistic and determines significant differential expression by computing the -value with DOTNB. Extensive validation using simulated and real scRNA-seq data sets demonstrates that DEGage outperforms five popular DEG analysis tools: DEGseq2, DEsingle, edgeR, Monocle3, and scDD. DEGage is robust against high dropout levels and exhibits superior sensitivity when applied to balanced and imbalanced data sets, even with small sample sizes. We utilize DEGage to analyze prostate cancer scRNA-seq data sets and identify marker genes for 17 cell types. Furthermore, we apply DEGage to scRNA-seq data sets of mouse neurons with and without fear memory and reveal eight potential memory-related genes overlooked in previous analyses. The theoretical results and supporting software for DOTNB can be widely applied to comparative analyses of dispersed count data in HTS and broad research questions.

摘要

高通量测序 (HTS) 技术在研究整体和单细胞水平的生物学问题方面发挥了重要作用。比较两个 HTS 数据集通常依赖于测试两个负二项分布(DOTNB)差异的统计显著性。尽管负二项分布已经得到了很好的研究,但 DOTNB 的理论结果在很大程度上仍未得到探索。在这里,我们推导出 DOTNB 的基本分析结果,并研究其渐近性质。作为 DOTNB 的一项先进应用,我们引入了 DEGage,这是一种用于检测 scRNA-seq 数据中差异表达基因 (DEGs) 的计算方法。DEGage 计算基因表达水平的样本间差异的平均值作为检验统计量,并通过计算 DOTNB 的 - 值来确定显著的差异表达。使用模拟和真实的 scRNA-seq 数据集进行广泛验证表明,DEGage 优于五种流行的 DEG 分析工具:DEGseq2、DEsingle、edgeR、Monocle3 和 scDD。DEGage 对高辍学水平具有鲁棒性,并且在应用于平衡和不平衡数据集时表现出优异的灵敏度,即使样本量较小也是如此。我们利用 DEGage 分析前列腺癌 scRNA-seq 数据集,并鉴定出 17 种细胞类型的标记基因。此外,我们将 DEGage 应用于具有和不具有恐惧记忆的小鼠神经元 scRNA-seq 数据集,并揭示了之前分析中忽略的八个潜在的与记忆相关的基因。DOTNB 的理论结果和支持软件可广泛应用于 HTS 中离散计数数据的比较分析和广泛的研究问题。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4475/11529838/5d276a6abcef/1636f05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4475/11529838/e6b97fc98544/1636f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4475/11529838/72531af12001/1636f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4475/11529838/db176a97a963/1636f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4475/11529838/4dfad19d0df4/1636f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4475/11529838/5d276a6abcef/1636f05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4475/11529838/e6b97fc98544/1636f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4475/11529838/72531af12001/1636f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4475/11529838/db176a97a963/1636f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4475/11529838/4dfad19d0df4/1636f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4475/11529838/5d276a6abcef/1636f05.jpg

相似文献

1
Theoretical framework for the difference of two negative binomial distributions and its application in comparative analysis of sequencing data.两个负二项分布差异的理论框架及其在测序数据比较分析中的应用。
Genome Res. 2024 Oct 29;34(10):1636-1650. doi: 10.1101/gr.278843.123.
2
Detecting Fear-Memory-Related Genes from Neuronal scRNA-seq Data by Diverse Distributions and Bhattacharyya Distance.通过多种分布和巴氏距离从神经元 scRNA-seq 数据中检测到与恐惧记忆相关的基因。
Biomolecules. 2022 Aug 17;12(8):1130. doi: 10.3390/biom12081130.
3
Detection of high variability in gene expression from single-cell RNA-seq profiling.从单细胞RNA测序分析中检测基因表达的高变异性。
BMC Genomics. 2016 Aug 22;17 Suppl 7(Suppl 7):508. doi: 10.1186/s12864-016-2897-6.
4
Bias, robustness and scalability in single-cell differential expression analysis.单细胞差异表达分析中的偏差、稳健性和可扩展性。
Nat Methods. 2018 Apr;15(4):255-261. doi: 10.1038/nmeth.4612. Epub 2018 Feb 26.
5
An accessible, interactive GenePattern Notebook for analysis and exploration of single-cell transcriptomic data.一个用于分析和探索单细胞转录组数据的可访问的交互式基因模式笔记本。
F1000Res. 2018 Aug 16;7:1306. doi: 10.12688/f1000research.15830.2. eCollection 2018.
6
A flexible count data model to fit the wide diversity of expression profiles arising from extensively replicated RNA-seq experiments.一种灵活的计数数据模型,可适用于广泛复制的 RNA-seq 实验所产生的广泛多样化的表达谱。
BMC Bioinformatics. 2013 Aug 21;14:254. doi: 10.1186/1471-2105-14-254.
7
Data Analysis in Single-Cell Transcriptome Sequencing.单细胞转录组测序中的数据分析
Methods Mol Biol. 2018;1754:311-326. doi: 10.1007/978-1-4939-7717-8_18.
8
DEsingle for detecting three types of differential expression in single-cell RNA-seq data.DEsingle 用于检测单细胞 RNA-seq 数据中的三种差异表达。
Bioinformatics. 2018 Sep 15;34(18):3223-3224. doi: 10.1093/bioinformatics/bty332.
9
Visualization of Single Cell RNA-Seq Data Using t-SNE in R.使用 R 中的 t-SNE 可视化单细胞 RNA-Seq 数据。
Methods Mol Biol. 2020;2117:159-167. doi: 10.1007/978-1-0716-0301-7_8.
10
Single-cell RNA-seq denoising using a deep count autoencoder.基于深度计数自编码器的单细胞 RNA-seq 去噪。
Nat Commun. 2019 Jan 23;10(1):390. doi: 10.1038/s41467-018-07931-2.

引用本文的文献

1
Single-Cell Hi-C Technologies and Computational Data Analysis.单细胞Hi-C技术与计算数据分析
Adv Sci (Weinh). 2025 Mar;12(9):e2412232. doi: 10.1002/advs.202412232. Epub 2025 Jan 30.

本文引用的文献

1
Systematic benchmarking of single-cell ATAC-sequencing protocols.单细胞 ATAC-seq 测序协议的系统基准测试。
Nat Biotechnol. 2024 Jun;42(6):916-926. doi: 10.1038/s41587-023-01881-x. Epub 2023 Aug 3.
2
Methods and applications for single-cell and spatial multi-omics.单细胞和空间多组学的方法和应用。
Nat Rev Genet. 2023 Aug;24(8):494-515. doi: 10.1038/s41576-023-00580-2. Epub 2023 Mar 2.
3
Comprehensive multi-omics single-cell data integration reveals greater heterogeneity in the human immune system.综合多组学单细胞数据整合揭示了人类免疫系统中更大的异质性。
iScience. 2022 Sep 13;25(10):105123. doi: 10.1016/j.isci.2022.105123. eCollection 2022 Oct 21.
4
Detecting Fear-Memory-Related Genes from Neuronal scRNA-seq Data by Diverse Distributions and Bhattacharyya Distance.通过多种分布和巴氏距离从神经元 scRNA-seq 数据中检测到与恐惧记忆相关的基因。
Biomolecules. 2022 Aug 17;12(8):1130. doi: 10.3390/biom12081130.
5
Targeting acetyl-CoA metabolism attenuates the formation of fear memories through reduced activity-dependent histone acetylation.靶向乙酰辅酶 A 代谢通过减少活性依赖性组蛋白乙酰化来减弱恐惧记忆的形成。
Proc Natl Acad Sci U S A. 2022 Aug 9;119(32):e2114758119. doi: 10.1073/pnas.2114758119. Epub 2022 Aug 3.
6
Differential Expression Analysis of Single-Cell RNA-Seq Data: Current Statistical Approaches and Outstanding Challenges.单细胞RNA测序数据的差异表达分析:当前的统计方法与突出挑战
Entropy (Basel). 2022 Jul 18;24(7):995. doi: 10.3390/e24070995.
7
Recommendations of scRNA-seq Differential Gene Expression Analysis Based on Comprehensive Benchmarking.基于全面基准测试的单细胞RNA测序差异基因表达分析建议
Life (Basel). 2022 Jun 7;12(6):850. doi: 10.3390/life12060850.
8
Comprehensive characterization of the prostate tumor microenvironment identifies CXCR4/CXCL12 crosstalk as a novel antiangiogenic therapeutic target in prostate cancer.全面描绘前列腺肿瘤微环境,鉴定出 CXCR4/CXCL12 相互作用是前列腺癌新型抗血管生成治疗靶点。
Mol Cancer. 2022 Jun 18;21(1):132. doi: 10.1186/s12943-022-01597-7.
9
Comparison and evaluation of statistical error models for scRNA-seq.单细胞RNA测序(scRNA-seq)统计误差模型的比较与评估
Genome Biol. 2022 Jan 18;23(1):27. doi: 10.1186/s13059-021-02584-9.
10
PANTHER: Making genome-scale phylogenetics accessible to all.PANTHER:让所有人大开眼界的基因组系统发生学。
Protein Sci. 2022 Jan;31(1):8-22. doi: 10.1002/pro.4218. Epub 2021 Nov 25.