• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于零膨胀分位数秩分数的检验(ZIQRANK)及其在单细胞RNA测序差异基因表达分析中的应用

ZERO-INFLATED QUANTILE RANK-SCORE BASED TEST (ZIQRANK) WITH APPLICATION TO SCRNA-SEQ DIFFERENTIAL GENE EXPRESSION ANALYSIS.

作者信息

Ling Wodan, Zhang Wenfei, Cheng Bin, Wei Ying

机构信息

Public Health Sciences Division, Fred Hutchinson Cancer Research Center.

Sarepta Therapeutics.

出版信息

Ann Appl Stat. 2021 Dec;15(4):1673-1696. doi: 10.1214/21-aoas1442. Epub 2021 Dec 21.

DOI:10.1214/21-aoas1442
PMID:35116085
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8809477/
Abstract

Differential gene expression analysis based on scRNA-seq data is challenging due to two unique characteristics of scRNA-seq data. First, multimodality and other heterogeneity of the gene expression among different cell conditions lead to divergences in the tail events or crossings of the expression distributions. Second, scRNA-seq data generally have a considerable fraction of dropout events, causing zero inflation in the expression. To account for the first characteristic, existing parametric approaches targeting the mean difference in gene expression are limited, while quantile regression that examines various locations in the distribution will improve the power. However, the second characteristic, zero inflation, makes the traditional quantile regression invalid and underpowered. We propose a quantile-based test that handles the two characteristics, multimodality and zero inflation, simultaneously. The proposed quantile rank-score based test for differential distribution detection (ZIQRank) is derived under a two-part quantile regression model for zero-inflated outcomes. It comprises a test in logistic modeling for the zero counts and a collection of rank-score tests adjusting for zero inflation at multiple prespecified quantiles of the positive part. The testing decision is based on an aggregate result by combining the marginal -values by MinP or Cauchy procedure. The proposed test is asymptotically justified and evaluated with simulation studies. It shows a higher precision-recall AUC in detecting true differentially expressed genes (DEGs) than the existing methods. We apply the ZIQRank test to a TPM scRNA-seq data on human glioblastoma tumors and exclusively identify a group of DEGs between neoplastic and nonneoplastic cells, which are heterogeneous and have been proved to be associated with glioma. Application to a UMI count scRNA-seq data on cells from mouse intestinal organoids further demonstrates the capability of ZIQRank to improve and complement the existing approaches.

摘要

基于单细胞RNA测序(scRNA-seq)数据进行差异基因表达分析具有挑战性,这是由于scRNA-seq数据的两个独特特征。首先,不同细胞状态下基因表达的多模态性和其他异质性导致表达分布的尾部事件或交叉出现差异。其次,scRNA-seq数据通常有相当一部分缺失事件,导致表达出现零膨胀。为了解决第一个特征,现有的针对基因表达平均差异的参数方法有限,而检查分布中不同位置的分位数回归将提高检验效能。然而,第二个特征,即零膨胀,使得传统的分位数回归无效且效能不足。我们提出了一种基于分位数的检验方法,可同时处理多模态性和零膨胀这两个特征。所提出的基于分位数秩得分的差异分布检测检验(ZIQRank)是在一个用于零膨胀结果的两部分分位数回归模型下推导出来的。它包括一个用于零计数的逻辑建模检验以及一组在正部分的多个预先指定分位数处针对零膨胀进行调整的秩得分检验。检验决策基于通过MinP或柯西程序组合边际P值的汇总结果。所提出的检验在渐近意义上是合理的,并通过模拟研究进行了评估。与现有方法相比,它在检测真正的差异表达基因(DEG)时显示出更高的精确召回率AUC。我们将ZIQRank检验应用于人类胶质母细胞瘤肿瘤的TPM scRNA-seq数据,并专门鉴定出一组肿瘤细胞与非肿瘤细胞之间的DEG,这些基因具有异质性且已被证明与胶质瘤相关。将其应用于来自小鼠肠道类器官细胞的UMI计数scRNA-seq数据进一步证明了ZIQRank改进和补充现有方法的能力。

相似文献

1
ZERO-INFLATED QUANTILE RANK-SCORE BASED TEST (ZIQRANK) WITH APPLICATION TO SCRNA-SEQ DIFFERENTIAL GENE EXPRESSION ANALYSIS.基于零膨胀分位数秩分数的检验(ZIQRANK)及其在单细胞RNA测序差异基因表达分析中的应用
Ann Appl Stat. 2021 Dec;15(4):1673-1696. doi: 10.1214/21-aoas1442. Epub 2021 Dec 21.
2
Powerful and robust non-parametric association testing for microbiome data via a zero-inflated quantile approach (ZINQ).基于零膨胀分位数方法(ZINQ)的微生物组数据强大而稳健的非参数关联检验。
Microbiome. 2021 Sep 2;9(1):181. doi: 10.1186/s40168-021-01129-3.
3
ZIAQ: a quantile regression method for differential expression analysis of single-cell RNA-seq data.ZIAQ:一种用于单细胞 RNA-seq 数据差异表达分析的分位数回归方法。
Bioinformatics. 2020 May 1;36(10):3124-3130. doi: 10.1093/bioinformatics/btaa098.
4
Identifying Differentially Expressed Genes of Zero Inflated Single Cell RNA Sequencing Data Using Mixed Model Score Tests.使用混合模型得分检验识别零膨胀单细胞RNA测序数据中的差异表达基因。
Front Genet. 2021 Feb 5;12:616686. doi: 10.3389/fgene.2021.616686. eCollection 2021.
5
Statistical methods for analysis of single-cell RNA-sequencing data.用于单细胞RNA测序数据分析的统计方法。
MethodsX. 2021 Nov 17;8:101580. doi: 10.1016/j.mex.2021.101580. eCollection 2021.
6
Observation weights unlock bulk RNA-seq tools for zero inflation and single-cell applications.观测权重为零膨胀和单细胞应用解锁了批量 RNA-seq 工具。
Genome Biol. 2018 Feb 26;19(1):24. doi: 10.1186/s13059-018-1406-4.
7
Bayesian model selection reveals biological origins of zero inflation in single-cell transcriptomics.贝叶斯模型选择揭示了单细胞转录组学中零膨胀的生物学起源。
Genome Biol. 2020 Jul 27;21(1):183. doi: 10.1186/s13059-020-02103-2.
8
Bayesian gamma-negative binomial modeling of single-cell RNA sequencing data.贝叶斯伽马负二项式模型在单细胞 RNA 测序数据分析中的应用。
BMC Genomics. 2020 Sep 9;21(Suppl 9):585. doi: 10.1186/s12864-020-06938-8.
9
Missing Value Imputation With Low-Rank Matrix Completion in Single-Cell RNA-Seq Data by Considering Cell Heterogeneity.通过考虑细胞异质性利用低秩矩阵补全对单细胞RNA测序数据中的缺失值进行插补
Front Genet. 2022 Jul 14;13:952649. doi: 10.3389/fgene.2022.952649. eCollection 2022.
10
Detecting differential alternative splicing events in scRNA-seq with or without Unique Molecular Identifiers.在有或没有独特分子标识符的 scRNA-seq 中检测差异剪接事件。
PLoS Comput Biol. 2020 Jun 5;16(6):e1007925. doi: 10.1371/journal.pcbi.1007925. eCollection 2020 Jun.

引用本文的文献

1
A semiparametric quantile regression rank score test for zero-inflated data.用于零膨胀数据的半参数分位数回归秩得分检验。
Biometrics. 2025 Apr 2;81(2). doi: 10.1093/biomtc/ujaf050.
2
Differential Expression Analysis of Single-Cell RNA-Seq Data: Current Statistical Approaches and Outstanding Challenges.单细胞RNA测序数据的差异表达分析:当前的统计方法与突出挑战
Entropy (Basel). 2022 Jul 18;24(7):995. doi: 10.3390/e24070995.

本文引用的文献

1
Cauchy combination test: a powerful test with analytic -value calculation under arbitrary dependency structures.柯西组合检验:一种在任意相依结构下具有解析值计算功能的强大检验。
J Am Stat Assoc. 2020;115(529):393-402. doi: 10.1080/01621459.2018.1554485. Epub 2019 Apr 25.
2
ZIAQ: a quantile regression method for differential expression analysis of single-cell RNA-seq data.ZIAQ:一种用于单细胞 RNA-seq 数据差异表达分析的分位数回归方法。
Bioinformatics. 2020 May 1;36(10):3124-3130. doi: 10.1093/bioinformatics/btaa098.
3
CHD4 regulates the DNA damage response and RAD51 expression in glioblastoma.CHD4 调控胶质母细胞瘤中的 DNA 损伤反应和 RAD51 表达。
Sci Rep. 2019 Mar 14;9(1):4444. doi: 10.1038/s41598-019-40327-w.
4
Quantile Regression in the Secondary Analysis of Case-Control Data.病例对照数据二次分析中的分位数回归
J Am Stat Assoc. 2016;111(513):344-354. doi: 10.1080/01621459.2015.1008101. Epub 2016 May 5.
5
Comparative analysis of differential gene expression analysis tools for single-cell RNA sequencing data.单细胞 RNA 测序数据分析工具的差异基因表达分析比较。
BMC Bioinformatics. 2019 Jan 18;20(1):40. doi: 10.1186/s12859-019-2599-6.
6
Bias, robustness and scalability in single-cell differential expression analysis.单细胞差异表达分析中的偏差、稳健性和可扩展性。
Nat Methods. 2018 Apr;15(4):255-261. doi: 10.1038/nmeth.4612. Epub 2018 Feb 26.
7
RNA-Seq differential expression analysis: An extended review and a software tool.RNA测序差异表达分析:扩展综述与软件工具
PLoS One. 2017 Dec 21;12(12):e0190152. doi: 10.1371/journal.pone.0190152. eCollection 2017.
8
Single-Cell RNA-Seq Analysis of Infiltrating Neoplastic Cells at the Migrating Front of Human Glioblastoma.单细胞 RNA 测序分析人类脑胶质瘤迁移前沿浸润性肿瘤细胞。
Cell Rep. 2017 Oct 31;21(5):1399-1410. doi: 10.1016/j.celrep.2017.10.030.
9
Unified Sequence-Based Association Tests Allowing for Multiple Functional Annotations and Meta-analysis of Noncoding Variation in Metabochip Data.基于统一序列的关联测试,支持多种功能注释以及代谢芯片数据中非编码变异的荟萃分析。
Am J Hum Genet. 2017 Sep 7;101(3):340-352. doi: 10.1016/j.ajhg.2017.07.011. Epub 2017 Aug 24.
10
Single-Cell RNA-Sequencing: Assessment of Differential Expression Analysis Methods.单细胞RNA测序:差异表达分析方法评估
Front Genet. 2017 May 23;8:62. doi: 10.3389/fgene.2017.00062. eCollection 2017.