• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用混合模型得分检验识别零膨胀单细胞RNA测序数据中的差异表达基因。

Identifying Differentially Expressed Genes of Zero Inflated Single Cell RNA Sequencing Data Using Mixed Model Score Tests.

作者信息

He Zhiqiang, Pan Yueyun, Shao Fang, Wang Hui

机构信息

Department of Biostatistics, School of Public Health, Nanjing Medical University, Nanjing, China.

First Clinical Medical College, Nanjing Medical University, Nanjing, China.

出版信息

Front Genet. 2021 Feb 5;12:616686. doi: 10.3389/fgene.2021.616686. eCollection 2021.

DOI:10.3389/fgene.2021.616686
PMID:33613638
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7894898/
Abstract

Single cell RNA sequencing (scRNA-seq) allows quantitative measurement and comparison of gene expression at the resolution of single cells. Ignoring the batch effects and zero inflation of scRNA-seq data, many proposed differentially expressed (DE) methods might generate bias. We propose a method, single cell mixed model score tests (scMMSTs), to efficiently identify DE genes of scRNA-seq data with batch effects using the generalized linear mixed model (GLMM). scMMSTs treat the batch effect as a random effect. For zero inflation, scMMSTs use a weighting strategy to calculate observational weights for counts independently under zero-inflated and zero-truncated distributions. Counts data with calculated weights were subsequently analyzed using weighted GLMMs. The theoretical null distributions of the score statistics were constructed by mixed Chi-square distributions. Intensive simulations and two real datasets were used to compare edgeR-zinbwave, DESeq2-zinbwave, and scMMSTs. Our study demonstrates that scMMSTs, as supplement to standard methods, are advantageous to define DE genes of zero-inflated scRNA-seq data with batch effects.

摘要

单细胞RNA测序(scRNA-seq)能够在单细胞分辨率下对基因表达进行定量测量和比较。忽略scRNA-seq数据的批次效应和零膨胀问题,许多已提出的差异表达(DE)方法可能会产生偏差。我们提出了一种方法,即单细胞混合模型得分检验(scMMSTs),以使用广义线性混合模型(GLMM)有效地识别具有批次效应的scRNA-seq数据中的DE基因。scMMSTs将批次效应视为随机效应。对于零膨胀问题,scMMSTs使用加权策略在零膨胀和零截断分布下独立计算计数的观测权重。随后使用加权GLMM对具有计算权重的计数数据进行分析。得分统计量的理论零分布由混合卡方分布构建。通过密集模拟和两个真实数据集对edgeR-zinbwave、DESeq2-zinbwave和scMMSTs进行了比较。我们的研究表明,作为标准方法的补充,scMMSTs有利于定义具有批次效应的零膨胀scRNA-seq数据中的DE基因。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e2c/7894898/7ca2b23c0d96/fgene-12-616686-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e2c/7894898/009de893d574/fgene-12-616686-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e2c/7894898/2f949a984aea/fgene-12-616686-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e2c/7894898/b587f4461170/fgene-12-616686-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e2c/7894898/6c613bdffa85/fgene-12-616686-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e2c/7894898/7ca2b23c0d96/fgene-12-616686-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e2c/7894898/009de893d574/fgene-12-616686-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e2c/7894898/2f949a984aea/fgene-12-616686-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e2c/7894898/b587f4461170/fgene-12-616686-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e2c/7894898/6c613bdffa85/fgene-12-616686-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e2c/7894898/7ca2b23c0d96/fgene-12-616686-g005.jpg

相似文献

1
Identifying Differentially Expressed Genes of Zero Inflated Single Cell RNA Sequencing Data Using Mixed Model Score Tests.使用混合模型得分检验识别零膨胀单细胞RNA测序数据中的差异表达基因。
Front Genet. 2021 Feb 5;12:616686. doi: 10.3389/fgene.2021.616686. eCollection 2021.
2
Observation weights unlock bulk RNA-seq tools for zero inflation and single-cell applications.观测权重为零膨胀和单细胞应用解锁了批量 RNA-seq 工具。
Genome Biol. 2018 Feb 26;19(1):24. doi: 10.1186/s13059-018-1406-4.
3
Differential expression of single-cell RNA-seq data using Tweedie models.基于 Tweedie 模型的单细胞 RNA-seq 数据差异表达分析。
Stat Med. 2022 Aug 15;41(18):3492-3510. doi: 10.1002/sim.9430. Epub 2022 Jun 2.
4
ZERO-INFLATED QUANTILE RANK-SCORE BASED TEST (ZIQRANK) WITH APPLICATION TO SCRNA-SEQ DIFFERENTIAL GENE EXPRESSION ANALYSIS.基于零膨胀分位数秩分数的检验(ZIQRANK)及其在单细胞RNA测序差异基因表达分析中的应用
Ann Appl Stat. 2021 Dec;15(4):1673-1696. doi: 10.1214/21-aoas1442. Epub 2021 Dec 21.
5
Bayesian gamma-negative binomial modeling of single-cell RNA sequencing data.贝叶斯伽马负二项式模型在单细胞 RNA 测序数据分析中的应用。
BMC Genomics. 2020 Sep 9;21(Suppl 9):585. doi: 10.1186/s12864-020-06938-8.
6
Bayesian model selection reveals biological origins of zero inflation in single-cell transcriptomics.贝叶斯模型选择揭示了单细胞转录组学中零膨胀的生物学起源。
Genome Biol. 2020 Jul 27;21(1):183. doi: 10.1186/s13059-020-02103-2.
7
Inverse weighting method with jackknife variance estimator for differential expression analysis of single-cell RNA sequencing data.基于刀切方差估计的逆加权法用于单细胞 RNA 测序数据分析中的差异表达分析。
Comput Biol Chem. 2022 Oct;100:107733. doi: 10.1016/j.compbiolchem.2022.107733. Epub 2022 Jul 18.
8
Variance component testing for identifying differentially expressed genes in RNA-seq data.用于识别RNA测序数据中差异表达基因的方差成分检验
PeerJ. 2017 Sep 8;5:e3797. doi: 10.7717/peerj.3797. eCollection 2017.
9
iDESC: identifying differential expression in single-cell RNA sequencing data with multiple subjects.iDESC:基于多个样本的单细胞 RNA 测序数据差异表达识别。
BMC Bioinformatics. 2023 Aug 22;24(1):318. doi: 10.1186/s12859-023-05432-8.
10
Reproducibility of Methods to Detect Differentially Expressed Genes from Single-Cell RNA Sequencing.从单细胞RNA测序中检测差异表达基因方法的可重复性
Front Genet. 2020 Jan 17;10:1331. doi: 10.3389/fgene.2019.01331. eCollection 2019.

引用本文的文献

1
Leveraging gene correlations in single cell transcriptomic data.利用单细胞转录组数据中的基因相关性。
BMC Bioinformatics. 2024 Sep 18;25(1):305. doi: 10.1186/s12859-024-05926-z.
2
Challenges and best practices in omics benchmarking.组学基准测试中的挑战和最佳实践。
Nat Rev Genet. 2024 May;25(5):326-339. doi: 10.1038/s41576-023-00679-6. Epub 2024 Jan 12.
3
Leveraging gene correlations in single cell transcriptomic data.利用单细胞转录组数据中的基因相关性。

本文引用的文献

1
Single cell characterization of B-lymphoid differentiation and leukemic cell states during chemotherapy in ETV6-RUNX1-positive pediatric leukemia identifies drug-targetable transcription factor activities.在 ETV6-RUNX1 阳性小儿白血病的化疗过程中对 B 淋巴细胞分化和白血病细胞状态进行单细胞特征分析,确定了可靶向药物的转录因子活性。
Genome Med. 2020 Nov 20;12(1):99. doi: 10.1186/s13073-020-00799-2.
2
An Efficient and Flexible Method for Deconvoluting Bulk RNA-Seq Data with Single-Cell RNA-Seq Data.一种利用单细胞 RNA-Seq 数据对 bulk RNA-Seq 数据进行去卷积的高效灵活方法。
Cells. 2019 Sep 27;8(10):1161. doi: 10.3390/cells8101161.
3
bioRxiv. 2023 Nov 1:2023.03.14.532643. doi: 10.1101/2023.03.14.532643.
4
Differential Expression Analysis of Single-Cell RNA-Seq Data: Current Statistical Approaches and Outstanding Challenges.单细胞RNA测序数据的差异表达分析:当前的统计方法与突出挑战
Entropy (Basel). 2022 Jul 18;24(7):995. doi: 10.3390/e24070995.
Current best practices in single-cell RNA-seq analysis: a tutorial.
单细胞 RNA 测序分析的当前最佳实践:教程。
Mol Syst Biol. 2019 Jun 19;15(6):e8746. doi: 10.15252/msb.20188746.
4
Batch correction evaluation framework using a-priori gene-gene associations: applied to the GTEx dataset.基于先验基因-基因关联的批量校正评估框架:在 GTEx 数据集上的应用。
BMC Bioinformatics. 2019 May 28;20(1):268. doi: 10.1186/s12859-019-2855-9.
5
Identifying and exploiting gene-pathway interactions from RNA-seq data for binary phenotype.从 RNA-seq 数据中识别和利用基因-通路相互作用进行二元表型分析。
BMC Genet. 2019 Mar 19;20(1):36. doi: 10.1186/s12863-019-0739-7.
6
Efficient Variant Set Mixed Model Association Tests for Continuous and Binary Traits in Large-Scale Whole-Genome Sequencing Studies.高效的变体集混合模型关联测试在全基因组测序研究中用于连续和二项性状。
Am J Hum Genet. 2019 Feb 7;104(2):260-274. doi: 10.1016/j.ajhg.2018.12.012. Epub 2019 Jan 10.
7
Developmental Heterogeneity of Microglia and Brain Myeloid Cells Revealed by Deep Single-Cell RNA Sequencing.深度单细胞 RNA 测序揭示小胶质细胞和脑髓系细胞的发育异质性。
Neuron. 2019 Jan 16;101(2):207-223.e10. doi: 10.1016/j.neuron.2018.12.006. Epub 2018 Dec 31.
8
A test metric for assessing single-cell RNA-seq batch correction.一种用于评估单细胞 RNA-seq 批次校正的测试指标。
Nat Methods. 2019 Jan;16(1):43-49. doi: 10.1038/s41592-018-0254-1. Epub 2018 Dec 20.
9
Heritability estimation and differential analysis of count data with generalized linear mixed models in genomic sequencing studies.基于广义线性混合模型的基因组测序研究中计数数据的遗传力估计和差异分析。
Bioinformatics. 2019 Feb 1;35(3):487-496. doi: 10.1093/bioinformatics/bty644.
10
Intra-tumour diversification in colorectal cancer at the single-cell level.单细胞水平的结直肠癌肿瘤内多样化。
Nature. 2018 Apr;556(7702):457-462. doi: 10.1038/s41586-018-0024-3. Epub 2018 Apr 11.