• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于阐明单细胞与批量基因表达之间关系的数学模型,以明确批量基因表达数据的解释。

Mathematical model for the relationship between single-cell and bulk gene expression to clarify the interpretation of bulk gene expression data.

作者信息

Okada Daigo, Zheng Cheng, Cheng Jian Hao

机构信息

Center for Genomic Medicine, Graduate School of Medicine, Kyoto University, South Research Bldg. No.1(5F), 53 Shogoinkawahara-cho, Sakyo-ku, Kyoto 6068507, Kyoto, Japan.

出版信息

Comput Struct Biotechnol J. 2022 Sep 5;20:4850-4859. doi: 10.1016/j.csbj.2022.08.062. eCollection 2022.

DOI:10.1016/j.csbj.2022.08.062
PMID:36147671
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9474327/
Abstract

BACKGROUND

Differential expression analysis is a standard approach in molecular biology. For example, genes whose expression levels differ between diseased and non-diseased samples are considered to be associated with that disease. On the other hand, differential variability analysis focuses on the differences of the variances of gene expression between sample groups. Although differential variability is also known to capture biological information, its interpretation remains unclear and controversial. Recent single-cell analyses have revealed that differences between sample groups can affect gene expression in a cellular subset-specific manner or by altering the proportion of a particular cellular subset. The aim of this study is to clarify the interpretation of mean and variance of bulk gene expression data.

METHOD

We developed a mathematical model in which the bulk gene expression value is proportional to the mean value of the single-cell gene expression profile. Based on this model, we performed theoretical, simulated and real single-cell RNA-seq data analyses.

RESULT AND CONCLUSION

We identified how differences in single-cell gene expression profiles affect the differences in the mean and the variance of bulk gene expression. It is shown that differential expression analysis of bulk expression data can overlook significant changes in gene expression at the single-cell level. Further, differential variability analysis capture the complex feature affected by different gene expression shifts for each subset, changes in the proportions of cellular subsets, and variation in single-cell distribution parameters among samples.

摘要

背景

差异表达分析是分子生物学中的一种标准方法。例如,在患病样本和未患病样本之间表达水平不同的基因被认为与该疾病相关。另一方面,差异变异性分析关注样本组之间基因表达方差的差异。尽管差异变异性也被认为可以捕获生物学信息,但其解释仍不明确且存在争议。最近的单细胞分析表明,样本组之间的差异可以以细胞亚群特异性的方式或通过改变特定细胞亚群的比例来影响基因表达。本研究的目的是阐明批量基因表达数据均值和方差的解释。

方法

我们开发了一个数学模型,其中批量基因表达值与单细胞基因表达谱的均值成正比。基于该模型,我们进行了理论、模拟和实际单细胞RNA测序数据分析。

结果与结论

我们确定了单细胞基因表达谱的差异如何影响批量基因表达均值和方差的差异。结果表明,批量表达数据的差异表达分析可能会忽略单细胞水平上基因表达的显著变化。此外,差异变异性分析捕获了受每个亚群不同基因表达变化、细胞亚群比例变化以及样本间单细胞分布参数变化影响的复杂特征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c06/9474327/65bcadef3cca/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c06/9474327/6ea6a8199c70/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c06/9474327/c06cf99862ad/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c06/9474327/1383be8babf5/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c06/9474327/92376aa73419/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c06/9474327/99e6570a3158/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c06/9474327/9e5341c7782d/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c06/9474327/65bcadef3cca/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c06/9474327/6ea6a8199c70/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c06/9474327/c06cf99862ad/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c06/9474327/1383be8babf5/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c06/9474327/92376aa73419/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c06/9474327/99e6570a3158/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c06/9474327/9e5341c7782d/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c06/9474327/65bcadef3cca/gr6.jpg

相似文献

1
Mathematical model for the relationship between single-cell and bulk gene expression to clarify the interpretation of bulk gene expression data.用于阐明单细胞与批量基因表达之间关系的数学模型,以明确批量基因表达数据的解释。
Comput Struct Biotechnol J. 2022 Sep 5;20:4850-4859. doi: 10.1016/j.csbj.2022.08.062. eCollection 2022.
2
Beta-Poisson model for single-cell RNA-seq data analyses.单细胞 RNA-seq 数据分析的 Beta-Poisson 模型。
Bioinformatics. 2016 Jul 15;32(14):2128-35. doi: 10.1093/bioinformatics/btw202. Epub 2016 Apr 19.
3
Detecting cell-type-specific allelic expression imbalance by integrative analysis of bulk and single-cell RNA sequencing data.通过整合分析批量和单细胞 RNA 测序数据检测细胞类型特异性等位基因表达失衡。
PLoS Genet. 2021 Mar 4;17(3):e1009080. doi: 10.1371/journal.pgen.1009080. eCollection 2021 Mar.
4
Identification of differentially distributed gene expression and distinct sets of cancer-related genes identified by changes in mean and variability.通过均值和变异性变化鉴定差异分布的基因表达以及不同的癌症相关基因集。
NAR Genom Bioinform. 2022 Jan 14;4(1):lqab124. doi: 10.1093/nargab/lqab124. eCollection 2022 Mar.
5
Category encoding method to select feature genes for the classification of bulk and single-cell RNA-seq data.类别编码方法用于对批量和单细胞 RNA-seq 数据进行分类的特征基因选择。
Stat Med. 2021 Aug 15;40(18):4077-4089. doi: 10.1002/sim.9015. Epub 2021 May 24.
6
Improving bulk RNA-seq classification by transferring gene signature from single cells in acute myeloid leukemia.通过从急性髓系白血病的单细胞中转录基因特征提高批量 RNA-seq 分类。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac002.
7
Endometrial receptivity revisited: endometrial transcriptome adjusted for tissue cellular heterogeneity.子宫内膜容受性再探:对组织细胞异质性进行调整的子宫内膜转录组。
Hum Reprod. 2018 Nov 1;33(11):2074-2086. doi: 10.1093/humrep/dey301.
8
Recommendations of scRNA-seq Differential Gene Expression Analysis Based on Comprehensive Benchmarking.基于全面基准测试的单细胞RNA测序差异基因表达分析建议
Life (Basel). 2022 Jun 7;12(6):850. doi: 10.3390/life12060850.
9
Detection of differentially expressed genes in discrete single-cell RNA sequencing data using a hurdle model with correlated random effects.使用具有相关随机效应的障碍模型检测离散单细胞RNA测序数据中的差异表达基因。
Biometrics. 2019 Dec;75(4):1051-1062. doi: 10.1111/biom.13074. Epub 2019 Jun 17.
10
Integrated bulk and single-cell RNA-sequencing identified disease-relevant monocytes and a gene network module underlying systemic sclerosis.整合批量和单细胞 RNA 测序鉴定了与疾病相关的单核细胞和系统性硬化症的基因网络模块。
J Autoimmun. 2021 Jan;116:102547. doi: 10.1016/j.jaut.2020.102547. Epub 2020 Oct 8.

引用本文的文献

1
Application of a mathematical model to clarify the statistical characteristics of a pan-tissue DNA methylation clock.应用数学模型阐明泛组织 DNA 甲基化时钟的统计特征。
Geroscience. 2024 Apr;46(2):2001-2015. doi: 10.1007/s11357-023-00949-5. Epub 2023 Oct 3.
2
Characterizing the landscape of gene expression variance in humans.描述人类基因表达方差的特征。
PLoS Genet. 2023 Jul 6;19(7):e1010833. doi: 10.1371/journal.pgen.1010833. eCollection 2023 Jul.
3
Data-driven identification and classification of nonlinear aging patterns reveals the landscape of associations between DNA methylation and aging.

本文引用的文献

1
Comparative Study of Transcriptome in the Hearts Isolated from Mice, Rats, and Humans.比较从小鼠、大鼠和人心脏中分离的转录组。
Biomolecules. 2022 Jun 20;12(6):859. doi: 10.3390/biom12060859.
2
Single-cell eQTL models reveal dynamic T cell state dependence of disease loci.单细胞 eQTL 模型揭示疾病相关位点的 T 细胞状态依赖性动态变化。
Nature. 2022 Jun;606(7912):120-128. doi: 10.1038/s41586-022-04713-1. Epub 2022 May 11.
3
Single-cell eQTL mapping identifies cell type-specific genetic control of autoimmune disease.单细胞 eQTL 图谱分析确定自身免疫性疾病的细胞类型特异性遗传调控。
数据驱动的非线性衰老模式识别与分类揭示了 DNA 甲基化与衰老之间关联的全景。
Hum Genomics. 2023 Feb 11;17(1):8. doi: 10.1186/s40246-023-00453-z.
Science. 2022 Apr 8;376(6589):eabf3041. doi: 10.1126/science.abf3041.
4
Single-Cell Analysis Reveals Unexpected Cellular Changes and Transposon Expression Signatures in the Colonic Epithelium of Treatment-Naïve Adult Crohn's Disease Patients.单细胞分析揭示了治疗初治的成年克罗恩病患者结肠上皮中意想不到的细胞变化和转座子表达特征。
Cell Mol Gastroenterol Hepatol. 2022;13(6):1717-1740. doi: 10.1016/j.jcmgh.2022.02.005. Epub 2022 Feb 12.
5
Statistics or biology: the zero-inflation controversy about scRNA-seq data.统计学还是生物学:关于 scRNA-seq 数据的零膨胀争议。
Genome Biol. 2022 Jan 21;23(1):31. doi: 10.1186/s13059-022-02601-5.
6
Identification of differentially distributed gene expression and distinct sets of cancer-related genes identified by changes in mean and variability.通过均值和变异性变化鉴定差异分布的基因表达以及不同的癌症相关基因集。
NAR Genom Bioinform. 2022 Jan 14;4(1):lqab124. doi: 10.1093/nargab/lqab124. eCollection 2022 Mar.
7
Beyondcell: targeting cancer therapeutic heterogeneity in single-cell RNA-seq data.Beyondcell:单细胞 RNA-seq 数据中靶向癌症治疗异质性。
Genome Med. 2021 Dec 16;13(1):187. doi: 10.1186/s13073-021-01001-x.
8
Cell population-based framework of genetic epidemiology in the single-cell omics era.单细胞组学时代的基于细胞群体的遗传流行病学框架。
Bioessays. 2022 Jan;44(1):e2100118. doi: 10.1002/bies.202100118. Epub 2021 Nov 25.
9
Data-driven comparison of multiple high-dimensional single-cell expression profiles.基于数据驱动的多种高维单细胞表达谱的比较。
J Hum Genet. 2022 Apr;67(4):215-221. doi: 10.1038/s10038-021-00989-9. Epub 2021 Nov 1.
10
Detection of differentially abundant cell subpopulations in scRNA-seq data.单细胞 RNA 测序数据中差异丰度细胞亚群的检测。
Proc Natl Acad Sci U S A. 2021 Jun 1;118(22). doi: 10.1073/pnas.2100293118.