• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用隶属度模型可视化RNA测序表达数据的结构。

Visualizing the structure of RNA-seq expression data using grade of membership models.

作者信息

Dey Kushal K, Hsiao Chiaowen Joyce, Stephens Matthew

机构信息

Department of Statistics, University of Chicago, Chicago, Illinois, United States of America.

Department of Human Genetics, University of Chicago, Chicago, Illinois, United States of America.

出版信息

PLoS Genet. 2017 Mar 23;13(3):e1006599. doi: 10.1371/journal.pgen.1006599. eCollection 2017 Mar.

DOI:10.1371/journal.pgen.1006599
PMID:28333934
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5363805/
Abstract

Grade of membership models, also known as "admixture models", "topic models" or "Latent Dirichlet Allocation", are a generalization of cluster models that allow each sample to have membership in multiple clusters. These models are widely used in population genetics to model admixed individuals who have ancestry from multiple "populations", and in natural language processing to model documents having words from multiple "topics". Here we illustrate the potential for these models to cluster samples of RNA-seq gene expression data, measured on either bulk samples or single cells. We also provide methods to help interpret the clusters, by identifying genes that are distinctively expressed in each cluster. By applying these methods to several example RNA-seq applications we demonstrate their utility in identifying and summarizing structure and heterogeneity. Applied to data from the GTEx project on 53 human tissues, the approach highlights similarities among biologically-related tissues and identifies distinctively-expressed genes that recapitulate known biology. Applied to single-cell expression data from mouse preimplantation embryos, the approach highlights both discrete and continuous variation through early embryonic development stages, and highlights genes involved in a variety of relevant processes-from germ cell development, through compaction and morula formation, to the formation of inner cell mass and trophoblast at the blastocyst stage. The methods are implemented in the Bioconductor package CountClust.

摘要

成员等级模型,也被称为“混合模型”“主题模型”或“潜在狄利克雷分配”,是聚类模型的一种泛化,它允许每个样本属于多个聚类。这些模型在群体遗传学中被广泛用于对具有多个“群体”祖先的混合个体进行建模,在自然语言处理中用于对包含多个“主题”词汇的文档进行建模。在这里,我们展示了这些模型对批量样本或单细胞上测量的RNA测序基因表达数据样本进行聚类的潜力。我们还提供了一些方法,通过识别在每个聚类中特异性表达的基因来帮助解释聚类结果。通过将这些方法应用于几个RNA测序示例应用中,我们证明了它们在识别和总结结构及异质性方面的实用性。将该方法应用于GTEx项目中53种人体组织的数据时,该方法突出了生物学相关组织之间的相似性,并识别出能够概括已知生物学特征的特异性表达基因。将该方法应用于小鼠植入前胚胎的单细胞表达数据时,该方法突出了早期胚胎发育阶段的离散和连续变化,并突出了参与各种相关过程的基因——从生殖细胞发育,到致密化和桑椹胚形成,再到囊胚阶段内细胞团和滋养层的形成。这些方法在Bioconductor软件包CountClust中实现。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b4b/5363805/8e55ae375879/pgen.1006599.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b4b/5363805/8906243cf361/pgen.1006599.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b4b/5363805/2e119e8beed0/pgen.1006599.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b4b/5363805/49cdae24c693/pgen.1006599.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b4b/5363805/8e55ae375879/pgen.1006599.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b4b/5363805/8906243cf361/pgen.1006599.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b4b/5363805/2e119e8beed0/pgen.1006599.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b4b/5363805/49cdae24c693/pgen.1006599.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b4b/5363805/8e55ae375879/pgen.1006599.g004.jpg

相似文献

1
Visualizing the structure of RNA-seq expression data using grade of membership models.使用隶属度模型可视化RNA测序表达数据的结构。
PLoS Genet. 2017 Mar 23;13(3):e1006599. doi: 10.1371/journal.pgen.1006599. eCollection 2017 Mar.
2
A multitask clustering approach for single-cell RNA-seq analysis in Recessive Dystrophic Epidermolysis Bullosa.一种用于隐性营养不良型大疱性表皮松解症的单细胞 RNA-seq 分析的多任务聚类方法。
PLoS Comput Biol. 2018 Apr 9;14(4):e1006053. doi: 10.1371/journal.pcbi.1006053. eCollection 2018 Apr.
3
clusterExperiment and RSEC: A Bioconductor package and framework for clustering of single-cell and other large gene expression datasets.clusterExperiment 和 RSEC:一个用于单细胞和其他大型基因表达数据集聚类的 Bioconductor 包和框架。
PLoS Comput Biol. 2018 Sep 4;14(9):e1006378. doi: 10.1371/journal.pcbi.1006378. eCollection 2018 Sep.
4
RNA-seq based transcriptomic map reveals new insights into mouse salivary gland development and maturation.基于RNA测序的转录组图谱揭示了对小鼠唾液腺发育和成熟的新见解。
BMC Genomics. 2016 Nov 16;17(1):923. doi: 10.1186/s12864-016-3228-7.
5
Data Analysis in Single-Cell Transcriptome Sequencing.单细胞转录组测序中的数据分析
Methods Mol Biol. 2018;1754:311-326. doi: 10.1007/978-1-4939-7717-8_18.
6
Transcriptome profiling of individual rhesus macaque oocytes and preimplantation embryos.恒河猴卵母细胞和胚胎转录组谱分析。
Biol Reprod. 2017 Sep 1;97(3):353-364. doi: 10.1093/biolre/iox114.
7
RNA-sequencing of the sturgeon Acipenser baeri provides insights into expression dynamics of morphogenic differentiation and developmental regulatory genes in early versus late developmental stages.西伯利亚鲟的RNA测序为研究形态发生分化和发育调控基因在发育早期与晚期阶段的表达动态提供了见解。
BMC Genomics. 2016 Aug 8;17:564. doi: 10.1186/s12864-016-2839-3.
8
A multi-Poisson dynamic mixture model to cluster developmental patterns of gene expression by RNA-seq.一种用于通过RNA测序对基因表达发育模式进行聚类的多泊松动态混合模型。
Brief Bioinform. 2015 Mar;16(2):205-15. doi: 10.1093/bib/bbu013. Epub 2014 May 10.
9
Identification and functional analysis of long non-coding RNAs in human and mouse early embryos based on single-cell transcriptome data.基于单细胞转录组数据对人和小鼠早期胚胎中长链非编码RNA的鉴定及功能分析
Oncotarget. 2016 Sep 20;7(38):61215-61228. doi: 10.18632/oncotarget.11304.
10
DTWscore: differential expression and cell clustering analysis for time-series single-cell RNA-seq data.DTW分数:时间序列单细胞RNA测序数据的差异表达和细胞聚类分析
BMC Bioinformatics. 2017 May 23;18(1):270. doi: 10.1186/s12859-017-1647-3.

引用本文的文献

1
Latent Class Analysis with Arbitrary-Distribution Responses.具有任意分布响应的潜在类别分析
Entropy (Basel). 2025 Aug 14;27(8):866. doi: 10.3390/e27080866.
2
Building digital histology models of transcriptional tumor programs with generative deep learning for pathology-based precision medicine.利用生成式深度学习构建转录肿瘤程序的数字组织学模型,用于基于病理学的精准医学。
Genome Med. 2025 Aug 7;17(1):87. doi: 10.1186/s13073-025-01502-z.
3
Spectral divergence prioritizes key classes, genes, and pathways shared between substance use disorders and cardiovascular disease.

本文引用的文献

1
Differential effects of myelin basic protein-activated Th1 and Th2 cells on the local immune microenvironment of injured spinal cord.髓鞘碱性蛋白激活的Th1和Th2细胞对脊髓损伤局部免疫微环境的不同影响。
Exp Neurol. 2016 Mar;277:190-201. doi: 10.1016/j.expneurol.2016.01.002. Epub 2016 Jan 7.
2
A Simple Model-Based Approach to Inferring and Visualizing Cancer Mutation Signatures.一种基于简单模型的癌症突变特征推断与可视化方法。
PLoS Genet. 2015 Dec 2;11(12):e1005657. doi: 10.1371/journal.pgen.1005657. eCollection 2015 Dec.
3
Defining cell types and states with single-cell genomics.
光谱散度对物质使用障碍和心血管疾病之间共有的关键类别、基因和通路进行了优先排序。
Front Neurosci. 2025 Jul 22;19:1572243. doi: 10.3389/fnins.2025.1572243. eCollection 2025.
4
PD-1 regulates tumor-infiltrating CD8+ T cells in both a cell-intrinsic and a cell-extrinsic fashion.程序性死亡受体1(PD-1)以细胞内源性和细胞外源性方式调节肿瘤浸润性CD8+T细胞。
J Exp Med. 2025 Oct 6;222(10). doi: 10.1084/jem.20230542. Epub 2025 Jul 24.
5
Novel cancer subtyping method guided by tumor-normal sample in latent space of transcriptomic variational autoencoder.基于转录组变分自编码器潜在空间中肿瘤-正常样本引导的新型癌症亚型分类方法。
Sci Rep. 2025 Jul 21;15(1):26444. doi: 10.1038/s41598-025-07813-w.
6
Oxygen-induced stress reveals context-specific gene regulatory effects in human brain organoids.氧诱导应激揭示了人类大脑类器官中特定背景下的基因调控效应。
Genome Res. 2025 Jun 2. doi: 10.1101/gr.280219.124.
7
Exploring the latent space of transcriptomic data with topic modeling.运用主题模型探索转录组数据的潜在空间。
NAR Genom Bioinform. 2025 Apr 22;7(2):lqaf049. doi: 10.1093/nargab/lqaf049. eCollection 2025 Jun.
8
Single-Cell Analyses Reveal a Functionally Heterogeneous Exhausted CD8+ T-cell Subpopulation That Is Correlated with Response to Checkpoint Therapy in Melanoma.单细胞分析揭示了一个功能异质性的耗竭性CD8 + T细胞亚群,该亚群与黑色素瘤中检查点疗法的反应相关。
Cancer Res. 2025 Apr 15;85(8):1424-1440. doi: 10.1158/0008-5472.CAN-23-3918.
9
Exploring RNA-Seq Data Analysis Through Visualization Techniques and Tools: A Systematic Review of Opportunities and Limitations for Clinical Applications.通过可视化技术和工具探索RNA测序数据分析:临床应用的机遇与局限的系统综述
Bioengineering (Basel). 2025 Jan 12;12(1):56. doi: 10.3390/bioengineering12010056.
10
Deep topic modeling of spatial transcriptomics in the rheumatoid arthritis synovium identifies distinct classes of ectopic lymphoid structures.类风湿性关节炎滑膜组织中空间转录组学的深度主题建模识别出不同类别的异位淋巴结构。
bioRxiv. 2025 Jan 10:2025.01.08.631928. doi: 10.1101/2025.01.08.631928.
利用单细胞基因组学定义细胞类型和状态。
Genome Res. 2015 Oct;25(10):1491-8. doi: 10.1101/gr.190595.115.
4
A reanalysis of mouse ENCODE comparative gene expression data.小鼠ENCODE比较基因表达数据的重新分析。
F1000Res. 2015 May 19;4:121. doi: 10.12688/f1000research.6536.1. eCollection 2015.
5
Assessing the consistency of public human tissue RNA-seq data sets.评估公共人类组织RNA测序数据集的一致性。
Brief Bioinform. 2015 Nov;16(6):941-9. doi: 10.1093/bib/bbv017. Epub 2015 Mar 30.
6
UNDO: a Bioconductor R package for unsupervised deconvolution of mixed gene expressions in tumor samples.UNDO:一个用于对肿瘤样本中混合基因表达进行无监督反卷积的Bioconductor R软件包。
Bioinformatics. 2015 Jan 1;31(1):137-9. doi: 10.1093/bioinformatics/btu607. Epub 2014 Sep 10.
7
Massively parallel single-cell RNA-seq for marker-free decomposition of tissues into cell types.大规模并行单细胞 RNA-seq 用于无标记组织细胞类型分解。
Science. 2014 Feb 14;343(6172):776-9. doi: 10.1126/science.1247651.
8
Single-cell RNA-seq reveals dynamic, random monoallelic gene expression in mammalian cells.单细胞 RNA 测序揭示了哺乳动物细胞中动态的、随机的单等位基因表达。
Science. 2014 Jan 10;343(6167):193-6. doi: 10.1126/science.1245316.
9
The Genotype-Tissue Expression (GTEx) project.基因型-组织表达 (GTEx) 项目。
Nat Genet. 2013 Jun;45(6):580-5. doi: 10.1038/ng.2653.
10
DeMix: deconvolution for mixed cancer transcriptomes using raw measured data.DeMix:使用原始测量数据对混合癌症转录组进行反卷积。
Bioinformatics. 2013 Aug 1;29(15):1865-71. doi: 10.1093/bioinformatics/btt301. Epub 2013 May 27.