• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于对时间进程基因表达数据进行聚类的递归划分混合模型。

A recursively partitioned mixture model for clustering time-course gene expression data.

作者信息

Koestler Devin C, Marsit Carmen J, Christensen Brock C, Kelsey Karl T, Houseman E Andres

机构信息

Department of Biostatistics, University of Kansas Medical Center, Kansas City, KS 66160, USA.

Department of Community and Family Medicine, Section for Biostatistics and Epidemiology, Dartmouth Medical School, Hanover, New Hampshire 03756, USA ; Department of Pharmacology and Toxicology, Dartmouth College, Hanover, NH 03756, USA.

出版信息

Transl Cancer Res. 2014;3(3):217-232. doi: 10.3978/j.issn.2218-676X.2014.06.04.

DOI:10.3978/j.issn.2218-676X.2014.06.04
PMID:25346887
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4208690/
Abstract

BACKGROUND

Longitudinally collected gene expression data provides an opportunity to investigate the dynamic behavior of gene expression and is crucial for establishing causal links between changes on a molecular level and disease development and progression. In terms of the analysis of such data, clustering of subjects based on time-course expression data may improve our understanding of temporal expression patterns that result in disease phenotypes. Although there are numerous existing methods for clustering subjects using gene expression data, most are not suitable when expression measurements are repeatedly collected over a time-course.

METHODS

We present a modified version of the recursively partitioned mixture model (RPMM) for clustering subjects based on longitudinally collected gene expression data. In the proposed time-course RPMM (TC-RPMM), subjects are clustered on the basis of their temporal profiles of gene expression using a mixture of mixed effects models framework. This framework captures changes in gene expression over time and models the autocorrelation between repeated gene expression measurements for the same subject. We assessed the performance of TC-RPMM using extensive simulation studies and a dataset from a multi-center research study of inflammation and response to injury (www.gluegrant.org), which consisted of time-course gene expression data for 140 subjects.

RESULTS

Our simulation studies encompassed several different scenarios and were aimed at assessing the ability of TC-RPMM to correctly recover true class memberships when the expression trajectories that characterized those classes differed. Overall, our simulation studies revealed favorable performance of TC-RPMM compared to competing approaches, however clustering performance was observed to be highly dependent on the proportion of class discriminating genes used in clustering analysis. When applied to real epidemiologic data with repeated-measures, longitudinal gene expression measurements, TC-RPMM identified clusters that had strong biological and clinical significance.

CONCLUSIONS

Methods for clustering subjects based on temporal gene expression profiles is a high priority for molecular biology and bioinformatics research. Along these lines, the proposed TC-RPMM represents a promising new approach for analyzing time-course gene expression data.

摘要

背景

纵向收集的基因表达数据为研究基因表达的动态行为提供了契机,对于在分子水平上的变化与疾病发展及进展之间建立因果联系至关重要。就此类数据分析而言,基于时间进程表达数据对研究对象进行聚类,可能会增进我们对导致疾病表型的时间表达模式的理解。尽管现有众多利用基因表达数据对研究对象进行聚类的方法,但当在一个时间进程中反复收集表达测量值时,大多数方法并不适用。

方法

我们提出了递归划分混合模型(RPMM)的一个修改版本,用于基于纵向收集的基因表达数据对研究对象进行聚类。在所提出的时间进程RPMM(TC-RPMM)中,利用混合效应模型框架,根据研究对象基因表达的时间概况对其进行聚类。该框架捕捉基因表达随时间的变化,并对同一研究对象重复基因表达测量值之间的自相关性进行建模。我们通过广泛的模拟研究以及来自一项关于炎症和损伤反应的多中心研究(www.gluegrant.org)的数据集评估了TC-RPMM的性能,该数据集包含140名研究对象的时间进程基因表达数据。

结果

我们的模拟研究涵盖了几种不同的情况,旨在评估当表征这些类别的表达轨迹不同时,TC-RPMM正确恢复真实类别归属的能力。总体而言,我们的模拟研究表明,与竞争方法相比,TC-RPMM具有良好的性能,然而观察到聚类性能高度依赖于聚类分析中使用的类别区分基因的比例。当应用于具有重复测量的纵向基因表达测量的实际流行病学数据时,TC-RPMM识别出具有很强生物学和临床意义的聚类。

结论

基于时间基因表达概况对研究对象进行聚类的方法是分子生物学和生物信息学研究的高度优先事项。就此而言,所提出的TC-RPMM代表了一种分析时间进程基因表达数据的有前景的新方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0329/4208690/571227cafb1a/nihms617176f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0329/4208690/2d1af1e72306/nihms617176f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0329/4208690/152b51a8fd31/nihms617176f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0329/4208690/0b1646c3c67c/nihms617176f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0329/4208690/37bbd8d279c8/nihms617176f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0329/4208690/e90ce8844e33/nihms617176f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0329/4208690/5b52d0217524/nihms617176f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0329/4208690/571227cafb1a/nihms617176f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0329/4208690/2d1af1e72306/nihms617176f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0329/4208690/152b51a8fd31/nihms617176f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0329/4208690/0b1646c3c67c/nihms617176f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0329/4208690/37bbd8d279c8/nihms617176f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0329/4208690/e90ce8844e33/nihms617176f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0329/4208690/5b52d0217524/nihms617176f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0329/4208690/571227cafb1a/nihms617176f7.jpg

相似文献

1
A recursively partitioned mixture model for clustering time-course gene expression data.一种用于对时间进程基因表达数据进行聚类的递归划分混合模型。
Transl Cancer Res. 2014;3(3):217-232. doi: 10.3978/j.issn.2218-676X.2014.06.04.
2
Semi-supervised recursively partitioned mixture models for identifying cancer subtypes.半监督递归分区混合模型用于识别癌症亚型。
Bioinformatics. 2010 Oct 15;26(20):2578-85. doi: 10.1093/bioinformatics/btq470. Epub 2010 Aug 16.
3
Recursively partitioned mixture model clustering of DNA methylation data using biologically informed correlation structures.使用生物学信息相关结构对DNA甲基化数据进行递归划分混合模型聚类。
Stat Appl Genet Mol Biol. 2013 Mar 5;12(2):225-40. doi: 10.1515/sagmb-2012-0068.
4
Analyzing gene expression time-courses based on multi-resolution shape mixture model.基于多分辨率形状混合模型分析基因表达时间进程。
Math Biosci. 2016 Nov;281:74-81. doi: 10.1016/j.mbs.2016.08.012. Epub 2016 Sep 10.
5
A mixture model with random-effects components for clustering correlated gene-expression profiles.一种具有随机效应成分的混合模型,用于对相关基因表达谱进行聚类。
Bioinformatics. 2006 Jul 15;22(14):1745-52. doi: 10.1093/bioinformatics/btl165. Epub 2006 May 3.
6
Correlation-based iterative clustering methods for time course data: The identification of temporal gene response modules for influenza infection in humans.基于相关性的时间序列数据迭代聚类方法:人类流感感染中时间基因反应模块的识别
Infect Dis Model. 2016 Sep 2;1(1):28-39. doi: 10.1016/j.idm.2016.07.001. eCollection 2016 Oct.
7
Finding explained groups of time-course gene expression profiles with predictive clustering trees.使用预测聚类树寻找时间进程基因表达谱的可解释分组。
Mol Biosyst. 2010 Apr;6(4):729-40. doi: 10.1039/b913690h. Epub 2010 Feb 19.
8
Application of dynamic topic models to toxicogenomics data.动态主题模型在毒理基因组学数据中的应用。
BMC Bioinformatics. 2016 Oct 6;17(Suppl 13):368. doi: 10.1186/s12859-016-1225-0.
9
Dynamic model-based clustering for time-course gene expression data.基于动态模型的时间序列基因表达数据聚类分析
J Bioinform Comput Biol. 2005 Aug;3(4):821-36. doi: 10.1142/s0219720005001314.
10
Clustering samples characterized by time course gene expression profiles using the mixture of state space models.使用状态空间模型混合方法对以时间进程基因表达谱为特征的样本进行聚类。
Genome Inform. 2007;18:258-66.

引用本文的文献

1
Exploring the longitudinal dynamics of herd BVD antibody test results using model-based clustering.利用基于模型的聚类方法探索 herd BVD 抗体检测结果的纵向动态变化。
Sci Rep. 2019 Aug 6;9(1):11353. doi: 10.1038/s41598-019-47339-6.
2
Promoter methylation of DNA damage repair (DDR) genes in human tumor entities: / is almost exclusively methylated in bladder cancer.人类肿瘤实体中 DNA 损伤修复 (DDR) 基因的启动子甲基化:/ 在膀胱癌中几乎完全甲基化。
Clin Epigenetics. 2018 Feb 6;10:15. doi: 10.1186/s13148-018-0447-6. eCollection 2018.
3
A Linear Mixed Model Spline Framework for Analysing Time Course 'Omics' Data.用于分析时间进程“组学”数据的线性混合模型样条框架
PLoS One. 2015 Aug 27;10(8):e0134540. doi: 10.1371/journal.pone.0134540. eCollection 2015.

本文引用的文献

1
Placenta-imprinted gene expression association of infant neurobehavior.胎盘印迹基因表达与婴儿神经行为的关联。
J Pediatr. 2012 May;160(5):854-860.e2. doi: 10.1016/j.jpeds.2011.10.028. Epub 2011 Dec 6.
2
DNA methylation array analysis identifies profiles of blood-derived DNA methylation associated with bladder cancer.DNA 甲基化芯片分析鉴定出与膀胱癌相关的血液源性 DNA 甲基化特征。
J Clin Oncol. 2011 Mar 20;29(9):1133-9. doi: 10.1200/JCO.2010.31.3577. Epub 2011 Feb 22.
3
Identification and interpretation of longitudinal gene expression changes in trauma.创伤后纵向基因表达变化的识别与解读。
PLoS One. 2010 Dec 20;5(12):e14380. doi: 10.1371/journal.pone.0014380.
4
DNA methylation, isocitrate dehydrogenase mutation, and survival in glioma.DNA 甲基化、异柠檬酸脱氢酶突变与脑胶质瘤患者的生存
J Natl Cancer Inst. 2011 Jan 19;103(2):143-53. doi: 10.1093/jnci/djq497. Epub 2010 Dec 16.
5
Predicting patient survival from longitudinal gene expression.通过纵向基因表达预测患者生存率。
Stat Appl Genet Mol Biol. 2010;9(1):Article41. doi: 10.2202/1544-6115.1617. Epub 2010 Nov 22.
6
Identification of VKORC1 interaction partners by split-ubiquitin system and coimmunoprecipitation.通过拆分泛素系统和免疫共沉淀鉴定 VKORC1 相互作用伙伴。
Thromb Haemost. 2011 Feb;105(2):285-94. doi: 10.1160/TH10-07-0483. Epub 2010 Nov 23.
7
Semi-supervised recursively partitioned mixture models for identifying cancer subtypes.半监督递归分区混合模型用于识别癌症亚型。
Bioinformatics. 2010 Oct 15;26(20):2578-85. doi: 10.1093/bioinformatics/btq470. Epub 2010 Aug 16.
8
A framework for feature selection in clustering.一种用于聚类中特征选择的框架。
J Am Stat Assoc. 2010 Jun 1;105(490):713-726. doi: 10.1198/jasa.2010.tm09415.
9
Mixtures of regression models for time course gene expression data: evaluation of initialization and random effects.基于回归模型的时间序列基因表达数据混合方法:初始化和随机效应评估。
Bioinformatics. 2010 Feb 1;26(3):370-7. doi: 10.1093/bioinformatics/btp686. Epub 2009 Dec 29.
10
A genomic score prognostic of outcome in trauma patients.一种预测创伤患者预后的基因组评分。
Mol Med. 2009 Jul-Aug;15(7-8):220-7. doi: 10.2119/molmed.2009.00027. Epub 2009 Apr 10.