• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于临床时间序列分析和稳健分类的约束混合估计

Constrained mixture estimation for analysis and robust classification of clinical time series.

作者信息

Costa Ivan G, Schönhuth Alexander, Hafemeister Christoph, Schliep Alexander

机构信息

Center of Informatics, Federal University of Pernambuco, Recife, Brazil.

出版信息

Bioinformatics. 2009 Jun 15;25(12):i6-14. doi: 10.1093/bioinformatics/btp222.

DOI:10.1093/bioinformatics/btp222
PMID:19478017
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2687976/
Abstract

MOTIVATION

Personalized medicine based on molecular aspects of diseases, such as gene expression profiling, has become increasingly popular. However, one faces multiple challenges when analyzing clinical gene expression data; most of the well-known theoretical issues such as high dimension of feature spaces versus few examples, noise and missing data apply. Special care is needed when designing classification procedures that support personalized diagnosis and choice of treatment. Here, we particularly focus on classification of interferon-beta (IFNbeta) treatment response in Multiple Sclerosis (MS) patients which has attracted substantial attention in the recent past. Half of the patients remain unaffected by IFNbeta treatment, which is still the standard. For them the treatment should be timely ceased to mitigate the side effects.

RESULTS

We propose constrained estimation of mixtures of hidden Markov models as a methodology to classify patient response to IFNbeta treatment. The advantages of our approach are that it takes the temporal nature of the data into account and its robustness with respect to noise, missing data and mislabeled samples. Moreover, mixture estimation enables to explore the presence of response sub-groups of patients on the transcriptional level. We clearly outperformed all prior approaches in terms of prediction accuracy, raising it, for the first time, >90%. Additionally, we were able to identify potentially mislabeled samples and to sub-divide the good responders into two sub-groups that exhibited different transcriptional response programs. This is supported by recent findings on MS pathology and therefore may raise interesting clinical follow-up questions.

AVAILABILITY

The method is implemented in the GQL framework and is available at http://www.ghmm.org/gql. Datasets are available at http://www.cin.ufpe.br/ approximately igcf/MSConst.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

基于疾病分子特征的个性化医疗,如基因表达谱分析,已越来越受欢迎。然而,在分析临床基因表达数据时会面临多重挑战;大多数众所周知的理论问题,如特征空间维度高而样本少、噪声和缺失数据等都存在。在设计支持个性化诊断和治疗选择的分类程序时需要特别小心。在此,我们特别关注多发性硬化症(MS)患者中干扰素-β(IFNβ)治疗反应的分类,这在最近引起了广泛关注。仍有一半的患者对IFNβ治疗无反应,而IFNβ治疗仍是标准治疗方法。对于这些患者,应及时停止治疗以减轻副作用。

结果

我们提出将隐藏马尔可夫模型混合的约束估计作为一种对患者对IFNβ治疗的反应进行分类的方法。我们方法的优点在于它考虑了数据的时间特性,并且对噪声、缺失数据和错误标记的样本具有鲁棒性。此外,混合估计能够在转录水平上探索患者反应亚组的存在。在预测准确性方面,我们明显优于所有先前的方法,首次将其提高到>90%。此外,我们能够识别潜在错误标记的样本,并将良好反应者细分为两个表现出不同转录反应程序的亚组。这得到了最近关于MS病理学研究结果的支持,因此可能会引发有趣的临床随访问题。

可用性

该方法在GQL框架中实现,可在http://www.ghmm.org/gql获取。数据集可在http://www.cin.ufpe.br/ approximately igcf/MSConst获取。

补充信息

补充数据可在《生物信息学》在线版获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e392/2687976/907157bfbb8f/btp222f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e392/2687976/f2e48279babf/btp222f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e392/2687976/652a34cab1bf/btp222f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e392/2687976/3d80dd707663/btp222f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e392/2687976/e3d1b347da40/btp222f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e392/2687976/1d42668a7565/btp222f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e392/2687976/74779ade18eb/btp222f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e392/2687976/907157bfbb8f/btp222f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e392/2687976/f2e48279babf/btp222f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e392/2687976/652a34cab1bf/btp222f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e392/2687976/3d80dd707663/btp222f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e392/2687976/e3d1b347da40/btp222f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e392/2687976/1d42668a7565/btp222f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e392/2687976/74779ade18eb/btp222f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e392/2687976/907157bfbb8f/btp222f7.jpg

相似文献

1
Constrained mixture estimation for analysis and robust classification of clinical time series.用于临床时间序列分析和稳健分类的约束混合估计
Bioinformatics. 2009 Jun 15;25(12):i6-14. doi: 10.1093/bioinformatics/btp222.
2
Robust inference of groups in gene expression time-courses using mixtures of HMMs.使用隐马尔可夫模型混合对基因表达时间进程中的组进行稳健推断。
Bioinformatics. 2004 Aug 4;20 Suppl 1:i283-9. doi: 10.1093/bioinformatics/bth937.
3
Classification of time series gene expression in clinical studies via integration of biological network.通过整合生物网络对临床研究中的时间序列基因表达进行分类。
PLoS One. 2013;8(3):e58383. doi: 10.1371/journal.pone.0058383. Epub 2013 Mar 13.
4
Detection and interpretation of metabolite-transcript coresponses using combined profiling data.利用联合分析数据检测和解释代谢物-转录协同反应。
Bioinformatics. 2011 Jul 1;27(13):i357-65. doi: 10.1093/bioinformatics/btr231.
5
Prognostic prediction through biclustering-based classification of clinical gene expression time series.通过基于双聚类的临床基因表达时间序列分类进行预后预测。
J Integr Bioinform. 2011 Sep 15;8(3):175. doi: 10.2390/biecoll-jib-2011-175.
6
Discovering gene expression patterns in time course microarray experiments by ANOVA-SCA.通过方差分析-稀疏成分分析在时间进程微阵列实验中发现基因表达模式。
Bioinformatics. 2007 Jul 15;23(14):1792-800. doi: 10.1093/bioinformatics/btm251. Epub 2007 May 22.
7
Patient-specific early classification of multivariate observations.多变量观察结果的患者特异性早期分类。
Int J Data Min Bioinform. 2015;11(4):392-411. doi: 10.1504/ijdmb.2015.067955.
8
Inferring epigenetic and transcriptional regulation during blood cell development with a mixture of sparse linear models.利用稀疏线性模型混合物推断血细胞发育过程中的表观遗传和转录调控。
Bioinformatics. 2012 Sep 15;28(18):2297-303. doi: 10.1093/bioinformatics/bts362. Epub 2012 Jun 23.
9
T cell gene expression profiling identifies distinct subgroups of Japanese multiple sclerosis patients.T细胞基因表达谱分析识别出日本多发性硬化症患者的不同亚组。
J Neuroimmunol. 2006 May;174(1-2):108-18. doi: 10.1016/j.jneuroim.2006.02.004. Epub 2006 Mar 27.
10
Analyzing gene expression time-courses.分析基因表达时间进程。
IEEE/ACM Trans Comput Biol Bioinform. 2005 Jul-Sep;2(3):179-93. doi: 10.1109/TCBB.2005.31.

引用本文的文献

1
GeTeSEPdb: A comprehensive database and online tool for the identification and analysis of gene profiles with temporal-specific expression patterns.GeTeSEPdb:一个用于识别和分析具有时间特异性表达模式的基因图谱的综合数据库及在线工具。
Comput Struct Biotechnol J. 2024 Jun 5;23:2488-2496. doi: 10.1016/j.csbj.2024.06.003. eCollection 2024 Dec.
2
Phenotype similarities in automatically grouped T2D patients by variation-based clustering of IL-1β gene expression.通过基于变异的白细胞介素-1β基因表达聚类对自动分组的2型糖尿病患者进行表型相似性分析。
EJIFCC. 2023 Oct 16;34(3):228-244. eCollection 2023 Oct.
3
Time-frequency time-space LSTM for robust classification of physiological signals.

本文引用的文献

1
Diagnostic signatures from microarrays: a bioinformatics concept for personalized medicine.微阵列的诊断特征:个性化医疗的生物信息学概念。
Drug Discov Today. 2004 Dec 15;9(24 Suppl):S32-6.
2
Computational diagnostics with gene expression profiles.基于基因表达谱的计算诊断
Methods Mol Biol. 2008;453:281-96. doi: 10.1007/978-1-60327-429-6_15.
3
Alignment and classification of time series gene expression in clinical studies.临床研究中时间序列基因表达的比对与分类
时频时空长短时记忆网络在生理信号鲁棒分类中的应用
Sci Rep. 2021 Mar 25;11(1):6936. doi: 10.1038/s41598-021-86432-7.
4
Generalized Correlation Coefficient for Non-Parametric Analysis of Microarray Time-Course Data.用于微阵列时间进程数据非参数分析的广义相关系数
J Integr Bioinform. 2017 Jun 6;14(2):/j/jib.2017.14.issue-2/jib-2017-0011/jib-2017-0011.xml. doi: 10.1515/jib-2017-0011.
5
Natural Cubic Spline Regression Modeling Followed by Dynamic Network Reconstruction for the Identification of Radiation-Sensitivity Gene Association Networks from Time-Course Transcriptome Data.基于自然三次样条回归建模并结合动态网络重构,从时间进程转录组数据中识别辐射敏感性基因关联网络。
PLoS One. 2016 Aug 9;11(8):e0160791. doi: 10.1371/journal.pone.0160791. eCollection 2016.
6
Classification of time series gene expression in clinical studies via integration of biological network.通过整合生物网络对临床研究中的时间序列基因表达进行分类。
PLoS One. 2013;8(3):e58383. doi: 10.1371/journal.pone.0058383. Epub 2013 Mar 13.
7
Early classification of multivariate temporal observations by extraction of interpretable shapelets.通过提取可解释的形状特征对多元时间观测进行早期分类。
BMC Bioinformatics. 2012 Aug 8;13:195. doi: 10.1186/1471-2105-13-195.
8
Studying and modelling dynamic biological processes using time-series gene expression data.利用时间序列基因表达数据研究和模拟动态生物过程。
Nat Rev Genet. 2012 Jul 18;13(8):552-64. doi: 10.1038/nrg3244.
9
A growth curve model with fractional polynomials for analysing incomplete time-course data in microarray gene expression studies.一种用于微阵列基因表达研究中分析不完整时间进程数据的含分数多项式的生长曲线模型。
Adv Bioinformatics. 2011;2011:261514. doi: 10.1155/2011/261514. Epub 2011 Sep 27.
10
Detection and interpretation of metabolite-transcript coresponses using combined profiling data.利用联合分析数据检测和解释代谢物-转录协同反应。
Bioinformatics. 2011 Jul 1;27(13):i357-65. doi: 10.1093/bioinformatics/btr231.
Bioinformatics. 2008 Jul 1;24(13):i147-55. doi: 10.1093/bioinformatics/btn152.
4
Enabling personalized cancer medicine through analysis of gene-expression patterns.通过分析基因表达模式实现个性化癌症医学。
Nature. 2008 Apr 3;452(7187):564-70. doi: 10.1038/nature06915.
5
Semi-supervised learning for the identification of syn-expressed genes from fused microarray and in situ image data.基于融合微阵列和原位图像数据的共表达基因识别的半监督学习
BMC Bioinformatics. 2007;8 Suppl 10(Suppl 10):S3. doi: 10.1186/1471-2105-8-S10-S3.
6
A patient-gene model for temporal expression profiles in clinical studies.临床研究中用于时间表达谱的患者-基因模型。
J Comput Biol. 2007 Apr;14(3):324-38. doi: 10.1089/cmb.2007.0001.
7
g:Profiler--a web-based toolset for functional profiling of gene lists from large-scale experiments.g:Profiler——一个基于网络的工具集,用于对来自大规模实验的基因列表进行功能分析。
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W193-200. doi: 10.1093/nar/gkm226. Epub 2007 May 3.
8
Class prediction from time series gene expression profiles using dynamical systems kernels.使用动态系统核从时间序列基因表达谱进行类别预测。
Pac Symp Biocomput. 2006:547-58.
9
Analyzing gene expression time-courses.分析基因表达时间进程。
IEEE/ACM Trans Comput Biol Bioinform. 2005 Jul-Sep;2(3):179-93. doi: 10.1109/TCBB.2005.31.
10
A subtype of multiple sclerosis defined by an activated immune defense program.一种由激活的免疫防御程序所定义的多发性硬化症亚型。
Genes Immun. 2006 Sep;7(6):522-31. doi: 10.1038/sj.gene.6364324. Epub 2006 Jul 13.