• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

VAE-Surv:一种用于骨髓增生异常综合征基于基因的聚类和预后预测的新方法。

VAE-Surv: A novel approach for genetic-based clustering and prognosis prediction in myelodysplastic syndromes.

作者信息

Rollo Cesare, Pancotti Corrado, Sartori Flavio, Caranzano Isabella, D'Amico Saverio, Carota Luciana, Casadei Francesco, Birolo Giovanni, Lanino Luca, Sauta Elisabetta, Asti Gianluca, Buizza Alessandro, Delleani Mattia, Zazzetti Elena, Bicchieri Marilena, Maggioni Giulia, Fenaux Pierre, Platzbecker Uwe, Diez-Campelo Maria, Haferlach Torsten, Castellani Gastone, Della Porta Matteo Giovanni, Fariselli Piero, Sanavia Tiziana

机构信息

Computational Biomedicine Unit, Department of Medical Sciences, University of Torino, Via Santena 19, 10126, Torino, Italy.

IRCCS Humanitas Research Hospital, via Manzoni 56, 20089 Rozzano - Milan, Italy; Train s.r.l., via Alessandro Manzoni 56, 20089 Rozzano - Milan, Italy.

出版信息

Comput Methods Programs Biomed. 2025 Apr;261:108605. doi: 10.1016/j.cmpb.2025.108605. Epub 2025 Jan 20.

DOI:10.1016/j.cmpb.2025.108605
PMID:39874934
Abstract

BACKGROUND AND OBJECTIVES

Several computational pipelines for biomedical data have been proposed to stratify patients and to predict their prognosis through survival analysis. However, these analyses are usually performed independently, without integrating the information derived from each of them. Clustering of survival data is an underexplored problem, and current approaches are limited for biomedical applications, whose data are usually heterogeneous and multimodal, with poor scalability for high-dimensionality.

METHODS

We introduce VAE-Surv, a multimodal computational framework for patients' stratification and prognosis prediction. VAE-Surv integrates a Variational Autoencoder (VAE), which reduces the high-dimensional space characterizing the molecular data, with a deep survival model, which combines the embedded information with the clinical features. The VAE embedding step prioritizes local coherence within the feature space to detect potential nonlinear relationships among the molecular markers. The latent representation is then exploited to perform K-means clustering. To test the clinical robustness of the algorithm, VAE-Surv was applied to the Genomed4all cohort of Myelodysplastic Syndromes (MDS), comparing the identified subtypes with the World Health Organization (WHO) classification. The survival outcome was compared with the state-of-the-art Cox model and its penalized versions. Finally, to assess the generalizability of the results, the method was also validated on an external MDS cohort.

RESULTS

Tested on 2,043 patients in the GenomMed4All cohort, VAE-Surv achieved a median C-index of 0.78, outperforming classical approaches. In addition, the latent space enhanced the clustering performance compared to a traditional approach that applies the clustering directly to the input data. Compared to the WHO 2016 MDS subtypes, the analysis of the identified clusters showed that the proposed framework can capture existing clinical categorizations while also suggesting novel, data-driven patient groups. Even tested in an external MDS cohort of 2,384 patients, VAE-Surv achieved a good prediction performance (median C-index=0.74), preserving the interpretability of the main clinical and genetic features.

CONCLUSIONS

VAE-Surv enables automatic identification of patients' clusters, while outperforming the traditional CoxPH model in survival prediction tasks at the same time. Applied to MDS use case, the obtained genetic-based clusters exhibit a clear survival stratification, and the application of the clinical information allowed high performance in prognosis prediction.

摘要

背景与目的

已经提出了几种用于生物医学数据的计算流程,以对患者进行分层,并通过生存分析预测其预后。然而,这些分析通常是独立进行的,没有整合从每个分析中获得的信息。生存数据的聚类是一个未被充分探索的问题,当前的方法在生物医学应用中存在局限性,因为生物医学数据通常是异质的和多模态的,对于高维数据的可扩展性较差。

方法

我们引入了VAE-Surv,这是一种用于患者分层和预后预测的多模态计算框架。VAE-Surv将一个变分自编码器(VAE)与一个深度生存模型相结合,VAE用于降低表征分子数据的高维空间,深度生存模型则将嵌入信息与临床特征相结合。VAE嵌入步骤优先考虑特征空间内的局部连贯性,以检测分子标记之间潜在的非线性关系。然后利用潜在表示进行K均值聚类。为了测试该算法的临床稳健性,将VAE-Surv应用于骨髓增生异常综合征(MDS)的Genomed4all队列,将识别出的亚型与世界卫生组织(WHO)分类进行比较。将生存结果与最先进的Cox模型及其惩罚版本进行比较。最后,为了评估结果的可推广性,该方法还在一个外部MDS队列上进行了验证。

结果

在GenomMed4All队列中的2043名患者上进行测试时,VAE-Surv的中位C指数达到了0.78,优于传统方法。此外,与直接将聚类应用于输入数据的传统方法相比,潜在空间增强了聚类性能。与WHO 2016 MDS亚型相比,对识别出的聚类进行分析表明,所提出的框架能够捕捉现有的临床分类,同时还能提出新的数据驱动的患者群体。即使在一个包含2384名患者的外部MDS队列中进行测试,VAE-Surv也取得了良好的预测性能(中位C指数=0.74),同时保留了主要临床和遗传特征的可解释性。

结论

VAE-Surv能够自动识别患者聚类,同时在生存预测任务中优于传统的CoxPH模型。应用于MDS用例时,所获得的基于基因的聚类表现出明显的生存分层,临床信息的应用在预后预测中具有高性能。

相似文献

1
VAE-Surv: A novel approach for genetic-based clustering and prognosis prediction in myelodysplastic syndromes.VAE-Surv:一种用于骨髓增生异常综合征基于基因的聚类和预后预测的新方法。
Comput Methods Programs Biomed. 2025 Apr;261:108605. doi: 10.1016/j.cmpb.2025.108605. Epub 2025 Jan 20.
2
Combining handcrafted features with latent variables in machine learning for prediction of radiation-induced lung damage.将机器学习中的手工特征与潜在变量相结合,以预测放射性肺损伤。
Med Phys. 2019 May;46(5):2497-2511. doi: 10.1002/mp.13497. Epub 2019 Apr 8.
3
MOSAIC: An Artificial Intelligence-Based Framework for Multimodal Analysis, Classification, and Personalized Prognostic Assessment in Rare Cancers.MOSAIC:一种基于人工智能的罕见癌症多模态分析、分类和个性化预后评估框架。
JCO Clin Cancer Inform. 2024 Jun;8:e2400008. doi: 10.1200/CCI.24.00008.
4
ECG-surv: A deep learning-based model to predict time to 1-year mortality from 12-lead electrocardiogram.心电图生存预测模型(ECG-surv):一种基于深度学习的模型,用于根据12导联心电图预测1年死亡率的时间。
Biomed J. 2025 Feb;48(1):100732. doi: 10.1016/j.bj.2024.100732. Epub 2024 May 1.
5
Integrated multi-omics analysis of ovarian cancer using variational autoencoders.基于变分自动编码器的卵巢癌多组学综合分析。
Sci Rep. 2021 Mar 18;11(1):6265. doi: 10.1038/s41598-021-85285-4.
6
Deep clustering analysis via variational autoencoder with Gamma mixture latent embeddings.基于具有伽马混合潜在嵌入的变分自编码器的深度聚类分析。
Neural Netw. 2025 Mar;183:106979. doi: 10.1016/j.neunet.2024.106979. Epub 2024 Dec 4.
7
Research on load clustering algorithm based on variational autoencoder and hierarchical clustering.基于变分自编码器和层次聚类的负荷聚类算法研究
PLoS One. 2024 Jun 13;19(6):e0303977. doi: 10.1371/journal.pone.0303977. eCollection 2024.
8
A novel survival multifactor dimensionality reduction method for detecting gene-gene interactions with application to bladder cancer prognosis.一种新的生存多因素降维方法,用于检测膀胱癌预后的基因-基因相互作用。
Hum Genet. 2011 Jan;129(1):101-10. doi: 10.1007/s00439-010-0905-5. Epub 2010 Oct 28.
9
Novel multi-omics deconfounding variational autoencoders can obtain meaningful disease subtyping.新型多组学去混淆变分自动编码器可获得有意义的疾病亚型。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae512.
10
Deep Clustering Analysis via Dual Variational Autoencoder With Spherical Latent Embeddings.基于具有球形潜在嵌入的对偶变分自编码器的深度聚类分析
IEEE Trans Neural Netw Learn Syst. 2023 Sep;34(9):6303-6312. doi: 10.1109/TNNLS.2021.3135460. Epub 2023 Sep 1.

引用本文的文献

1
A Comprehensive Review of Deep Learning Applications with Multi-Omics Data in Cancer Research.癌症研究中多组学数据深度学习应用的综合综述
Genes (Basel). 2025 May 28;16(6):648. doi: 10.3390/genes16060648.