• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于多模态图的半监督癌症生存预测管道

A Multi-Modal Graph-Based Semi-Supervised Pipeline for Predicting Cancer Survival.

作者信息

Hassanzadeh Hamid Reza, Phan John H, Wang May D

机构信息

Department of Computational Science and Engineering, Georgia Institute of Technology Atlanta, Georgia 30332.

Department of Biomedical Engineering Georgia Institute of Technology and Emory University, Atlanta, Georgia 30332.

出版信息

Proceedings (IEEE Int Conf Bioinformatics Biomed). 2016 Dec;2016:184-189. doi: 10.1109/bibm.2016.7822516. Epub 2017 Jan 19.

DOI:10.1109/bibm.2016.7822516
PMID:32655981
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7353958/
Abstract

Cancer survival prediction is an active area of research that can help prevent unnecessary therapies and improve patient's quality of life. Gene expression profiling is being widely used in cancer studies to discover informative biomarkers that aid predict different clinical endpoint prediction. We use multiple modalities of data derived from RNA deep-sequencing (RNA-seq) to predict survival of cancer patients. Despite the wealth of information available in expression profiles of cancer tumors, fulfilling the aforementioned objective remains a big challenge, for the most part, due to the paucity of data samples compared to the high dimension of the expression profiles. As such, analysis of transcriptomic data modalities calls for state-of-the-art big-data analytics techniques that can maximally use all the available data to discover the relevant information hidden within a significant amount of noise. In this paper, we propose a pipeline that predicts cancer patients' survival by exploiting the structure of the input (manifold learning) and by leveraging the unlabeled samples using Laplacian support vector machines, a graph-based semi supervised learning (GSSL) paradigm. We show that under certain circumstances, no single modality per se will result in the best accuracy and by fusing different models together via a stacked generalization strategy, we may boost the accuracy synergistically. We apply our approach to two cancer datasets and present promising results. We maintain that a similar pipeline can be used for predictive tasks where labeled samples are expensive to acquire.

摘要

癌症生存预测是一个活跃的研究领域,它有助于避免不必要的治疗并提高患者的生活质量。基因表达谱分析在癌症研究中被广泛应用,以发现有助于预测不同临床终点的信息性生物标志物。我们使用从RNA深度测序(RNA-seq)获得的多种数据模式来预测癌症患者的生存情况。尽管癌症肿瘤的表达谱中存在大量信息,但实现上述目标仍然是一个巨大的挑战,在很大程度上是由于与高维表达谱相比,数据样本匮乏。因此,对转录组数据模式的分析需要最先进的大数据分析技术,这些技术可以最大限度地利用所有可用数据,以发现隐藏在大量噪声中的相关信息。在本文中,我们提出了一种流程,通过利用输入的结构(流形学习)并使用拉普拉斯支持向量机(一种基于图的半监督学习(GSSL)范式)来利用未标记样本,从而预测癌症患者的生存情况。我们表明,在某些情况下,单一模式本身不会带来最佳准确性,而通过堆叠泛化策略将不同模型融合在一起,我们可以协同提高准确性。我们将我们的方法应用于两个癌症数据集,并呈现出有前景的结果。我们认为,类似的流程可用于获取标记样本成本高昂的预测任务。

相似文献

1
A Multi-Modal Graph-Based Semi-Supervised Pipeline for Predicting Cancer Survival.一种基于多模态图的半监督癌症生存预测管道
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2016 Dec;2016:184-189. doi: 10.1109/bibm.2016.7822516. Epub 2017 Jan 19.
2
Enhanced manifold regularization for semi-supervised classification.用于半监督分类的增强流形正则化
J Opt Soc Am A Opt Image Sci Vis. 2016 Jun 1;33(6):1207-13. doi: 10.1364/JOSAA.33.001207.
3
A semi-supervised deep learning method based on stacked sparse auto-encoder for cancer prediction using RNA-seq data.基于堆叠稀疏自动编码器的半监督深度学习方法在 RNA-seq 数据癌症预测中的应用。
Comput Methods Programs Biomed. 2018 Nov;166:99-105. doi: 10.1016/j.cmpb.2018.10.004. Epub 2018 Oct 5.
4
Enhanced low-rank representation via sparse manifold adaption for semi-supervised learning.基于稀疏流形自适应的半监督学习增强低秩表示
Neural Netw. 2015 May;65:1-17. doi: 10.1016/j.neunet.2015.01.001. Epub 2015 Jan 10.
5
L1-norm locally linear representation regularization multi-source adaptation learning.L1 范数局部线性表示正则化多源自适应学习。
Neural Netw. 2015 Sep;69:80-98. doi: 10.1016/j.neunet.2015.01.009. Epub 2015 Feb 25.
6
p -Laplacian Regularization for Scene Recognition.用于场景识别的p -拉普拉斯正则化
IEEE Trans Cybern. 2019 Aug;49(8):2927-2940. doi: 10.1109/TCYB.2018.2833843. Epub 2018 May 22.
7
Cancer survival analysis using semi-supervised learning method based on Cox and AFT models with L1/2 regularization.基于带有L1/2正则化的Cox模型和加速失效时间(AFT)模型的半监督学习方法进行癌症生存分析。
BMC Med Genomics. 2016 Mar 1;9:11. doi: 10.1186/s12920-016-0169-6.
8
An Integrated Deep Network for Cancer Survival Prediction Using Omics Data.一种使用组学数据进行癌症生存预测的集成深度网络。
Front Big Data. 2021 Jul 16;4:568352. doi: 10.3389/fdata.2021.568352. eCollection 2021.
9
A semi-supervised machine learning framework for microRNA classification.一种用于 microRNA 分类的半监督机器学习框架。
Hum Genomics. 2019 Oct 22;13(Suppl 1):43. doi: 10.1186/s40246-019-0221-7.
10
Graph-Based Semi-Supervised Learning: A Comprehensive Review.基于图的半监督学习:全面综述。
IEEE Trans Neural Netw Learn Syst. 2023 Nov;34(11):8174-8194. doi: 10.1109/TNNLS.2022.3155478. Epub 2023 Oct 27.

引用本文的文献

1
Smart Biosensor for Breast Cancer Survival Prediction Based on Multi-View Multi-Way Graph Learning.基于多视图多向图学习的乳腺癌生存预测智能生物传感器
Sensors (Basel). 2024 May 21;24(11):3289. doi: 10.3390/s24113289.

本文引用的文献

1
Identifying Cancer Biomarkers From Microarray Data Using Feature Selection and Semisupervised Learning.基于特征选择和半监督学习的基因表达谱数据癌症生物标志物识别
IEEE J Transl Eng Health Med. 2014 Dec 2;2:4300211. doi: 10.1109/JTEHM.2014.2375820. eCollection 2014.
2
A semi-supervised method for predicting cancer survival using incomplete clinical data.
Annu Int Conf IEEE Eng Med Biol Soc. 2015;2015:210-3. doi: 10.1109/EMBC.2015.7318337.
3
Comparison of RNA-seq and microarray-based models for clinical endpoint prediction.用于临床终点预测的RNA测序和基于微阵列模型的比较。
Genome Biol. 2015 Jun 25;16(1):133. doi: 10.1186/s13059-015-0694-1.
4
Knowledge boosting: a graph-based integration approach with multi-omics data and genomic knowledge for cancer clinical outcome prediction.知识增强:一种基于图的整合方法,利用多组学数据和基因组知识进行癌症临床结果预测。
J Am Med Inform Assoc. 2015 Jan;22(1):109-20. doi: 10.1136/amiajnl-2013-002481. Epub 2014 Jul 7.
5
Semi-supervised learning for potential human microRNA-disease associations inference.用于潜在人类微小RNA-疾病关联推断的半监督学习
Sci Rep. 2014 Jun 30;4:5501. doi: 10.1038/srep05501.
6
Incorporating inter-relationships between different levels of genomic data into cancer clinical outcome prediction.将不同层次基因组数据之间的相互关系纳入癌症临床结果预测。
Methods. 2014 Jun 1;67(3):344-53. doi: 10.1016/j.ymeth.2014.02.003. Epub 2014 Feb 18.
7
Integrative gene network construction to analyze cancer recurrence using semi-supervised learning.使用半监督学习构建整合基因网络以分析癌症复发
PLoS One. 2014 Jan 31;9(1):e86309. doi: 10.1371/journal.pone.0086309. eCollection 2014.
8
Breast cancer survivability prediction using labeled, unlabeled, and pseudo-labeled patient data.利用标记、未标记和伪标记患者数据进行乳腺癌生存预测。
J Am Med Inform Assoc. 2013 Jul-Aug;20(4):613-8. doi: 10.1136/amiajnl-2012-001570. Epub 2013 Mar 6.
9
Gene-expression-based cancer subtypes prediction through feature selection and transductive SVM.基于基因表达的癌症亚型预测:特征选择与转导 SVM 方法
IEEE Trans Biomed Eng. 2013 Apr;60(4):1111-7. doi: 10.1109/TBME.2012.2225622. Epub 2012 Oct 18.
10
Prediction of allergenic proteins by means of the concept of Chou's pseudo amino acid composition and a machine learning approach.利用 Chou 的伪氨基酸组成概念和机器学习方法预测过敏原蛋白。
Med Chem. 2013 Feb;9(1):133-7. doi: 10.2174/157340613804488341.