Llaneza-Lago Sergio, Fraser William D, Green Darrell
Biomedical Research Centre, Norwich Medical School, University of East Anglia, Norwich Research Park, Norwich NR4 7TJ, United Kingdom.
Bioanalytical Facility, Norwich Medical School, University of East Anglia, Norwich Research Park, Norwich NR4 7UQ, United Kingdom.
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae665.
Identification of cancer subtypes is a critical step for developing precision medicine. Most cancer subtyping is based on the analysis of RNA sequencing (RNA-seq) data from patient cohorts using unsupervised machine learning methods such as hierarchical cluster analysis, but these computational approaches disregard the heterogeneous composition of individual cancer samples. Here, we used a more sophisticated unsupervised Bayesian model termed latent process decomposition (LPD), which handles individual cancer sample heterogeneity and deconvolutes the structure of transcriptome data to provide clinically relevant information. The work was performed on the pediatric tumor osteosarcoma, which is a prototypical model for a rare and heterogeneous cancer. The LPD model detected three osteosarcoma subtypes. The subtype with the poorest prognosis was validated using independent patient datasets. This new stratification framework will be important for more accurate diagnostic labeling, expediting precision medicine, and improving clinical trial success. Our results emphasize the importance of using more sophisticated machine learning approaches (and for teaching deep learning and artificial intelligence) for RNA-seq data analysis, which may assist drug targeting and clinical management.
识别癌症亚型是开发精准医学的关键步骤。大多数癌症亚型分类是基于使用无监督机器学习方法(如层次聚类分析)对患者队列的RNA测序(RNA-seq)数据进行分析,但这些计算方法忽略了单个癌症样本的异质性组成。在此,我们使用了一种更复杂的无监督贝叶斯模型,称为潜在过程分解(LPD),该模型可处理单个癌症样本的异质性,并对转录组数据结构进行反卷积以提供临床相关信息。这项工作是在小儿肿瘤骨肉瘤上进行的,骨肉瘤是一种罕见且异质性癌症的典型模型。LPD模型检测到三种骨肉瘤亚型。使用独立的患者数据集验证了预后最差的亚型。这个新的分层框架对于更准确的诊断标记、加速精准医学以及提高临床试验成功率将具有重要意义。我们的结果强调了使用更复杂的机器学习方法(以及用于深度学习和人工智能教学)进行RNA-seq数据分析的重要性,这可能有助于药物靶向和临床管理。