West Daniel, Stepney Susan, Hancock Y
Department of Computer Science, University of York, Heslington, York, YO10 5DD, UK.
School of Physics, Engineering and Technology, University of York, Heslington, York, YO10 5DD, UK.
Sci Rep. 2025 Jan 4;15(1):773. doi: 10.1038/s41598-024-83708-6.
Prostate cancer is a disease which poses an interesting clinical question: Should it be treated? Only a small subset of prostate cancers are aggressive and require removal and treatment to prevent metastatic spread. However, conventional diagnostics remain challenged to risk-stratify such patients; hence, new methods of approach to biomolecularly sub-classify the disease are needed. Here we use an unsupervised self-organising map approach to analyse live-cell Raman spectroscopy data obtained from prostate cell-lines; our aim is to exemplify this method to sub-stratify, at the single-cell-level, the cancer disease state using high-dimensional datasets with minimal preprocessing. The results demonstrate a new sub-clustering of the prostate cancer cell-line into two groups-protein-rich and lipid-rich sub-cellular components-which we believe to be mechanistically linked. This finding shows the potential for unsupervised machine learning to discover distinct disease-state features for more accurate characterisation of highly heterogeneous prostate cancer. Applications may lead to more targeted diagnoses, prognoses and clinical treatment decisions via molecularly-informed stratification that would benefit patients. A method that could discover distinct disease-state features that are mechanistically linked could also assist in the development of more effective broad-spectrum treatments that simultaneously target linked disease-state processes.
是否应该对其进行治疗?只有一小部分前列腺癌具有侵袭性,需要切除和治疗以防止转移扩散。然而,传统诊断方法在对这类患者进行风险分层方面仍然面临挑战;因此,需要新的方法从生物分子角度对该疾病进行亚分类。在此,我们使用无监督自组织映射方法来分析从前列腺细胞系获得的活细胞拉曼光谱数据;我们的目标是通过最少的预处理,利用高维数据集在单细胞水平上对癌症疾病状态进行亚分层,以此来例证这种方法。结果表明,前列腺癌细胞系可新分为两组——富含蛋白质和富含脂质的亚细胞成分——我们认为这两组在机制上存在关联。这一发现显示了无监督机器学习在发现不同疾病状态特征以更准确地表征高度异质性前列腺癌方面的潜力。其应用可能通过分子信息分层实现更有针对性的诊断、预后评估和临床治疗决策,从而使患者受益。一种能够发现具有机制关联的不同疾病状态特征的方法,也有助于开发更有效的广谱治疗方法,同时针对相关的疾病状态过程。