Bukkuri Anuraag, Andor Noemi, Darcy Isabel K
Department of Integrated Mathematical Oncology, Moffitt Cancer Center, Tampa, FL, United States.
Department of Mathematics, University of Iowa, Iowa City, IA, United States.
Front Artif Intell. 2021 Apr 13;4:659037. doi: 10.3389/frai.2021.659037. eCollection 2021.
The emergence of the information age in the last few decades brought with it an explosion of biomedical data. But with great power comes great responsibility: there is now a pressing need for new data analysis algorithms to be developed to make sense of the data and transform this information into knowledge which can be directly translated into the clinic. Topological data analysis (TDA) provides a promising path forward: using tools from the mathematical field of algebraic topology, TDA provides a framework to extract insights into the often high-dimensional, incomplete, and noisy nature of biomedical data. Nowhere is this more evident than in the field of oncology, where patient-specific data is routinely presented to clinicians in a variety of forms, from imaging to single cell genomic sequencing. In this review, we focus on applications involving persistent homology, one of the main tools of TDA. We describe some recent successes of TDA in oncology, specifically in predicting treatment responses and prognosis, tumor segmentation and computer-aided diagnosis, disease classification, and cellular architecture determination. We also provide suggestions on avenues for future research including utilizing TDA to analyze cancer time-series data such as gene expression changes during pathogenesis, investigation of the relation between angiogenic vessel structure and treatment efficacy from imaging data, and experimental confirmation that geometric and topological connectivity implies functional connectivity in the context of cancer.
过去几十年信息时代的出现带来了生物医学数据的爆炸式增长。但能力越大,责任越大:现在迫切需要开发新的数据分析算法,以便理解这些数据,并将这些信息转化为可直接应用于临床的知识。拓扑数据分析(TDA)提供了一条很有前景的前进道路:利用代数拓扑数学领域的工具,TDA提供了一个框架,用于深入了解生物医学数据通常具有的高维、不完整和有噪声的特性。这一点在肿瘤学领域最为明显,在该领域,特定患者的数据通常以各种形式呈现给临床医生,从成像到单细胞基因组测序。在本综述中,我们重点关注涉及持久同调的应用,持久同调是TDA的主要工具之一。我们描述了TDA在肿瘤学方面最近取得的一些成功,特别是在预测治疗反应和预后、肿瘤分割与计算机辅助诊断、疾病分类以及细胞结构确定方面。我们还为未来的研究方向提供了建议,包括利用TDA分析癌症时间序列数据,如发病过程中的基因表达变化,从成像数据研究血管生成结构与治疗效果之间的关系,以及通过实验证实几何和拓扑连通性在癌症背景下意味着功能连通性。