Steyaert Sandra, Pizurica Marija, Nagaraj Divya, Khandelwal Priya, Hernandez-Boussard Tina, Gentles Andrew J, Gevaert Olivier
Stanford Center for Biomedical Informatics Research (BMIR), Department of Medicine, Stanford University.
Department of Computer Science, Stanford University.
Nat Mach Intell. 2023 Apr;5(4):351-362. doi: 10.1038/s42256-023-00633-5. Epub 2023 Apr 6.
Technological advances now make it possible to study a patient from multiple angles with high-dimensional, high-throughput multi-scale biomedical data. In oncology, massive amounts of data are being generated ranging from molecular, histopathology, radiology to clinical records. The introduction of deep learning has significantly advanced the analysis of biomedical data. However, most approaches focus on single data modalities leading to slow progress in methods to integrate complementary data types. Development of effective multimodal fusion approaches is becoming increasingly important as a single modality might not be consistent and sufficient to capture the heterogeneity of complex diseases to tailor medical care and improve personalised medicine. Many initiatives now focus on integrating these disparate modalities to unravel the biological processes involved in multifactorial diseases such as cancer. However, many obstacles remain, including lack of usable data as well as methods for clinical validation and interpretation. Here, we cover these current challenges and reflect on opportunities through deep learning to tackle data sparsity and scarcity, multimodal interpretability, and standardisation of datasets.
技术进步使我们现在能够利用高维、高通量的多尺度生物医学数据从多个角度研究患者。在肿瘤学领域,正在产生大量数据,范围从分子、组织病理学、放射学到临床记录。深度学习的引入显著推动了生物医学数据的分析。然而,大多数方法专注于单一数据模态,导致在整合互补数据类型的方法上进展缓慢。随着单一模态可能不足以一致且充分地捕捉复杂疾病的异质性以定制医疗护理和改善个性化医疗,开发有效的多模态融合方法变得越来越重要。现在许多倡议都集中在整合这些不同的模态,以揭示诸如癌症等多因素疾病所涉及的生物学过程。然而,仍然存在许多障碍,包括缺乏可用数据以及临床验证和解释方法。在此,我们涵盖这些当前挑战,并思考通过深度学习应对数据稀疏性和稀缺性、多模态可解释性以及数据集标准化的机会。