Lecca Paola
Faculty of Computer Science, Free University of Bozen-Bolzano, Piazza Domenicani, Bolzano, Italy.
Front Bioinform. 2021 Sep 22;1:746712. doi: 10.3389/fbinf.2021.746712. eCollection 2021.
Most machine learning-based methods predict outcomes rather than understanding causality. Machine learning methods have been proved to be efficient in finding correlations in data, but unskilful to determine causation. This issue severely limits the applicability of machine learning methods to infer the causal relationships between the entities of a biological network, and more in general of any dynamical system, such as medical intervention strategies and clinical outcomes system, that is representable as a network. From the perspective of those who want to use the results of network inference not only to understand the mechanisms underlying the dynamics, but also to understand how the network reacts to external stimuli (e. g. environmental factors, therapeutic treatments), tools that can understand the causal relationships between data are highly demanded. Given the increasing popularity of machine learning techniques in computational biology and the recent literature proposing the use of machine learning techniques for the inference of biological networks, we would like to present the challenges that mathematics and computer science research faces in generalising machine learning to an approach capable of understanding causal relationships, and the prospects that achieving this will open up for the medical application domains of systems biology, the main paradigm of which is precisely network biology at any physical scale.
大多数基于机器学习的方法是预测结果而非理解因果关系。机器学习方法已被证明在发现数据中的相关性方面很有效,但在确定因果关系方面并不擅长。这个问题严重限制了机器学习方法在推断生物网络(更一般地说,任何动态系统,如可表示为网络的医学干预策略和临床结果系统)中实体之间因果关系的适用性。从那些不仅想利用网络推断结果来理解动态背后的机制,还想了解网络如何对外界刺激(如环境因素、治疗方法)做出反应的人的角度来看,非常需要能够理解数据之间因果关系的工具。鉴于机器学习技术在计算生物学中越来越受欢迎,以及最近有文献提出使用机器学习技术来推断生物网络,我们想提出数学和计算机科学研究在将机器学习推广为一种能够理解因果关系的方法时所面临的挑战,以及实现这一点将为系统生物学的医学应用领域带来的前景,其主要范式正是任何物理尺度下的网络生物学。