Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan, Hubei 430079, P.R. China.
School of Computer, Central China Normal University, Wuhan, Hubei 430079, P.R. China.
Bioinformatics. 2024 Mar 29;40(4). doi: 10.1093/bioinformatics/btae180.
The crisis of antibiotic resistance, which causes antibiotics used to treat bacterial infections to become less effective, has emerged as one of the foremost challenges to public health. Identifying the properties of antibiotic resistance genes (ARGs) is an essential way to mitigate this issue. Although numerous methods have been proposed for this task, most of these approaches concentrate solely on predicting antibiotic class, disregarding other important properties of ARGs. In addition, existing methods for simultaneously predicting multiple properties of ARGs fail to account for the causal relationships among these properties, limiting the predictive performance.
In this study, we propose a causality-guided framework for annotating properties of ARGs, in which causal inference is utilized for representation learning. More specifically, the hidden biological patterns determining the properties of ARGs are described by a Gaussian Mixture Model, and procedure of causal representation learning is used to derive the hidden features. In addition, a causal graph among different properties is constructed to capture the causal relationships among properties of ARGs, which is integrated into the task of annotating properties of ARGs. The experimental results on a real-world dataset demonstrate the effectiveness of the proposed framework on the task of annotating properties of ARGs.
The data and source codes are available in GitHub at https://github.com/David-WZhao/CausalARG.
抗生素耐药性危机已经成为公共卫生面临的首要挑战之一,因为它会导致原本用于治疗细菌感染的抗生素效果降低。确定抗生素耐药基因(ARGs)的特性是缓解这一问题的重要方法。尽管已经提出了许多用于实现这一任务的方法,但这些方法大多只专注于预测抗生素类别,而忽略了 ARGs 的其他重要特性。此外,现有的同时预测 ARGs 多个特性的方法未能考虑到这些特性之间的因果关系,限制了预测性能。
在本研究中,我们提出了一种基于因果关系的 ARGs 特性标注框架,该框架利用因果推理进行表示学习。更具体地说,通过高斯混合模型描述决定 ARGs 特性的隐藏生物模式,并使用因果表示学习过程来推导出隐藏特征。此外,还构建了不同特性之间的因果关系图,以捕捉 ARGs 特性之间的因果关系,并将其集成到 ARGs 特性标注任务中。在真实数据集上的实验结果表明,所提出的框架在 ARGs 特性标注任务上是有效的。
数据和源代码可在 GitHub 上获得,网址为 https://github.com/David-WZhao/CausalARG。