Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA.
Machine Learning Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA.
Nucleic Acids Res. 2023 Apr 24;51(7):e38. doi: 10.1093/nar/gkad053.
Inference of global gene regulatory networks from omics data is a long-term goal of systems biology. Most methods developed for inferring transcription factor (TF)-gene interactions either relied on a small dataset or used snapshot data which is not suitable for inferring a process that is inherently temporal. Here, we developed a new computational method that combines neural networks and multi-task learning to predict RNA velocity rather than gene expression values. This allows our method to overcome many of the problems faced by prior methods leading to more accurate and more comprehensive set of identified regulatory interactions. Application of our method to atlas scale single cell data from 6 HuBMAP tissues led to several validated and novel predictions and greatly improved on prior methods proposed for this task.
从组学数据中推断全局基因调控网络是系统生物学的长期目标。大多数用于推断转录因子(TF)-基因相互作用的方法要么依赖于小数据集,要么使用不适合推断本质上是时间性过程的快照数据。在这里,我们开发了一种新的计算方法,将神经网络和多任务学习相结合来预测 RNA 速度,而不是基因表达值。这使得我们的方法能够克服以前的方法所面临的许多问题,从而得到更准确和更全面的识别调控相互作用集。将我们的方法应用于来自 6 个 HuBMAP 组织的图谱规模单细胞数据,导致了几个经过验证的新预测,并大大优于为此任务提出的先前方法。