Department of Biomedical Engineering, School of Control Science and Engineering, Shandong University, Jinan, Shandong 250061, China.
Department of Orthodontics, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200011, China.
Biochim Biophys Acta Gene Regul Mech. 2023 Jun;1866(2):194911. doi: 10.1016/j.bbagrm.2023.194911. Epub 2023 Feb 16.
Gene regulatory network (GRN) is a model that characterizes the complex relationships between genes and thereby provides an informatics environment to measure the importance of nodes. The evaluation of important nodes in a GRN can effectively refer to their functional implications severing as key players in particular biological processes, such as master regulator and driver gene. Currently, it is mainly based on network topological parameters and focuses only on evaluating a single node individually. However, genes and products play their functions by interacting with each other. It is worth noting that the effects of gene combinations in GRN are not simply additive. Key combinations discovery is of significance in revealing gene sets with important functions. Recently, with the development of single-cell RNA-sequencing (scRNA-seq) technology, we can quantify gene expression profiles of individual cells that provide the potential to identify crucial nodes in gene regulations regarding specific condition, e.g., stem cell differentiation.
In this paper, we propose a bioinformatics method, called Pseudo Knockout Importance (PKI), to quantify the importance of node and node sets in a specific GRN structure using time-course scRNA-seq data. First, we construct ordinary differential equations to approach the gene regulations during cell differentiation. Then we design gene pseudo knockout experiments and define PKI score evaluation criteria based on the coefficient of determination. The importance of nodes can be described as the influence on the ODE system of removing variables. For key gene combinations, PKI is derived as a combinatorial optimization problem of quantifying the in silico gene knockout effects.
Here, we focus our analyses on the specific GRN of embryonic stem cells with time series gene expression profile. To verify the effectiveness and advantage of PKI method, we compare its node importance rankings with other twelve kinds of centrality-based methods, such as degree and Latora closeness. For key node combinations, we compare the results with the method based on minimum dominant set. Moreover, the famous combinations of transcription factors in induced pluripotent stem cell are also employed to verify the vital gene combinations identified by PKI. These results demonstrate the reliability and superiority of the proposed method.
基因调控网络(GRN)是一种描述基因之间复杂关系的模型,为衡量节点重要性提供了信息学环境。GRN 中重要节点的评估可以有效地参考其功能意义,作为特定生物过程中的关键参与者,如主调控因子和驱动基因。目前,它主要基于网络拓扑参数,并且仅侧重于单独评估单个节点。然而,基因和产物通过相互作用发挥其功能。值得注意的是,GRN 中基因组合的影响不是简单的加和。关键组合的发现对于揭示具有重要功能的基因集具有重要意义。最近,随着单细胞 RNA 测序(scRNA-seq)技术的发展,我们可以量化单个细胞的基因表达谱,这为识别特定条件下基因调控中的关键节点提供了潜力,例如干细胞分化。
在本文中,我们提出了一种生物信息学方法,称为伪敲除重要性(PKI),使用时间序列 scRNA-seq 数据来量化特定 GRN 结构中节点和节点集的重要性。首先,我们构建常微分方程来接近细胞分化过程中的基因调控。然后,我们设计基因伪敲除实验,并基于确定系数定义 PKI 评分评估标准。节点的重要性可以描述为去除变量对 ODE 系统的影响。对于关键基因组合,PKI 被推导为量化计算机基因敲除效果的组合优化问题。
在这里,我们将分析重点放在具有时间序列基因表达谱的胚胎干细胞的特定 GRN 上。为了验证 PKI 方法的有效性和优势,我们将其节点重要性排名与其他 12 种基于中心性的方法(如度和 Latora 紧密性)进行了比较。对于关键节点组合,我们将结果与基于最小优势集的方法进行了比较。此外,还使用诱导多能干细胞中的转录因子的著名组合来验证 PKI 识别的重要基因组合。这些结果证明了所提出方法的可靠性和优越性。