Peng Xueqing, Qiao Rui, Li Peiluan, Chen Luonan
School of Mathematics and Statistics, Henan University of Science and Technology, Luoyang, China.
Longmen Laboratory, Luoyang, Henan, China.
PLoS Comput Biol. 2025 Jul 29;21(7):e1013336. doi: 10.1371/journal.pcbi.1013336. eCollection 2025 Jul.
Typically, in dynamic biological processes, there is a critical state or tipping point that marks the transition from one stable state to another, surpassing which a considerable qualitative shift takes place. Identifying this tipping point and its driving network is essential to avert or delay disastrous outcomes. However, most traditional approaches built upon undirected networks still suffer from a lack of robustness and effectiveness when implemented based on high-dimensional small-sample data, especially for single-cell data. To address this challenge, we develop a directed network flow entropy (DNFE) method, which can transform measured omics data into a directed network. This method is applicable to both single-cell RNA-sequencing (scRNA-seq) and bulk data. Applying this algorithm to six real datasets, including three single-cell datasets, two bulk tumor datasets, and a blood dataset, the method is proved to be effective not only in identifying critical states, as well as their dynamic network biomarkers, but also in helping explore regulatory relationships between genes. Numerical simulation results demonstrate that the DNFE algorithm is robust across various noise levels and outperforms existing methods in detecting tipping points. Furthermore, the numerical simulations for 100-node and 1000-node gene regulatory networks illustrate the method's application for large-scale data. The DNFE method predicts active transcription factors, and further identified "dark genes", which are usually overlooked with traditional methods.
通常,在动态生物过程中,存在一个临界状态或转折点,它标志着从一种稳定状态向另一种稳定状态的转变,超过这个点就会发生相当大的质变。识别这个转折点及其驱动网络对于避免或延迟灾难性后果至关重要。然而,大多数基于无向网络的传统方法在基于高维小样本数据(特别是单细胞数据)实施时,仍然缺乏稳健性和有效性。为了应对这一挑战,我们开发了一种有向网络流熵(DNFE)方法,该方法可以将测得的组学数据转化为有向网络。此方法适用于单细胞RNA测序(scRNA-seq)和批量数据。将该算法应用于六个真实数据集,包括三个单细胞数据集、两个批量肿瘤数据集和一个血液数据集,结果证明该方法不仅在识别临界状态及其动态网络生物标志物方面有效,而且有助于探索基因之间的调控关系。数值模拟结果表明,DNFE算法在各种噪声水平下都具有稳健性,并且在检测转折点方面优于现有方法。此外,对100节点和1000节点基因调控网络的数值模拟说明了该方法在大规模数据中的应用。DNFE方法预测了活跃的转录因子,并进一步识别出了“暗基因”,这些基因通常会被传统方法忽视。