Zhong Jiayuan, Ding Dandan, Liu Juntan, Liu Rui, Chen Pei
School of Mathematics and Big Data, Foshan University, Foshan 528000, China.
School of Mathematics, South China University of technology, Guangzhou 510640, China.
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad028.
Complex biological systems do not always develop smoothly but occasionally undergo a sharp transition; i.e. there exists a critical transition or tipping point at which a drastic qualitative shift occurs. Hunting for such a critical transition is important to prevent or delay the occurrence of catastrophic consequences, such as disease deterioration. However, the identification of the critical state for complex biological systems is still a challenging problem when using high-dimensional small sample data, especially where only a certain sample is available, which often leads to the failure of most traditional statistical approaches. In this study, a novel quantitative method, sample-perturbed network entropy (SPNE), is developed based on the sample-perturbed directed network to reveal the critical state of complex biological systems at the single-sample level. Specifically, the SPNE approach effectively quantifies the perturbation effect caused by a specific sample on the directed network in terms of network entropy and thus captures the criticality of biological systems. This model-free method was applied to both bulk and single-cell expression data. Our approach was validated by successfully detecting the early warning signals of the critical states for six real datasets, including four tumor datasets from The Cancer Genome Atlas (TCGA) and two single-cell datasets of cell differentiation. In addition, the functional analyses of signaling biomarkers demonstrated the effectiveness of the analytical and computational results.
复杂生物系统并非总是平稳发展,而是偶尔会经历急剧转变;也就是说,存在一个关键转变或临界点,在该点会发生剧烈的质变。寻找这样的关键转变对于预防或延缓灾难性后果(如疾病恶化)的发生很重要。然而,在使用高维小样本数据时,尤其是在只有特定样本可用的情况下,识别复杂生物系统的临界状态仍然是一个具有挑战性的问题,这常常导致大多数传统统计方法失效。在本研究中,基于样本扰动有向网络开发了一种新的定量方法——样本扰动网络熵(SPNE),以在单样本水平上揭示复杂生物系统的临界状态。具体而言,SPNE方法根据网络熵有效地量化了特定样本对有向网络造成的扰动效应,从而捕捉生物系统的临界性。这种无模型方法被应用于批量和单细胞表达数据。我们的方法通过成功检测六个真实数据集(包括来自癌症基因组图谱(TCGA)的四个肿瘤数据集和两个细胞分化的单细胞数据集)的临界状态预警信号得到了验证。此外,信号生物标志物的功能分析证明了分析和计算结果的有效性。