Li Peiyu, Jing Shaowen, Tian Guo, Jiang Jing
Network and Informatization Office, Henan University of Science and Technology, Luoyang, China.
Henan Engineering Laboratory of Cloud Computing Data Center Network Key Technologies, Luoyang, 471023, China.
Sci Rep. 2025 Jan 31;15(1):3925. doi: 10.1038/s41598-025-87438-1.
Complex diseases may not always progress in a gradual manner. In the early stages of complex diseases, obvious symptoms are usually not observable, but there is a commonality: there is a brief state of predisease between the progression from normal state to disease state, which usually includes three stages: normal state, critical state of predisease, and disease state. Identifying this critical state, especially with a single sample from an individual, remains a difficult task. In this study, we applied three methods, i.e., single-sample Jensen-Shannon Divergence (sJSD), network information gain (NIG), and temporal network flow entropy (TNFE) method, to a simulated dataset and type 2 diabetes (GSE13268 and GSE13269). Three different methods were utilized to create indexes, including the Inconsistency Index (ICI), NIG, and TNFE, to measure the overall disruption caused by individual samples compared to a set of reference samples. Changes in these indexes were used to identify critical states during the progression of the disease. Results from the numerical simulations show the effectiveness of the three methods. All the methods can detect two critical states based on a single sample, which are respectively at 8 weeks and 16 weeks for GSE13268 and at 4 weeks and 16 weeks for GSE13269, indicating the critical states before deterioration can be detected and the dynamic network biomarkers (DNBs) can be identified successfully. But there are differences in the sensitivity of predictive indicators based on the three methods. The identified dynamic network biomarkers are also significantly different. In addition, the computational principles of the three methods are compared. The proposed three methods can effectively detect the critical state and identify the DNB, solely based on a single sample. The three methods are data-driven and model-free on an individual basis. sJSD method is more sensitive to the critical state, while NIG and TNFE methods are more robust and effective. They can therefore not only help future studies of personalized disease diagnosis but also provide a better insight into clinical practice.
复杂疾病并不总是以渐进的方式发展。在复杂疾病的早期阶段,通常观察不到明显症状,但存在一个共性:从正常状态发展到疾病状态之间存在一个短暂的疾病前期状态,该状态通常包括三个阶段:正常状态、疾病前期临界状态和疾病状态。识别这个临界状态,尤其是从个体的单个样本中识别,仍然是一项艰巨的任务。在本研究中,我们将三种方法,即单样本詹森-香农散度(sJSD)、网络信息增益(NIG)和时间网络流熵(TNFE)方法,应用于一个模拟数据集以及2型糖尿病数据集(GSE13268和GSE13269)。利用三种不同方法创建指标,包括不一致指数(ICI)、NIG和TNFE,以衡量与一组参考样本相比单个样本所造成的整体干扰。这些指标的变化被用于识别疾病进展过程中的临界状态。数值模拟结果显示了这三种方法的有效性。所有方法都能基于单个样本检测到两个临界状态,对于GSE13268分别在第8周和第16周,对于GSE13269分别在第4周和第16周,这表明可以检测到恶化前的临界状态并成功识别动态网络生物标志物(DNB)。但基于这三种方法的预测指标在敏感性上存在差异。所识别的动态网络生物标志物也有显著不同。此外,还比较了这三种方法的计算原理。所提出的三种方法仅基于单个样本就能有效地检测临界状态并识别DNB。这三种方法是数据驱动且基于个体无模型的。sJSD方法对临界状态更敏感,而NIG和TNFE方法更稳健有效。因此,它们不仅有助于未来个性化疾病诊断的研究,还能为临床实践提供更好的见解。