Yan Han, Lu Shuhan, Zhang Sanguo
School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, People's Republic of China.
Key Laboratory of Big Data Mining and Knowledge Management, Chinese Academy of Sciences, Beijing, People's Republic of China.
J Appl Stat. 2023 Aug 14;51(10):1843-1860. doi: 10.1080/02664763.2023.2245178. eCollection 2024.
A growing literature suggests that gene expression can be greatly altered in disease conditions, and identifying those changes will improve the understanding of complex diseases such as cancers or diabetes. A prevailing direction in the analysis of gene expression studies the changes in gene pathways which include sets of related genes. Therefore, introducing structured exploration to differential analysis of gene expression networks may lead to meaningful discoveries. The topic of this paper is differential network analysis, which focuses on capturing the differences between two or more precision matrices. We discuss the connection between the thresholding method and the D-trace loss method on differential network analysis in the case that the precision matrices share the common connected components. Based on this connection, we further propose the cluster D-trace loss method which directly estimates the differential network and achieves model selection consistency. Simulation studies demonstrate its improved performance and computational efficiency. Finally, the usefulness of our proposed estimator is demonstrated by a real-data analysis on non-small cell lung cancer.
越来越多的文献表明,在疾病状态下基因表达会发生很大改变,识别这些变化将有助于加深对癌症或糖尿病等复杂疾病的理解。基因表达分析中一个流行的方向是研究基因通路的变化,基因通路包括相关基因集。因此,将结构化探索引入基因表达网络的差异分析可能会带来有意义的发现。本文的主题是差异网络分析,其重点是捕捉两个或多个精度矩阵之间的差异。我们讨论了在精度矩阵共享共同连通分量的情况下,阈值化方法与差异网络分析中的D-迹损失方法之间的联系。基于这种联系,我们进一步提出了聚类D-迹损失方法,该方法直接估计差异网络并实现模型选择一致性。模拟研究证明了其改进的性能和计算效率。最后,通过对非小细胞肺癌的实际数据分析证明了我们提出的估计器的有效性。