The Key Laboratory of Stem Cell Biology, Institute of Health Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China.
Biomed Res Int. 2013;2013:267375. doi: 10.1155/2013/267375. Epub 2013 May 22.
Lung cancer is one of the leading causes of cancer mortality worldwide. The main types of lung cancer are small cell lung cancer (SCLC) and nonsmall cell lung cancer (NSCLC). In this work, a computational method was proposed for identifying lung-cancer-related genes with a shortest path approach in a protein-protein interaction (PPI) network. Based on the PPI data from STRING, a weighted PPI network was constructed. 54 NSCLC- and 84 SCLC-related genes were retrieved from associated KEGG pathways. Then the shortest paths between each pair of these 54 NSCLC genes and 84 SCLC genes were obtained with Dijkstra's algorithm. Finally, all the genes on the shortest paths were extracted, and 25 and 38 shortest genes with a permutation P value less than 0.05 for NSCLC and SCLC were selected for further analysis. Some of the shortest path genes have been reported to be related to lung cancer. Intriguingly, the candidate genes we identified from the PPI network contained more cancer genes than those identified from the gene expression profiles. Furthermore, these genes possessed more functional similarity with the known cancer genes than those identified from the gene expression profiles. This study proved the efficiency of the proposed method and showed promising results.
肺癌是全球癌症死亡的主要原因之一。肺癌的主要类型是小细胞肺癌 (SCLC) 和非小细胞肺癌 (NSCLC)。在这项工作中,提出了一种基于最短路径方法的计算方法,用于在蛋白质-蛋白质相互作用 (PPI) 网络中识别与肺癌相关的基因。基于 STRING 的 PPI 数据,构建了一个加权 PPI 网络。从相关的 KEGG 途径中检索到 54 个 NSCLC 相关基因和 84 个 SCLC 相关基因。然后,使用 Dijkstra 算法获得每对这些 54 个 NSCLC 基因和 84 个 SCLC 基因之间的最短路径。最后,提取最短路径上的所有基因,并选择具有置换 P 值小于 0.05 的 25 个和 38 个 NSCLC 和 SCLC 的最短基因进行进一步分析。一些最短路径基因已被报道与肺癌有关。有趣的是,我们从 PPI 网络中识别出的候选基因比从基因表达谱中识别出的癌症基因更多。此外,与从基因表达谱中识别出的基因相比,这些基因与已知的癌症基因具有更多的功能相似性。这项研究证明了所提出方法的效率,并取得了有希望的结果。