• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

VarWalker:基于下一代测序数据对假定癌症基因进行个性化突变网络分析

VarWalker: personalized mutation network analysis of putative cancer genes from next-generation sequencing data.

作者信息

Jia Peilin, Zhao Zhongming

机构信息

Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America ; Center for Quantitative Sciences, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America.

Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America ; Center for Quantitative Sciences, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America ; Department of Cancer Biology, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America ; Department of Psychiatry, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America.

出版信息

PLoS Comput Biol. 2014 Feb 6;10(2):e1003460. doi: 10.1371/journal.pcbi.1003460. eCollection 2014 Feb.

DOI:10.1371/journal.pcbi.1003460
PMID:24516372
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3916227/
Abstract

A major challenge in interpreting the large volume of mutation data identified by next-generation sequencing (NGS) is to distinguish driver mutations from neutral passenger mutations to facilitate the identification of targetable genes and new drugs. Current approaches are primarily based on mutation frequencies of single-genes, which lack the power to detect infrequently mutated driver genes and ignore functional interconnection and regulation among cancer genes. We propose a novel mutation network method, VarWalker, to prioritize driver genes in large scale cancer mutation data. VarWalker fits generalized additive models for each sample based on sample-specific mutation profiles and builds on the joint frequency of both mutation genes and their close interactors. These interactors are selected and optimized using the Random Walk with Restart algorithm in a protein-protein interaction network. We applied the method in >300 tumor genomes in two large-scale NGS benchmark datasets: 183 lung adenocarcinoma samples and 121 melanoma samples. In each cancer, we derived a consensus mutation subnetwork containing significantly enriched consensus cancer genes and cancer-related functional pathways. These cancer-specific mutation networks were then validated using independent datasets for each cancer. Importantly, VarWalker prioritizes well-known, infrequently mutated genes, which are shown to interact with highly recurrently mutated genes yet have been ignored by conventional single-gene-based approaches. Utilizing VarWalker, we demonstrated that network-assisted approaches can be effectively adapted to facilitate the detection of cancer driver genes in NGS data.

摘要

解读通过下一代测序(NGS)识别出的大量突变数据面临的一个主要挑战是区分驱动突变和中性乘客突变,以促进可靶向基因和新药的识别。当前方法主要基于单基因的突变频率,这缺乏检测罕见突变驱动基因的能力,并且忽略了癌症基因之间的功能互联和调控。我们提出了一种新的突变网络方法VarWalker,用于在大规模癌症突变数据中对驱动基因进行优先级排序。VarWalker基于样本特异性突变谱为每个样本拟合广义相加模型,并基于突变基因及其紧密相互作用因子的联合频率构建模型。这些相互作用因子在蛋白质-蛋白质相互作用网络中使用带重启的随机游走算法进行选择和优化。我们将该方法应用于两个大规模NGS基准数据集中的300多个肿瘤基因组:183个肺腺癌样本和121个黑色素瘤样本。在每种癌症中,我们得出了一个包含显著富集的一致性癌症基因和癌症相关功能通路的一致性突变子网。然后使用每种癌症的独立数据集对这些癌症特异性突变网络进行验证。重要的是,VarWalker对众所周知的、罕见突变的基因进行了优先级排序,这些基因显示与高度频繁突变的基因相互作用,但被传统的基于单基因的方法所忽略。利用VarWalker,我们证明了网络辅助方法可以有效地用于促进在NGS数据中检测癌症驱动基因。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3740/3916227/ce27252f3175/pcbi.1003460.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3740/3916227/d7c7b376a50e/pcbi.1003460.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3740/3916227/f7cb1a33d941/pcbi.1003460.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3740/3916227/72957ef08aa0/pcbi.1003460.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3740/3916227/28c52fa6dbd6/pcbi.1003460.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3740/3916227/ce27252f3175/pcbi.1003460.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3740/3916227/d7c7b376a50e/pcbi.1003460.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3740/3916227/f7cb1a33d941/pcbi.1003460.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3740/3916227/72957ef08aa0/pcbi.1003460.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3740/3916227/28c52fa6dbd6/pcbi.1003460.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3740/3916227/ce27252f3175/pcbi.1003460.g005.jpg

相似文献

1
VarWalker: personalized mutation network analysis of putative cancer genes from next-generation sequencing data.VarWalker:基于下一代测序数据对假定癌症基因进行个性化突变网络分析
PLoS Comput Biol. 2014 Feb 6;10(2):e1003460. doi: 10.1371/journal.pcbi.1003460. eCollection 2014 Feb.
2
LNDriver: identifying driver genes by integrating mutation and expression data based on gene-gene interaction network.LNDriver:基于基因-基因相互作用网络整合突变和表达数据来识别驱动基因。
BMC Bioinformatics. 2016 Dec 23;17(Suppl 17):467. doi: 10.1186/s12859-016-1332-y.
3
Machine learning random forest for predicting oncosomatic variant NGS analysis.机器学习随机森林预测肿瘤体细胞变异 NGS 分析。
Sci Rep. 2021 Nov 8;11(1):21820. doi: 10.1038/s41598-021-01253-y.
4
Identifying driver mutations from sequencing data of heterogeneous tumors in the era of personalized genome sequencing.在个性化基因组测序时代,从异质性肿瘤的测序数据中识别驱动突变。
Brief Bioinform. 2014 Mar;15(2):244-55. doi: 10.1093/bib/bbt042. Epub 2013 Jul 1.
5
Finding co-mutated genes and candidate cancer genes in cancer genomes by stratified false discovery rate control.通过分层错误发现率控制在癌症基因组中寻找共突变基因和候选癌症基因。
Mol Biosyst. 2011 Apr;7(4):1158-66. doi: 10.1039/c0mb00211a. Epub 2011 Jan 28.
6
De novo discovery of mutated driver pathways in cancer.癌症中突变驱动途径的从头发现。
Genome Res. 2012 Feb;22(2):375-85. doi: 10.1101/gr.120477.111. Epub 2011 Jun 7.
7
Cancer Gene Discovery by Network Analysis of Somatic Mutations Using the MUFFINN Server.使用MUFFINN服务器通过体细胞突变的网络分析发现癌症基因
Methods Mol Biol. 2019;1907:37-50. doi: 10.1007/978-1-4939-8967-6_3.
8
Identification of mutated core cancer modules by integrating somatic mutation, copy number variation, and gene expression data.通过整合体细胞突变、拷贝数变异和基因表达数据来鉴定核心癌症突变模块。
BMC Syst Biol. 2013;7 Suppl 2(Suppl 2):S4. doi: 10.1186/1752-0509-7-S2-S4. Epub 2013 Oct 14.
9
DriverRWH: discovering cancer driver genes by random walk on a gene mutation hypergraph.DriverRWH:通过基因突变超图上的随机游走发现癌症驱动基因。
BMC Bioinformatics. 2022 Jul 13;23(1):277. doi: 10.1186/s12859-022-04788-7.
10
A computational approach for the discovery of significant cancer genes by weighted mutation and asymmetric spreading strength in networks.通过加权突变和网络中不对称扩展强度发现重要癌症基因的计算方法。
Sci Rep. 2021 Dec 7;11(1):23551. doi: 10.1038/s41598-021-02671-8.

引用本文的文献

1
MNMO: discover driver genes from a multi-omics data based-multi-layer network.MNMO:从基于多组学数据的多层网络中发现驱动基因。
Bioinformatics. 2025 Mar 29;41(4). doi: 10.1093/bioinformatics/btaf134.
2
Enhancing Molecular Network-Based Cancer Driver Gene Prediction Using Machine Learning Approaches: Current Challenges and Opportunities.使用机器学习方法增强基于分子网络的癌症驱动基因预测:当前挑战与机遇
J Cell Mol Med. 2025 Jan;29(1):e70351. doi: 10.1111/jcmm.70351.
3
Pharmacogenomic profiling of the South Korean population: Insights and implications for personalized medicine.

本文引用的文献

1
Computational approaches to identify functional genetic variants in cancer genomes.计算方法在癌症基因组中识别功能遗传变异。
Nat Methods. 2013 Aug;10(8):723-9. doi: 10.1038/nmeth.2562.
2
Prioritization of candidate disease genes by topological similarity between disease and protein diffusion profiles.基于疾病与蛋白质扩散分布之间的拓扑相似性对候选疾病基因进行优先级排序。
BMC Bioinformatics. 2013;14 Suppl 5(Suppl 5):S5. doi: 10.1186/1471-2105-14-S5-S5. Epub 2013 Apr 10.
3
Disease gene identification by random walk on multigraphs merging heterogeneous genomic and phenotype data.
韩国人群的药物基因组学分析:对个性化医疗的见解与启示。
Front Pharmacol. 2024 Dec 3;15:1476765. doi: 10.3389/fphar.2024.1476765. eCollection 2024.
4
Variant Impact Predictor database (VIPdb), version 2: trends from three decades of genetic variant impact predictors.变异影响预测器数据库(VIPdb),版本 2:三十年来遗传变异影响预测器的趋势。
Hum Genomics. 2024 Aug 28;18(1):90. doi: 10.1186/s40246-024-00663-z.
5
Variant Impact Predictor database (VIPdb), version 2: Trends from 25 years of genetic variant impact predictors.变异影响预测数据库(VIPdb),版本2:25年基因变异影响预测的趋势
bioRxiv. 2024 Jun 28:2024.06.25.600283. doi: 10.1101/2024.06.25.600283.
6
DriverMP enables improved identification of cancer driver genes.DriverMP 可提高癌症驱动基因的识别能力。
Gigascience. 2022 Dec 28;12. doi: 10.1093/gigascience/giad106. Epub 2023 Dec 13.
7
MMPatho: Leveraging Multilevel Consensus and Evolutionary Information for Enhanced Missense Mutation Pathogenic Prediction.MMPatho:利用多层次共识和进化信息提高错义突变致病性预测。
J Chem Inf Model. 2023 Nov 27;63(22):7239-7257. doi: 10.1021/acs.jcim.3c00950. Epub 2023 Nov 10.
8
Unraveling the Drivers of Tumorigenesis in the Context of Evolution: Theoretical Models and Bioinformatics Tools.解析进化背景下肿瘤发生的驱动因素:理论模型和生物信息学工具。
J Mol Evol. 2023 Aug;91(4):405-423. doi: 10.1007/s00239-023-10117-0. Epub 2023 May 29.
9
Mapping the common gene networks that underlie related diseases.绘制相关疾病的共同基因网络图谱。
Nat Protoc. 2023 Jun;18(6):1745-1759. doi: 10.1038/s41596-022-00797-1. Epub 2023 Jan 18.
10
SYSMut: decoding the functional significance of rare somatic mutations in cancer.SYSMut:解码癌症中罕见体细胞突变的功能意义。
Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac280.
基于合并异构基因组和表型数据的多图随机游走的疾病基因鉴定。
BMC Genomics. 2012;13 Suppl 7(Suppl 7):S27. doi: 10.1186/1471-2164-13-S7-S27. Epub 2012 Dec 13.
4
Comprehensive DNA methylation and extensive mutation analyses reveal an association between the CpG island methylator phenotype and oncogenic mutations in gastric cancers.全面的 DNA 甲基化和广泛的突变分析揭示了 CpG 岛甲基化表型与胃癌中致癌突变之间的关联。
Cancer Lett. 2013 Mar 1;330(1):33-40. doi: 10.1016/j.canlet.2012.11.022. Epub 2012 Nov 27.
5
Mapping the hallmarks of lung adenocarcinoma with massively parallel sequencing.大规模平行测序绘制肺腺癌特征图谱。
Cell. 2012 Sep 14;150(6):1107-20. doi: 10.1016/j.cell.2012.08.029.
6
HYST: a hybrid set-based test for genome-wide association studies, with application to protein-protein interaction-based association analysis.HYST:一种基于混合集的全基因组关联研究测试方法,应用于基于蛋白质-蛋白质相互作用的关联分析。
Am J Hum Genet. 2012 Sep 7;91(3):478-88. doi: 10.1016/j.ajhg.2012.08.004.
7
A landscape of driver mutations in melanoma.黑色素瘤中的驱动基因突变全景。
Cell. 2012 Jul 20;150(2):251-63. doi: 10.1016/j.cell.2012.06.024.
8
Computational tools for prioritizing candidate genes: boosting disease gene discovery.计算工具在候选基因优先级排序中的应用:提高疾病基因发现的效率。
Nat Rev Genet. 2012 Jul 3;13(8):523-36. doi: 10.1038/nrg3253.
9
Routine multiplex mutational profiling of melanomas enables enrollment in genotype-driven therapeutic trials.对黑色素瘤进行常规多重突变分析可使患者入组到基于基因分型的治疗试验中。
PLoS One. 2012;7(4):e35309. doi: 10.1371/journal.pone.0035309. Epub 2012 Apr 20.
10
An experimental investigation of kernels on graphs for collaborative recommendation and semisupervised classification.基于图核的协同推荐和半监督分类的实验研究。
Neural Netw. 2012 Jul;31:53-72. doi: 10.1016/j.neunet.2012.03.001. Epub 2012 Mar 20.