• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过整合功能和结构信息的驱动基因发现网络嵌入框架。

Network embedding framework for driver gene discovery by combining functional and structural information.

机构信息

School of Computer Science, Qufu Normal University, Rizhao, 27826, China.

出版信息

BMC Genomics. 2023 Jul 29;24(1):426. doi: 10.1186/s12864-023-09515-x.

DOI:10.1186/s12864-023-09515-x
PMID:37516822
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10386255/
Abstract

Comprehensive analysis of multiple data sets can identify potential driver genes for various cancers. In recent years, driver gene discovery based on massive mutation data and gene interaction networks has attracted increasing attention, but there is still a need to explore combining functional and structural information of genes in protein interaction networks to identify driver genes. Therefore, we propose a network embedding framework combining functional and structural information to identify driver genes. Firstly, we combine the mutation data and gene interaction networks to construct mutation integration network using network propagation algorithm. Secondly, the struc2vec model is used for extracting gene features from the mutation integration network, which contains both gene's functional and structural information. Finally, machine learning algorithms are utilized to identify the driver genes. Compared with the previous four excellent methods, our method can find gene pairs that are distant from each other through structural similarities and has better performance in identifying driver genes for 12 cancers in the cancer genome atlas. At the same time, we also conduct a comparative analysis of three gene interaction networks, three gene standard sets, and five machine learning algorithms. Our framework provides a new perspective for feature selection to identify novel driver genes.

摘要

综合分析多个数据集可以识别各种癌症的潜在驱动基因。近年来,基于大量突变数据和基因相互作用网络的驱动基因发现引起了越来越多的关注,但仍需要探索结合蛋白质相互作用网络中基因的功能和结构信息来识别驱动基因。因此,我们提出了一种结合功能和结构信息的网络嵌入框架来识别驱动基因。首先,我们结合突变数据和基因相互作用网络,使用网络传播算法构建突变整合网络。其次,使用 struc2vec 模型从突变整合网络中提取基因特征,其中包含基因的功能和结构信息。最后,利用机器学习算法识别驱动基因。与之前的四种优秀方法相比,我们的方法可以通过结构相似性找到彼此相距较远的基因对,并且在识别癌症基因组图谱中的 12 种癌症的驱动基因方面具有更好的性能。同时,我们还对三种基因相互作用网络、三种基因标准集和五种机器学习算法进行了对比分析。我们的框架为识别新的驱动基因提供了一个新的特征选择视角。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48bc/10386255/da6d60a4f0ee/12864_2023_9515_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48bc/10386255/0a55185bc972/12864_2023_9515_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48bc/10386255/3532933e315b/12864_2023_9515_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48bc/10386255/0245c21fa756/12864_2023_9515_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48bc/10386255/5ff603e978bd/12864_2023_9515_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48bc/10386255/723c64429c38/12864_2023_9515_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48bc/10386255/a7035cc694ba/12864_2023_9515_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48bc/10386255/a09f03907a81/12864_2023_9515_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48bc/10386255/da6d60a4f0ee/12864_2023_9515_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48bc/10386255/0a55185bc972/12864_2023_9515_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48bc/10386255/3532933e315b/12864_2023_9515_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48bc/10386255/0245c21fa756/12864_2023_9515_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48bc/10386255/5ff603e978bd/12864_2023_9515_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48bc/10386255/723c64429c38/12864_2023_9515_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48bc/10386255/a7035cc694ba/12864_2023_9515_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48bc/10386255/a09f03907a81/12864_2023_9515_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48bc/10386255/da6d60a4f0ee/12864_2023_9515_Fig8_HTML.jpg

相似文献

1
Network embedding framework for driver gene discovery by combining functional and structural information.通过整合功能和结构信息的驱动基因发现网络嵌入框架。
BMC Genomics. 2023 Jul 29;24(1):426. doi: 10.1186/s12864-023-09515-x.
2
Two-stage-vote ensemble framework based on integration of mutation data and gene interaction network for uncovering driver genes.基于突变数据和基因互作网络集成的两阶段投票集成框架,用于发现驱动基因。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab429.
3
Identifying overlapping mutated driver pathways by constructing gene networks in cancer.通过构建癌症基因网络来识别重叠的突变驱动通路。
BMC Bioinformatics. 2015;16 Suppl 5(Suppl 5):S3. doi: 10.1186/1471-2105-16-S5-S3. Epub 2015 Mar 18.
4
Prioritization of cancer driver gene with prize-collecting steiner tree by introducing an edge weighted strategy in the personalized gene interaction network.在个性化基因交互网络中引入边加权策略,通过收集 Steiner 树来对癌症驱动基因进行优先级排序。
BMC Bioinformatics. 2022 Aug 16;23(1):341. doi: 10.1186/s12859-022-04802-y.
5
GenHITS: A network science approach to driver gene detection in human regulatory network using gene's influence evaluation.GenHITS:一种利用基因影响力评估的网络科学方法,用于检测人类调控网络中的驱动基因。
J Biomed Inform. 2021 Feb;114:103661. doi: 10.1016/j.jbi.2020.103661. Epub 2020 Dec 14.
6
mAcancer-Net: Identification of mA-mediated cancer driver genes from gene-site heterogeneous network.mA 癌基因网络:从基因-位点异质网络中鉴定 mA 介导的癌症驱动基因。
Methods. 2022 Jul;203:125-138. doi: 10.1016/j.ymeth.2022.04.002. Epub 2022 Apr 15.
7
Enhancing Cancer Driver Gene Prediction by Protein-Protein Interaction Network.增强基于蛋白质-蛋白质相互作用网络的癌症驱动基因预测
IEEE/ACM Trans Comput Biol Bioinform. 2022 Jul-Aug;19(4):2231-2240. doi: 10.1109/TCBB.2021.3063532. Epub 2022 Aug 8.
8
A novel network control model for identifying personalized driver genes in cancer.一种用于鉴定癌症中个性化驱动基因的新型网络调控模型。
PLoS Comput Biol. 2019 Nov 25;15(11):e1007520. doi: 10.1371/journal.pcbi.1007520. eCollection 2019 Nov.
9
Machine Learning Classification and Structure-Functional Analysis of Cancer Mutations Reveal Unique Dynamic and Network Signatures of Driver Sites in Oncogenes and Tumor Suppressor Genes.机器学习分类和癌症突变的结构-功能分析揭示了癌基因和肿瘤抑制基因中驱动位点的独特动态和网络特征。
J Chem Inf Model. 2018 Oct 22;58(10):2131-2150. doi: 10.1021/acs.jcim.8b00414. Epub 2018 Oct 3.
10
Identifying cancer driver genes based on multi-view heterogeneous graph convolutional network and self-attention mechanism.基于多视图异质图卷积网络和自注意力机制的癌症驱动基因识别。
BMC Bioinformatics. 2023 Jan 13;24(1):16. doi: 10.1186/s12859-023-05140-3.

引用本文的文献

1
Protein Sequence Analysis landscape: A Systematic Review of Task Types, Databases, Datasets, Word Embeddings Methods, and Language Models.蛋白质序列分析全景:任务类型、数据库、数据集、词嵌入方法和语言模型的系统综述
Database (Oxford). 2025 May 30;2025. doi: 10.1093/database/baaf027.
2
DNA sequence analysis landscape: a comprehensive review of DNA sequence analysis task types, databases, datasets, word embedding methods, and language models.DNA序列分析全景:对DNA序列分析任务类型、数据库、数据集、词嵌入方法和语言模型的全面综述。
Front Med (Lausanne). 2025 Apr 8;12:1503229. doi: 10.3389/fmed.2025.1503229. eCollection 2025.
3

本文引用的文献

1
KEGG for taxonomy-based analysis of pathways and genomes.KEGG 用于基于分类的途径和基因组分析。
Nucleic Acids Res. 2023 Jan 6;51(D1):D587-D592. doi: 10.1093/nar/gkac963.
2
Two-stage-vote ensemble framework based on integration of mutation data and gene interaction network for uncovering driver genes.基于突变数据和基因互作网络集成的两阶段投票集成框架,用于发现驱动基因。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab429.
3
A compendium of mutational cancer driver genes.癌症驱动基因突变综合分析
NetREm: Network Regression Embeddings reveal cell-type transcription factor coordination for gene regulation.
NetREm:网络回归嵌入揭示细胞类型转录因子在基因调控中的协同作用。
Bioinform Adv. 2024 Dec 20;5(1):vbae206. doi: 10.1093/bioadv/vbae206. eCollection 2025.
4
Transitioning from wet lab to artificial intelligence: a systematic review of AI predictors in CRISPR.从湿实验室到人工智能的转变:对CRISPR中人工智能预测因子的系统综述
J Transl Med. 2025 Feb 4;23(1):153. doi: 10.1186/s12967-024-06013-w.
5
Enhancing Molecular Network-Based Cancer Driver Gene Prediction Using Machine Learning Approaches: Current Challenges and Opportunities.使用机器学习方法增强基于分子网络的癌症驱动基因预测:当前挑战与机遇
J Cell Mol Med. 2025 Jan;29(1):e70351. doi: 10.1111/jcmm.70351.
Nat Rev Cancer. 2020 Oct;20(10):555-572. doi: 10.1038/s41568-020-0290-x. Epub 2020 Aug 10.
4
Prediction of cancer driver genes through network-based moment propagation of mutation scores.通过基于网络的突变分数矩传播预测癌症驱动基因。
Bioinformatics. 2020 Jul 1;36(Suppl_1):i508-i515. doi: 10.1093/bioinformatics/btaa452.
5
Multi-omics Data Integration, Interpretation, and Its Application.多组学数据整合、解读及其应用
Bioinform Biol Insights. 2020 Jan 31;14:1177932219899051. doi: 10.1177/1177932219899051. eCollection 2020.
6
Pan-cancer analysis of whole genomes.泛癌症全基因组分析。
Nature. 2020 Feb;578(7793):82-93. doi: 10.1038/s41586-020-1969-6. Epub 2020 Feb 5.
7
Inferring subgroup-specific driver genes from heterogeneous cancer samples via subspace learning with subgroup indication.基于亚组指示的子空间学习推断异质癌症样本中的亚组特异性驱动基因。
Bioinformatics. 2020 Mar 1;36(6):1855-1863. doi: 10.1093/bioinformatics/btz793.
8
Toward understanding the origin and evolution of cellular organisms.为了理解细胞生物的起源和进化。
Protein Sci. 2019 Nov;28(11):1947-1951. doi: 10.1002/pro.3715. Epub 2019 Sep 9.
9
CancerMine: a literature-mined resource for drivers, oncogenes and tumor suppressors in cancer.癌症基因库:一个从文献中挖掘出的癌症相关驱动基因、致癌基因和抑癌基因的资源。
Nat Methods. 2019 Jun;16(6):505-507. doi: 10.1038/s41592-019-0422-y. Epub 2019 May 20.
10
Activin A Signaling Regulates IL13Rα2 Expression to Promote Breast Cancer Metastasis.激活素A信号传导调节IL13Rα2表达以促进乳腺癌转移。
Front Oncol. 2019 Feb 5;9:32. doi: 10.3389/fonc.2019.00032. eCollection 2019.