• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

改进现有的分析流程,利用多组学数据识别和分析癌症驱动基因。

Improving existing analysis pipeline to identify and analyze cancer driver genes using multi-omics data.

机构信息

Department of Computational Biomedicine, Vingroup Big Data Institute, Hanoi, Vietnam.

Faculty of Pharmacy, Dainam University, Hanoi, Vietnam.

出版信息

Sci Rep. 2020 Nov 25;10(1):20521. doi: 10.1038/s41598-020-77318-1.

DOI:10.1038/s41598-020-77318-1
PMID:33239644
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7688645/
Abstract

The cumulative of genes carrying mutations is vital for the establishment and development of cancer. However, this driver gene exploring research line has selected and used types of tools and models of analysis unsystematically and discretely. Also, the previous studies may have neglected low-frequency drivers and seldom predicted subgroup specificities of identified driver genes. In this study, we presented an improved driver gene identification and analysis pipeline that comprises the four most widely focused analyses for driver genes: enrichment analysis, clinical feature association with expression profiles of identified driver genes as well as with their functional modules, and patient stratification by existing advanced computational tools integrating multi-omics data. The improved pipeline's general usability was demonstrated straightforwardly for breast cancer, validated by some independent databases. Accordingly, 31 validated driver genes, including four novel ones, were discovered. Subsequently, we detected cancer-related significantly enriched gene ontology terms and pathways, probable drug targets, two co-expressed modules associated significantly with several clinical features, such as number of positive lymph nodes, Nottingham prognostic index, and tumor stage, and two biologically distinct groups of BRCA patients. Data and source code of the case study can be downloaded at https://github.com/hauldhut/drivergene .

摘要

基因突变的累积对于癌症的发生和发展至关重要。然而,这条驱动基因的研究路线在选择和使用各种工具和分析模型时缺乏系统性和连贯性。此外,以前的研究可能忽略了低频驱动基因,并且很少预测已识别的驱动基因的亚组特异性。在这项研究中,我们提出了一种改进的驱动基因识别和分析管道,该管道包括最广泛关注的四种驱动基因分析:富集分析、与识别的驱动基因的表达谱以及其功能模块的临床特征关联,以及利用现有的先进计算工具整合多组学数据对患者进行分层。改进的管道的通用性在乳腺癌中得到了直接验证,并通过一些独立的数据库进行了验证。因此,发现了 31 个经过验证的驱动基因,包括 4 个新的驱动基因。随后,我们检测到了与癌症相关的显著富集的基因本体论术语和途径、可能的药物靶点、两个与多个临床特征(如阳性淋巴结数量、诺丁汉预后指数和肿瘤分期)显著相关的共表达模块,以及两个具有明显生物学差异的 BRCA 患者群体。案例研究的数据和源代码可在 https://github.com/hauldhut/drivergene 上下载。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b250/7688645/c634a0c1e436/41598_2020_77318_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b250/7688645/f6eca09a72e6/41598_2020_77318_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b250/7688645/785ed3399b46/41598_2020_77318_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b250/7688645/6ba800b1ea93/41598_2020_77318_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b250/7688645/de1179283644/41598_2020_77318_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b250/7688645/c634a0c1e436/41598_2020_77318_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b250/7688645/f6eca09a72e6/41598_2020_77318_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b250/7688645/785ed3399b46/41598_2020_77318_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b250/7688645/6ba800b1ea93/41598_2020_77318_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b250/7688645/de1179283644/41598_2020_77318_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b250/7688645/c634a0c1e436/41598_2020_77318_Fig5_HTML.jpg

相似文献

1
Improving existing analysis pipeline to identify and analyze cancer driver genes using multi-omics data.改进现有的分析流程,利用多组学数据识别和分析癌症驱动基因。
Sci Rep. 2020 Nov 25;10(1):20521. doi: 10.1038/s41598-020-77318-1.
2
Identification of candidate cancer drivers by integrative Epi-DNA and Gene Expression (iEDGE) data analysis.通过整合表观遗传学 DNA 和基因表达(iEDGE)数据分析鉴定候选癌症驱动基因。
Sci Rep. 2019 Nov 15;9(1):16904. doi: 10.1038/s41598-019-52886-z.
3
Cancer driver gene discovery through an integrative genomics approach in a non-parametric Bayesian framework.在非参数贝叶斯框架下通过综合基因组学方法发现癌症驱动基因。
Bioinformatics. 2017 Feb 15;33(4):483-490. doi: 10.1093/bioinformatics/btw662.
4
DGMP: Identifying Cancer Driver Genes by Jointing DGCN and MLP from Multi-omics Genomic Data.DGMP:通过结合多组学基因组数据的 DGCN 和 MLP 识别癌症驱动基因。
Genomics Proteomics Bioinformatics. 2022 Oct;20(5):928-938. doi: 10.1016/j.gpb.2022.11.004. Epub 2022 Dec 1.
5
Integration of somatic mutation, expression and functional data reveals potential driver genes predictive of breast cancer survival.体细胞突变、表达和功能数据的整合揭示了预测乳腺癌生存的潜在驱动基因。
Bioinformatics. 2015 Aug 15;31(16):2607-13. doi: 10.1093/bioinformatics/btv164. Epub 2015 Mar 24.
6
The Integrative Method Based on the Module-Network for Identifying Driver Genes in Cancer Subtypes.基于模块网络的癌症亚型驱动基因识别的综合方法。
Molecules. 2018 Jan 24;23(2):183. doi: 10.3390/molecules23020183.
7
A novel network control model for identifying personalized driver genes in cancer.一种用于鉴定癌症中个性化驱动基因的新型网络调控模型。
PLoS Comput Biol. 2019 Nov 25;15(11):e1007520. doi: 10.1371/journal.pcbi.1007520. eCollection 2019 Nov.
8
Identifying core gene modules in glioblastoma based on multilayer factor-mediated dysfunctional regulatory networks through integrating multi-dimensional genomic data.通过整合多维度基因组数据,基于多层因子介导的功能失调调控网络识别胶质母细胞瘤中的核心基因模块。
Nucleic Acids Res. 2015 Feb 27;43(4):1997-2007. doi: 10.1093/nar/gkv074. Epub 2015 Feb 4.
9
IMI-driver: Integrating multi-level gene networks and multi-omics for cancer driver gene identification.IMI-driver:综合多层次基因网络和多组学数据进行癌症驱动基因识别。
PLoS Comput Biol. 2024 Aug 26;20(8):e1012389. doi: 10.1371/journal.pcbi.1012389. eCollection 2024 Aug.
10
Simultaneous Integration of Multi-omics Data Improves the Identification of Cancer Driver Modules.多组学数据的同步整合提高了癌症驱动模块的识别能力。
Cell Syst. 2019 May 22;8(5):456-466.e5. doi: 10.1016/j.cels.2019.04.005. Epub 2019 May 15.

引用本文的文献

1
DriverDetector: An R package providing multiple statistical methods for cancer driver genes detection and tools for downstream analysis.DriverDetector:一个R软件包,提供用于癌症驱动基因检测的多种统计方法以及下游分析工具。
Heliyon. 2024 Jul 1;10(14):e33582. doi: 10.1016/j.heliyon.2024.e33582. eCollection 2024 Jul 30.
2
Multiomics analysis reveals metabolic subtypes and identifies diacylglycerol kinase α (DGKA) as a potential therapeutic target for intrahepatic cholangiocarcinoma.多组学分析揭示代谢亚型并确定二酰甘油激酶α(DGKA)作为肝内胆管癌的潜在治疗靶点。
Cancer Commun (Lond). 2024 Feb;44(2):226-250. doi: 10.1002/cac2.12513. Epub 2023 Dec 24.
3

本文引用的文献

1
Multi-Omics Analysis Detects Novel Prognostic Subgroups of Breast Cancer.多组学分析检测出乳腺癌新的预后亚组。
Front Genet. 2020 Oct 15;11:574661. doi: 10.3389/fgene.2020.574661. eCollection 2020.
2
Bayesian structural equation modeling in multiple omics data with application to circadian genes.贝叶斯结构方程模型在多组学数据中的应用及在生物钟基因中的应用。
Bioinformatics. 2020 Jul 1;36(13):3951-3958. doi: 10.1093/bioinformatics/btaa286.
3
Integrative Analysis of MicroRNA and Gene Interactions for Revealing Candidate Signatures in Prostate Cancer.
Uncovering Oncogenic Mechanisms of Tumor Suppressor Genes in Breast Cancer Multi-Omics Data.
揭示乳腺癌多组学数据中肿瘤抑制基因的致癌机制。
Int J Mol Sci. 2022 Aug 25;23(17):9624. doi: 10.3390/ijms23179624.
4
Orthologs of human circulating miRNAs associated with hepatocellular carcinoma are elevated in mouse plasma months before tumour detection.与肝细胞癌相关的人类循环 miRNA 的同源物在肿瘤检测前数月就在小鼠血浆中升高。
Sci Rep. 2022 Jun 28;12(1):10927. doi: 10.1038/s41598-022-15061-5.
5
Comprehensive Analysis of the Expression and Prognosis for the DREAM Complex in Human Cancers.人类癌症中DREAM复合体的表达与预后综合分析
Front Genet. 2022 May 12;13:814725. doi: 10.3389/fgene.2022.814725. eCollection 2022.
6
DrGA: cancer driver gene analysis in a simpler manner.DrGA:更简单的癌症驱动基因分析。
BMC Bioinformatics. 2022 Mar 5;23(1):86. doi: 10.1186/s12859-022-04606-0.
7
Identification and Validation of a Novel Three Hub Long Noncoding RNAs With m6A Modification Signature in Low-Grade Gliomas.低级别胶质瘤中具有m6A修饰特征的新型三个枢纽长链非编码RNA的鉴定与验证
Front Mol Biosci. 2022 Feb 14;9:801931. doi: 10.3389/fmolb.2022.801931. eCollection 2022.
8
oCEM: Automatic detection and analysis of overlapping co-expressed gene modules.OCEM:重叠共表达基因模块的自动检测和分析。
BMC Genomics. 2022 Jan 8;23(1):39. doi: 10.1186/s12864-021-08072-5.
9
SOCS3 is Related to Cell Proliferation in Neuronal Tissue: An Integrated Analysis of Bioinformatics and Experiments.信号转导与转录激活因子3(SOCS3)与神经组织中的细胞增殖相关:生物信息学与实验的综合分析
Front Genet. 2021 Sep 27;12:743786. doi: 10.3389/fgene.2021.743786. eCollection 2021.
10
A Multimodal Affinity Fusion Network for Predicting the Survival of Breast Cancer Patients.用于预测乳腺癌患者生存情况的多模态亲和力融合网络。
Front Genet. 2021 Aug 20;12:709027. doi: 10.3389/fgene.2021.709027. eCollection 2021.
整合分析微小RNA与基因相互作用以揭示前列腺癌的候选特征
Front Genet. 2020 Feb 27;11:176. doi: 10.3389/fgene.2020.00176. eCollection 2020.
4
Integrative analysis of highly mutated genes in hepatitis B virus-related hepatic carcinoma.乙型肝炎病毒相关肝癌中高度突变基因的综合分析
Cancer Med. 2020 Apr;9(7):2462-2479. doi: 10.1002/cam4.2903. Epub 2020 Feb 4.
5
Robust and interpretable PAM50 reclassification exhibits survival advantage for myoepithelial and immune phenotypes.强大且可解释的PAM50重新分类对肌上皮和免疫表型具有生存优势。
NPJ Breast Cancer. 2019 Sep 9;5:30. doi: 10.1038/s41523-019-0124-8. eCollection 2019.
6
Multiple Omics Data Integration to Identify Long Noncoding RNA Responsible for Breast Cancer-Related Mortality.整合多组学数据以鉴定与乳腺癌相关死亡率相关的长链非编码RNA
Cancer Inform. 2019 Aug 24;18:1176935119871933. doi: 10.1177/1176935119871933. eCollection 2019.
7
Integrated bioinformatics analysis to identify 15 hub genes in breast cancer.综合生物信息学分析以鉴定乳腺癌中的15个关键基因。
Oncol Lett. 2019 Aug;18(2):1023-1034. doi: 10.3892/ol.2019.10411. Epub 2019 May 30.
8
Co-Expression Network Analysis Identified Gene Signatures in Osteosarcoma as a Predictive Tool for Lung Metastasis and Survival.共表达网络分析确定骨肉瘤中的基因特征作为肺转移和生存的预测工具。
J Cancer. 2019 Jun 9;10(16):3706-3716. doi: 10.7150/jca.32092. eCollection 2019.
9
OncodriveCLUSTL: a sequence-based clustering method to identify cancer drivers.OncodriveCLUSTL:一种基于序列的聚类方法,用于识别癌症驱动基因。
Bioinformatics. 2019 Nov 1;35(22):4788-4790. doi: 10.1093/bioinformatics/btz501.
10
Integrative Analysis of Genomic and Clinical Data Reveals Intrinsic Characteristics of Bladder Urothelial Carcinoma Progression.整合基因组和临床数据的分析揭示了膀胱癌进展的内在特征。
Genes (Basel). 2019 Jun 17;10(6):464. doi: 10.3390/genes10060464.