• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多组学特征对机器学习算法驱动基因鉴定的影响。

Effects of Multi-Omics Characteristics on Identification of Driver Genes Using Machine Learning Algorithms.

机构信息

School of Computer Science, Qufu Normal University, Rizhao 276826, China.

出版信息

Genes (Basel). 2022 Apr 19;13(5):716. doi: 10.3390/genes13050716.

DOI:10.3390/genes13050716
PMID:35627101
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9141966/
Abstract

Cancer is a complex disease caused by genomic and epigenetic alterations; hence, identifying meaningful cancer drivers is an important and challenging task. Most studies have detected cancer drivers with mutated traits, while few studies consider multiple omics characteristics as important factors. In this study, we present a framework to analyze the effects of multi-omics characteristics on the identification of driver genes. We utilize four machine learning algorithms within this framework to detect cancer driver genes in pan-cancer data, including 75 characteristics among 19,636 genes. The 75 features are divided into four types and analyzed using Kullback-Leibler divergence based on CGC genes and non-CGC genes. We detect cancer driver genes in two different ways. One is to detect driver genes from a single feature type, while the other is from the top N features. The first analysis denotes that the mutational features are the best characteristics. The second analysis reveals that the top 45 features are the most effective feature combinations and superior to the mutational features. The top 45 features not only contain mutational features but also three other types of features. Therefore, our study extends the detection of cancer driver genes and provides a more comprehensive understanding of cancer mechanisms.

摘要

癌症是一种由基因组和表观遗传改变引起的复杂疾病;因此,鉴定有意义的癌症驱动基因是一项重要且具有挑战性的任务。大多数研究都检测到了具有突变特征的癌症驱动基因,而少数研究则考虑了多种组学特征作为重要因素。在本研究中,我们提出了一种分析多组学特征对驱动基因识别影响的框架。我们利用该框架中的四种机器学习算法,在泛癌症数据中检测了包括 19636 个基因中的 75 个特征在内的癌症驱动基因。这 75 个特征分为四种类型,并基于 CGC 基因和非 CGC 基因使用基于 Kullback-Leibler 散度的方法进行分析。我们通过两种不同的方法来检测癌症驱动基因。一种是从单一特征类型中检测驱动基因,另一种是从前 N 个特征中检测。第一个分析表明,突变特征是最好的特征。第二个分析表明,前 45 个特征是最有效的特征组合,优于突变特征。前 45 个特征不仅包含突变特征,还包含另外三种类型的特征。因此,我们的研究扩展了癌症驱动基因的检测,并提供了对癌症机制的更全面理解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/851a/9141966/d9dd0c74229f/genes-13-00716-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/851a/9141966/297b115c7a9d/genes-13-00716-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/851a/9141966/8f3fe06e2774/genes-13-00716-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/851a/9141966/5fb9cdc67ade/genes-13-00716-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/851a/9141966/f8fdd0e228cf/genes-13-00716-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/851a/9141966/28fdbd527764/genes-13-00716-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/851a/9141966/b8f0e625b648/genes-13-00716-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/851a/9141966/d9dd0c74229f/genes-13-00716-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/851a/9141966/297b115c7a9d/genes-13-00716-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/851a/9141966/8f3fe06e2774/genes-13-00716-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/851a/9141966/5fb9cdc67ade/genes-13-00716-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/851a/9141966/f8fdd0e228cf/genes-13-00716-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/851a/9141966/28fdbd527764/genes-13-00716-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/851a/9141966/b8f0e625b648/genes-13-00716-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/851a/9141966/d9dd0c74229f/genes-13-00716-g007.jpg

相似文献

1
Effects of Multi-Omics Characteristics on Identification of Driver Genes Using Machine Learning Algorithms.多组学特征对机器学习算法驱动基因鉴定的影响。
Genes (Basel). 2022 Apr 19;13(5):716. doi: 10.3390/genes13050716.
2
InDEP: an interpretable machine learning approach to predict cancer driver genes from multi-omics data.InDEP:一种从多组学数据预测癌症驱动基因的可解释机器学习方法。
Brief Bioinform. 2023 Sep 20;24(5). doi: 10.1093/bib/bbad318.
3
Machine learning methods for prediction of cancer driver genes: a survey paper.机器学习方法在癌症驱动基因预测中的应用:综述论文。
Brief Bioinform. 2022 May 13;23(3). doi: 10.1093/bib/bbac062.
4
Machine Learning Classification and Structure-Functional Analysis of Cancer Mutations Reveal Unique Dynamic and Network Signatures of Driver Sites in Oncogenes and Tumor Suppressor Genes.机器学习分类和癌症突变的结构-功能分析揭示了癌基因和肿瘤抑制基因中驱动位点的独特动态和网络特征。
J Chem Inf Model. 2018 Oct 22;58(10):2131-2150. doi: 10.1021/acs.jcim.8b00414. Epub 2018 Oct 3.
5
Identifying cancer driver genes based on multi-view heterogeneous graph convolutional network and self-attention mechanism.基于多视图异质图卷积网络和自注意力机制的癌症驱动基因识别。
BMC Bioinformatics. 2023 Jan 13;24(1):16. doi: 10.1186/s12859-023-05140-3.
6
LOTUS: A single- and multitask machine learning algorithm for the prediction of cancer driver genes.LOTUS:一种用于癌症驱动基因预测的单任务和多任务机器学习算法。
PLoS Comput Biol. 2019 Sep 30;15(9):e1007381. doi: 10.1371/journal.pcbi.1007381. eCollection 2019 Sep.
7
DGMP: Identifying Cancer Driver Genes by Jointing DGCN and MLP from Multi-omics Genomic Data.DGMP:通过结合多组学基因组数据的 DGCN 和 MLP 识别癌症驱动基因。
Genomics Proteomics Bioinformatics. 2022 Oct;20(5):928-938. doi: 10.1016/j.gpb.2022.11.004. Epub 2022 Dec 1.
8
Multi-Omic Data Improve Prediction of Personalized Tumor Suppressors and Oncogenes.多组学数据改善个性化肿瘤抑制基因和癌基因的预测。
Front Genet. 2022 May 10;13:854190. doi: 10.3389/fgene.2022.854190. eCollection 2022.
9
Evaluating machine learning methodologies for identification of cancer driver genes.评估用于识别癌症驱动基因的机器学习方法。
Sci Rep. 2021 Jun 10;11(1):12281. doi: 10.1038/s41598-021-91656-8.
10
Integration of multiple networks and pathways identifies cancer driver genes in pan-cancer analysis.多网络和通路的整合分析鉴定了泛癌分析中的癌症驱动基因。
BMC Genomics. 2018 Jan 6;19(1):25. doi: 10.1186/s12864-017-4423-x.

引用本文的文献

1
Lactate-related gene signatures as prognostic predictors and comprehensive analysis of immune profiles in nasopharyngeal carcinoma.乳酸相关基因特征作为鼻咽癌预后预测指标及免疫图谱的综合分析
J Transl Med. 2024 Dec 20;22(1):1116. doi: 10.1186/s12967-024-05935-9.
2
IMI-driver: Integrating multi-level gene networks and multi-omics for cancer driver gene identification.IMI-driver:综合多层次基因网络和多组学数据进行癌症驱动基因识别。
PLoS Comput Biol. 2024 Aug 26;20(8):e1012389. doi: 10.1371/journal.pcbi.1012389. eCollection 2024 Aug.

本文引用的文献

1
Identifying Cancer Drivers Using DRIVE: A Feature-Based Machine Learning Model for a Pan-Cancer Assessment of Somatic Missense Mutations.使用DRIVE识别癌症驱动因素:一种基于特征的机器学习模型用于体细胞错义突变的泛癌评估。
Cancers (Basel). 2021 Jun 3;13(11):2779. doi: 10.3390/cancers13112779.
2
A rat epigenetic clock recapitulates phenotypic aging and co-localizes with heterochromatin.大鼠表观遗传钟再现表型衰老,并与异染色质共定位。
Elife. 2020 Nov 12;9:e59201. doi: 10.7554/eLife.59201.
3
DORGE: Discovery of Oncogenes and tumoR suppressor genes using Genetic and Epigenetic features.
利用遗传和表观遗传特征发现癌基因和肿瘤抑制基因。
Sci Adv. 2020 Nov 11;6(46). doi: 10.1126/sciadv.aba6784. Print 2020 Nov.
4
Somatic mutation distributions in cancer genomes vary with three-dimensional chromatin structure.体细胞突变在肿瘤基因组中的分布与三维染色质结构有关。
Nat Genet. 2020 Nov;52(11):1178-1188. doi: 10.1038/s41588-020-0708-0. Epub 2020 Oct 5.
5
Identifying Molecular Biomarkers for Diseases With Machine Learning Based on Integrative Omics.基于组学整合的机器学习识别疾病的分子生物标志物
IEEE/ACM Trans Comput Biol Bioinform. 2021 Nov-Dec;18(6):2514-2525. doi: 10.1109/TCBB.2020.2986387. Epub 2021 Dec 8.
6
Inferring subgroup-specific driver genes from heterogeneous cancer samples via subspace learning with subgroup indication.基于亚组指示的子空间学习推断异质癌症样本中的亚组特异性驱动基因。
Bioinformatics. 2020 Mar 1;36(6):1855-1863. doi: 10.1093/bioinformatics/btz793.
7
XGBoost Model for Chronic Kidney Disease Diagnosis.XGBoost 模型用于慢性肾脏病诊断。
IEEE/ACM Trans Comput Biol Bioinform. 2020 Nov-Dec;17(6):2131-2140. doi: 10.1109/TCBB.2019.2911071. Epub 2020 Dec 8.
8
Activin A Signaling Regulates IL13Rα2 Expression to Promote Breast Cancer Metastasis.激活素A信号传导调节IL13Rα2表达以促进乳腺癌转移。
Front Oncol. 2019 Feb 5;9:32. doi: 10.3389/fonc.2019.00032. eCollection 2019.
9
NCBoost classifies pathogenic non-coding variants in Mendelian diseases through supervised learning on purifying selection signals in humans.NCBoost 通过在人类中对净化选择信号进行监督学习,对孟德尔疾病中的致病性非编码变体进行分类。
Genome Biol. 2019 Feb 11;20(1):32. doi: 10.1186/s13059-019-1634-2.
10
MiR-326/Sp1/KLF3: A novel regulatory axis in lung cancer progression.miR-326/Sp1/KLF3:肺癌进展中的新调控轴。
Cell Prolif. 2019 Mar;52(2):e12551. doi: 10.1111/cpr.12551. Epub 2018 Nov 28.