• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的云平台多组学数据分析用于研究基因调控

Machine learning-based analysis of multi-omics data on the cloud for investigating gene regulations.

作者信息

Oh Minsik, Park Sungjoon, Kim Sun, Chae Heejoon

机构信息

Department of Computer Science and Engineering, Seoul National University, Seoul, 08826, Korea.

Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 08826, Korea.

出版信息

Brief Bioinform. 2021 Jan 18;22(1):66-76. doi: 10.1093/bib/bbaa032.

DOI:10.1093/bib/bbaa032
PMID:32227074
Abstract

Gene expressions are subtly regulated by quantifiable measures of genetic molecules such as interaction with other genes, methylation, mutations, transcription factor and histone modifications. Integrative analysis of multi-omics data can help scientists understand the condition or patient-specific gene regulation mechanisms. However, analysis of multi-omics data is challenging since it requires not only the analysis of multiple omics data sets but also mining complex relations among different genetic molecules by using state-of-the-art machine learning methods. In addition, analysis of multi-omics data needs quite large computing infrastructure. Moreover, interpretation of the analysis results requires collaboration among many scientists, often requiring reperforming analysis from different perspectives. Many of the aforementioned technical issues can be nicely handled when machine learning tools are deployed on the cloud. In this survey article, we first survey machine learning methods that can be used for gene regulation study, and we categorize them according to five different goals: gene regulatory subnetwork discovery, disease subtype analysis, survival analysis, clinical prediction and visualization. We also summarize the methods in terms of multi-omics input types. Then, we explain why the cloud is potentially a good solution for the analysis of multi-omics data, followed by a survey of two state-of-the-art cloud systems, Galaxy and BioVLAB. Finally, we discuss important issues when the cloud is used for the analysis of multi-omics data for the gene regulation study.

摘要

基因表达受到遗传分子可量化指标的精细调控,这些指标包括与其他基因的相互作用、甲基化、突变、转录因子和组蛋白修饰。多组学数据的综合分析有助于科学家了解特定病情或患者的基因调控机制。然而,多组学数据分析具有挑战性,因为它不仅需要分析多个组学数据集,还需要使用最先进的机器学习方法挖掘不同遗传分子之间的复杂关系。此外,多组学数据分析需要相当大的计算基础设施。而且,分析结果的解读需要众多科学家的合作,通常需要从不同角度重新进行分析。当机器学习工具部署在云端时,上述许多技术问题都能得到很好的解决。在这篇综述文章中,我们首先综述可用于基因调控研究的机器学习方法,并根据五个不同目标对其进行分类:基因调控子网发现、疾病亚型分析、生存分析、临床预测和可视化。我们还根据多组学输入类型对这些方法进行了总结。然后,我们解释了为什么云端可能是多组学数据分析的一个好解决方案,接着对两个最先进的云系统Galaxy和BioVLAB进行了综述。最后,我们讨论了将云端用于基因调控研究的多组学数据分析时的重要问题。

相似文献

1
Machine learning-based analysis of multi-omics data on the cloud for investigating gene regulations.基于机器学习的云平台多组学数据分析用于研究基因调控
Brief Bioinform. 2021 Jan 18;22(1):66-76. doi: 10.1093/bib/bbaa032.
2
BioVLAB-mCpG-SNP-EXPRESS: A system for multi-level and multi-perspective analysis and exploration of DNA methylation, sequence variation (SNPs), and gene expression from multi-omics data.BioVLAB-mCpG-SNP-EXPRESS:一个用于从多组学数据中对DNA甲基化、序列变异(单核苷酸多态性)和基因表达进行多层次、多视角分析与探索的系统。
Methods. 2016 Dec 1;111:64-71. doi: 10.1016/j.ymeth.2016.07.019. Epub 2016 Jul 28.
3
Transcriptomics and epigenetic data integration learning module on Google Cloud.转录组学和表观遗传学数据集成学习模块在谷歌云上。
Brief Bioinform. 2024 Jul 23;25(Supplement_1). doi: 10.1093/bib/bbae352.
4
Perspectives of using Cloud computing in integrative analysis of multi-omics data.云计算在多组学数据综合分析中的应用前景。
Brief Funct Genomics. 2021 Jul 17;20(4):198-206. doi: 10.1093/bfgp/elab007.
5
Serverless computing in omics data analysis and integration.无服务器计算在组学数据分析和整合中的应用。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab349.
6
Knowledge-guided analysis of "omics" data using the KnowEnG cloud platform.基于 KnowEnG 云平台的“组学”数据知识引导分析。
PLoS Biol. 2020 Jan 23;18(1):e3000583. doi: 10.1371/journal.pbio.3000583. eCollection 2020 Jan.
7
A cloud-based learning module for biomarker discovery.基于云的生物标志物发现学习模块。
Brief Bioinform. 2024 Jul 23;25(Supplement_1). doi: 10.1093/bib/bbae126.
8
A New Era of Neuro-Oncology Research Pioneered by Multi-Omics Analysis and Machine Learning.多组学分析和机器学习开创神经肿瘤学研究新纪元。
Biomolecules. 2021 Apr 12;11(4):565. doi: 10.3390/biom11040565.
9
Survey and comparative assessments of computational multi-omics integrative methods with multiple regulatory networks identifying distinct tumor compositions across pan-cancer data sets.对具有多个调控网络的计算多组学综合方法进行调查和比较评估,以识别泛癌数据集之间不同的肿瘤组成。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa102.
10
Cloud Computing Enabled Big Multi-Omics Data Analytics.基于云计算的大型多组学数据分析
Bioinform Biol Insights. 2021 Jul 28;15:11779322211035921. doi: 10.1177/11779322211035921. eCollection 2021.

引用本文的文献

1
Identification of novel molecular subtypes and construction of a prognostic signature via multi-omics analysis and machine learning in lung adenocarcinoma.通过多组学分析和机器学习在肺腺癌中鉴定新的分子亚型并构建预后特征
Front Oncol. 2025 Jul 21;15:1590216. doi: 10.3389/fonc.2025.1590216. eCollection 2025.
2
Leveraging the integration of bioinformatics and machine learning to uncover common biomarkers and molecular pathways underlying diabetes and nephrolithiasis.利用生物信息学与机器学习的整合来揭示糖尿病和肾结石潜在的共同生物标志物及分子通路。
Front Immunol. 2025 Jul 11;16:1574157. doi: 10.3389/fimmu.2025.1574157. eCollection 2025.
3
Omics Sciences in Regular Physical Activity.
常规体育活动中的组学科学
Int J Mol Sci. 2025 Jun 10;26(12):5529. doi: 10.3390/ijms26125529.
4
Multi-omics and single-cell approaches reveal molecular subtypes and key cell interactions in hepatocellular carcinoma.多组学和单细胞方法揭示了肝细胞癌的分子亚型和关键细胞相互作用。
Front Pharmacol. 2025 May 22;16:1605162. doi: 10.3389/fphar.2025.1605162. eCollection 2025.
5
Combining multi-omics analysis with machine learning to uncover novel molecular subtypes, prognostic markers, and insights into immunotherapy for melanoma.将多组学分析与机器学习相结合,以揭示黑色素瘤的新型分子亚型、预后标志物以及免疫治疗相关见解。
BMC Cancer. 2025 Apr 7;25(1):630. doi: 10.1186/s12885-025-14012-3.
6
Identification of prognostic subtypes and the role of FXYD6 in ovarian cancer through multi-omics clustering.通过多组学聚类鉴定卵巢癌的预后亚型及FXYD6的作用
Front Immunol. 2025 Mar 18;16:1556715. doi: 10.3389/fimmu.2025.1556715. eCollection 2025.
7
Advancing lung adenocarcinoma prognosis and immunotherapy prediction with a multi-omics consensus machine learning approach.采用多组学生物学共识机器学习方法改善肺腺癌预后和免疫治疗预测。
J Cell Mol Med. 2024 Jul;28(13):e18520. doi: 10.1111/jcmm.18520.
8
Integrated multi-omics analysis and machine learning to refine molecular subtypes, prognosis, and immunotherapy in lung adenocarcinoma.整合多组学分析和机器学习以细化肺腺癌的分子亚型、预后和免疫治疗。
Funct Integr Genomics. 2024 Jun 27;24(4):118. doi: 10.1007/s10142-024-01388-x.
9
Unveiling divergent treatment prognoses in IDHwt-GBM subtypes through multiomics clustering: a swift dual MRI-mRNA model for precise subtype prediction.通过多组学聚类揭示 IDHwt-GBM 亚型中的不同治疗预后:一种快速的双重 MRI-mRNA 模型,用于精准的亚型预测。
J Transl Med. 2024 Jun 18;22(1):578. doi: 10.1186/s12967-024-05401-6.
10
Differences in the microbiome of the small intestine of Leghorn lines divergently selected for antibody titer to sheep erythrocytes suggest roles for commensals in host humoral response.对绵羊红细胞抗体效价进行差异选择的来航鸡品系,其小肠微生物群的差异表明共生菌在宿主体液免疫反应中发挥作用。
Front Physiol. 2024 Jan 8;14:1304051. doi: 10.3389/fphys.2023.1304051. eCollection 2023.