• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

启发式支持的主动机器学习:以预测黑腹果蝇必需发育阶段和免疫反应基因为例的研究。

Heuristic-enabled active machine learning: A case study of predicting essential developmental stage and immune response genes in Drosophila melanogaster.

机构信息

Department of Computer & Information Sciences, Covenant University, Ota, Ogun State, Nigeria.

Covenant University Bioinformatics Research (CUBRe), Covenant University, Ota, Ogun State, Nigeria.

出版信息

PLoS One. 2023 Aug 9;18(8):e0288023. doi: 10.1371/journal.pone.0288023. eCollection 2023.

DOI:10.1371/journal.pone.0288023
PMID:37556452
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10411809/
Abstract

Computational prediction of absolute essential genes using machine learning has gained wide attention in recent years. However, essential genes are mostly conditional and not absolute. Experimental techniques provide a reliable approach of identifying conditionally essential genes; however, experimental methods are laborious, time and resource consuming, hence computational techniques have been used to complement the experimental methods. Computational techniques such as supervised machine learning, or flux balance analysis are grossly limited due to the unavailability of required data for training the model or simulating the conditions for gene essentiality. This study developed a heuristic-enabled active machine learning method based on a light gradient boosting model to predict essential immune response and embryonic developmental genes in Drosophila melanogaster. We proposed a new sampling selection technique and introduced a heuristic function which replaces the human component in traditional active learning models. The heuristic function dynamically selects the unlabelled samples to improve the performance of the classifier in the next iteration. Testing the proposed model with four benchmark datasets, the proposed model showed superior performance when compared to traditional active learning models (random sampling and uncertainty sampling). Applying the model to identify conditionally essential genes, four novel essential immune response genes and a list of 48 novel genes that are essential in embryonic developmental condition were identified. We performed functional enrichment analysis of the predicted genes to elucidate their biological processes and the result evidence our predictions. Immune response and embryonic development related processes were significantly enriched in the essential immune response and embryonic developmental genes, respectively. Finally, we propose the predicted essential genes for future experimental studies and use of the developed tool accessible at http://heal.covenantuniversity.edu.ng for conditional essentiality predictions.

摘要

近年来,使用机器学习对绝对必需基因进行计算预测已经引起了广泛关注。然而,必需基因大多是有条件的,而不是绝对的。实验技术提供了一种识别条件必需基因的可靠方法;然而,实验方法繁琐、耗时且资源密集,因此计算技术已被用于补充实验方法。由于缺乏用于训练模型或模拟基因必需性条件的所需数据,监督机器学习或通量平衡分析等计算技术受到严重限制。本研究开发了一种基于轻梯度提升模型的启发式主动机器学习方法,用于预测果蝇中的必需免疫反应和胚胎发育基因。我们提出了一种新的采样选择技术,并引入了一个启发式函数,该函数取代了传统主动学习模型中的人为成分。启发式函数动态选择未标记的样本,以在下一次迭代中提高分类器的性能。使用四个基准数据集对提出的模型进行测试,与传统的主动学习模型(随机采样和不确定性采样)相比,提出的模型表现出更好的性能。将模型应用于识别条件必需基因,鉴定出四个新的必需免疫反应基因和一组在胚胎发育条件下必需的 48 个新基因。我们对预测基因进行了功能富集分析,以阐明它们的生物学过程,结果证明了我们的预测。在必需免疫反应和胚胎发育基因中,分别显著富集了免疫反应和胚胎发育相关过程。最后,我们提出了预测的必需基因,以供未来的实验研究使用,并在 http://heal.covenantuniversity.edu.ng 上提供了开发的工具,用于条件必需性预测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4462/10411809/df9db6fef0ad/pone.0288023.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4462/10411809/4c73121f056f/pone.0288023.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4462/10411809/e154639020b3/pone.0288023.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4462/10411809/c93296ffabce/pone.0288023.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4462/10411809/c7a9cab459a1/pone.0288023.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4462/10411809/b8148b97ea33/pone.0288023.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4462/10411809/df9db6fef0ad/pone.0288023.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4462/10411809/4c73121f056f/pone.0288023.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4462/10411809/e154639020b3/pone.0288023.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4462/10411809/c93296ffabce/pone.0288023.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4462/10411809/c7a9cab459a1/pone.0288023.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4462/10411809/b8148b97ea33/pone.0288023.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4462/10411809/df9db6fef0ad/pone.0288023.g006.jpg

相似文献

1
Heuristic-enabled active machine learning: A case study of predicting essential developmental stage and immune response genes in Drosophila melanogaster.启发式支持的主动机器学习:以预测黑腹果蝇必需发育阶段和免疫反应基因为例的研究。
PLoS One. 2023 Aug 9;18(8):e0288023. doi: 10.1371/journal.pone.0288023. eCollection 2023.
2
Machine learning approach to gene essentiality prediction: a review.机器学习在基因必需性预测中的应用:综述。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab128.
3
Identifying mouse developmental essential genes using machine learning.利用机器学习识别小鼠发育必需基因。
Dis Model Mech. 2018 Dec 13;11(12):dmm034546. doi: 10.1242/dmm.034546.
4
Cross-Predicting Essential Genes between Two Model Eukaryotic Species Using Machine Learning.使用机器学习在两种模式真核生物之间交叉预测必需基因。
Int J Mol Sci. 2021 May 11;22(10):5056. doi: 10.3390/ijms22105056.
5
Essential gene prediction using limited gene essentiality information-An integrative semi-supervised machine learning strategy.利用有限的基因必需性信息进行必需基因预测——一种综合的半监督机器学习策略。
PLoS One. 2020 Nov 30;15(11):e0242943. doi: 10.1371/journal.pone.0242943. eCollection 2020.
6
Harnessing model organism genomics to underpin the machine learning-based prediction of essential genes in eukaryotes - Biotechnological implications.利用模式生物基因组学来支持基于机器学习的真核生物必需基因预测——生物技术意义。
Biotechnol Adv. 2022 Jan-Feb;54:107822. doi: 10.1016/j.biotechadv.2021.107822. Epub 2021 Aug 27.
7
DeepHE: Accurately predicting human essential genes based on deep learning.DeepHE:基于深度学习的人类必需基因精准预测。
PLoS Comput Biol. 2020 Sep 16;16(9):e1008229. doi: 10.1371/journal.pcbi.1008229. eCollection 2020 Sep.
8
Correction: Heuristic-enabled active machine learning: A case study of predicting essential developmental stage and immune response genes in Drosophila melanogaster.更正:启用启发式的主动机器学习:以预测黑腹果蝇的关键发育阶段和免疫反应基因为例的研究。
PLoS One. 2024 Jun 18;19(6):e0305979. doi: 10.1371/journal.pone.0305979. eCollection 2024.
9
Ensemble machine learning model trained on a new synthesized dataset generalizes well for stress prediction using wearable devices.在新合成数据集上训练的集成机器学习模型,对于使用可穿戴设备进行压力预测具有良好的泛化能力。
J Biomed Inform. 2023 Dec;148:104556. doi: 10.1016/j.jbi.2023.104556. Epub 2023 Dec 2.
10
Predicting gene essentiality in by feature engineering and machine-learning.通过特征工程和机器学习预测基因必需性。 (你提供的原文“Predicting gene essentiality in by feature engineering and machine-learning.”似乎不完整,“in”后面缺少具体内容,但按照要求进行了现有内容的翻译。)
Comput Struct Biotechnol J. 2020 May 15;18:1093-1102. doi: 10.1016/j.csbj.2020.05.008. eCollection 2020.

引用本文的文献

1
A hybrid machine learning model with attention mechanism and multidimensional multivariate feature coding for essential gene prediction.一种具有注意力机制和多维多变量特征编码的混合机器学习模型用于必需基因预测。
BMC Biol. 2025 Apr 24;23(1):108. doi: 10.1186/s12915-025-02209-8.
2
An optimized deep-forest algorithm using a modified differential evolution optimization algorithm: A case of host-pathogen protein-protein interaction prediction.一种使用改进差分进化优化算法的优化深度森林算法:宿主-病原体蛋白质-蛋白质相互作用预测实例
Comput Struct Biotechnol J. 2025 Jan 26;27:595-611. doi: 10.1016/j.csbj.2025.01.020. eCollection 2025.
3

本文引用的文献

1
Exploring conditional gene essentiality through systems genetics approaches in yeast.通过酵母中的系统遗传学方法探索条件基因的必需性。
Curr Opin Genet Dev. 2022 Oct;76:101963. doi: 10.1016/j.gde.2022.101963. Epub 2022 Aug 5.
2
Predicting host dependency factors of pathogens in using machine learning.利用机器学习预测病原体的宿主依赖性因子。
Comput Struct Biotechnol J. 2021 Aug 9;19:4581-4592. doi: 10.1016/j.csbj.2021.08.010. eCollection 2021.
3
Harnessing model organism genomics to underpin the machine learning-based prediction of essential genes in eukaryotes - Biotechnological implications.
Combination of computational techniques and RNAi reveal targets in Anopheles gambiae for malaria vector control.
计算技术与 RNAi 的结合揭示了疟蚊控制的靶标
PLoS One. 2024 Jul 5;19(7):e0305207. doi: 10.1371/journal.pone.0305207. eCollection 2024.
4
Correction: Heuristic-enabled active machine learning: A case study of predicting essential developmental stage and immune response genes in Drosophila melanogaster.更正:启用启发式的主动机器学习:以预测黑腹果蝇的关键发育阶段和免疫反应基因为例的研究。
PLoS One. 2024 Jun 18;19(6):e0305979. doi: 10.1371/journal.pone.0305979. eCollection 2024.
利用模式生物基因组学来支持基于机器学习的真核生物必需基因预测——生物技术意义。
Biotechnol Adv. 2022 Jan-Feb;54:107822. doi: 10.1016/j.biotechadv.2021.107822. Epub 2021 Aug 27.
4
Environmental robustness of the global yeast genetic interaction network.全球酵母基因相互作用网络的环境稳健性。
Science. 2021 May 7;372(6542). doi: 10.1126/science.abf8424.
5
Machine learning approach to gene essentiality prediction: a review.机器学习在基因必需性预测中的应用:综述。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab128.
6
Combined use of feature engineering and machine-learning to predict essential genes in .结合特征工程和机器学习来预测……中的必需基因。 (原文句末不完整)
NAR Genom Bioinform. 2020 Jul 22;2(3):lqaa051. doi: 10.1093/nargab/lqaa051. eCollection 2020 Sep.
7
PDGF/VEGF signaling from muscles to hepatocyte-like cells protects against obesity.肌肉向肝样细胞传递的 PDGF/VEGF 信号可预防肥胖。
Elife. 2020 Oct 27;9:e56969. doi: 10.7554/eLife.56969.
8
as a Model Organism in Host-Pathogen Interaction Studies.作为宿主-病原体相互作用研究中的模式生物。
Front Cell Infect Microbiol. 2020 Jun 23;10:214. doi: 10.3389/fcimb.2020.00214. eCollection 2020.
9
Essential gene prediction in using machine learning approaches based on sequence and functional features.基于序列和功能特征,使用机器学习方法进行必需基因预测。
Comput Struct Biotechnol J. 2020 Mar 10;18:612-621. doi: 10.1016/j.csbj.2020.02.022. eCollection 2020.
10
Ensembl 2020.Ensembl 2020.
Nucleic Acids Res. 2020 Jan 8;48(D1):D682-D688. doi: 10.1093/nar/gkz966.