• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种新型深度学习辅助混合网络用于疟原虫寄生虫线粒体蛋白分类。

A novel deep learning-assisted hybrid network for plasmodium falciparum parasite mitochondrial proteins classification.

机构信息

Department of Computer Science, Faculty of Mathematical and Computer Science, University of Gezira, Wad Madani, Sudan.

Preparatory Year Department, Al-Ghad International Colleges for Applied Medical Sciences, Riyadh, Saudi Arabia.

出版信息

PLoS One. 2022 Oct 6;17(10):e0275195. doi: 10.1371/journal.pone.0275195. eCollection 2022.

DOI:10.1371/journal.pone.0275195
PMID:36201724
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9536844/
Abstract

Plasmodium falciparum is a parasitic protozoan that can cause malaria, which is a deadly disease. Therefore, the accurate identification of malaria parasite mitochondrial proteins is essential for understanding their functions and identifying novel drug targets. For classifying protein sequences, several adaptive statistical techniques have been devised. Despite significant gains, prediction performance is still constrained by the lack of appropriate feature descriptors and learning strategies in current systems. Moreover, good ground truth data is important for Artificial Intelligence (AI)-based models but there is a lack of that data in the literature. Therefore, in this work, we propose a novel hybrid network that combines 1D Convolutional Neural Network (CNN) and Bidirectional Gated Recurrent Unit (BGRU) to classify the malaria parasite mitochondrial proteins. Furthermore, we curate a sequential data that are collected from National Center for Biotechnology Information (NCBI) and UniProtKB/Swiss-Prot proteins databanks to prepare a dataset that can be used by the research community for AI-based algorithms evaluation. We obtain 4204 cases after preprocessing of the collected data and denote this set of proteins as PF4204. Finally, we conduct an ablation study on several conventional and deep models using PF4204 and the benchmark PF2095 datasets. The proposed model 'CNN-BGRU' obtains the accuracy values of 0.9096 and 0.9857 on PF4204 and PF2095 datasets, respectively. In addition, the CNN-BGRU is compared with state-of-the-arts, where the results illustrate that it can extract robust features and identify proteins accurately.

摘要

疟原虫是一种能引起疟疾的寄生虫原生动物,疟疾是一种致命疾病。因此,准确识别疟原虫线粒体蛋白对于了解其功能和鉴定新的药物靶点至关重要。为了对蛋白质序列进行分类,已经设计了几种自适应统计技术。尽管取得了显著的进展,但预测性能仍然受到当前系统中缺乏适当的特征描述符和学习策略的限制。此外,人工智能 (AI) 模型需要良好的真实数据,但文献中缺乏这种数据。因此,在这项工作中,我们提出了一种新的混合网络,该网络结合了一维卷积神经网络 (CNN) 和双向门控循环单元 (BGRU) 来对疟原虫线粒体蛋白进行分类。此外,我们从国家生物技术信息中心 (NCBI) 和 UniProtKB/Swiss-Prot 蛋白质数据库中收集了顺序数据,以准备一个数据集,供研究社区用于 AI 算法评估。我们在对收集的数据进行预处理后得到了 4204 个案例,并将这组蛋白质表示为 PF4204。最后,我们使用 PF4204 和基准 PF2095 数据集对几种传统和深度模型进行了消融研究。所提出的模型“CNN-BGRU”在 PF4204 和 PF2095 数据集上分别获得了 0.9096 和 0.9857 的准确率值。此外,CNN-BGRU 与最先进的方法进行了比较,结果表明它可以提取稳健的特征并准确识别蛋白质。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb2a/9536844/74b9f15b6ecd/pone.0275195.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb2a/9536844/07f48126932b/pone.0275195.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb2a/9536844/c45afe8b47cc/pone.0275195.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb2a/9536844/74b9f15b6ecd/pone.0275195.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb2a/9536844/07f48126932b/pone.0275195.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb2a/9536844/c45afe8b47cc/pone.0275195.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb2a/9536844/74b9f15b6ecd/pone.0275195.g005.jpg

相似文献

1
A novel deep learning-assisted hybrid network for plasmodium falciparum parasite mitochondrial proteins classification.一种新型深度学习辅助混合网络用于疟原虫寄生虫线粒体蛋白分类。
PLoS One. 2022 Oct 6;17(10):e0275195. doi: 10.1371/journal.pone.0275195. eCollection 2022.
2
Identification of mitochondrial proteins of malaria parasite using analysis of variance.利用方差分析鉴定疟原虫的线粒体蛋白
Amino Acids. 2015 Feb;47(2):329-33. doi: 10.1007/s00726-014-1862-4. Epub 2014 Nov 11.
3
Stock prediction based on bidirectional gated recurrent unit with convolutional neural network and feature selection.基于双向门控循环单元与卷积神经网络的特征选择股票预测。
PLoS One. 2022 Feb 4;17(2):e0262501. doi: 10.1371/journal.pone.0262501. eCollection 2022.
4
Dispensable Role of Mitochondrial Fission Protein 1 (Fis1) in the Erythrocytic Development of Plasmodium falciparum.线粒体分裂蛋白 1(Fis1)在疟原虫红细胞发育中的可有可无的作用。
mSphere. 2020 Sep 23;5(5):e00579-20. doi: 10.1128/mSphere.00579-20.
5
An automated framework for evaluation of deep learning models for splice site predictions.用于评估深度学习模型进行剪接位点预测的自动化框架。
Sci Rep. 2023 Jun 23;13(1):10221. doi: 10.1038/s41598-023-34795-4.
6
Prediction of mitochondrial proteins of malaria parasite using split amino acid composition and PSSM profile.利用氨基酸组成拆分和 PSSM 图谱预测疟原虫的线粒体蛋白。
Amino Acids. 2010 Jun;39(1):101-10. doi: 10.1007/s00726-009-0381-1. Epub 2009 Nov 12.
7
The applications of deep learning algorithms on in silico druggable proteins identification.深度学习算法在虚拟可成药蛋白识别中的应用。
J Adv Res. 2022 Nov;41:219-231. doi: 10.1016/j.jare.2022.01.009. Epub 2022 Jan 22.
8
Deep Learning for Smartphone-Based Malaria Parasite Detection in Thick Blood Smears.基于深度学习的智能手机厚血涂片疟原虫检测
IEEE J Biomed Health Inform. 2020 May;24(5):1427-1438. doi: 10.1109/JBHI.2019.2939121. Epub 2019 Sep 23.
9
Application of machine and deep learning algorithms in optical microscopic detection of Plasmodium: A malaria diagnostic tool for the future.机器学习和深度学习算法在疟原虫光学显微镜检测中的应用:未来的疟疾诊断工具。
Photodiagnosis Photodyn Ther. 2022 Dec;40:103198. doi: 10.1016/j.pdpdt.2022.103198. Epub 2022 Nov 12.
10
Malaria detection through digital microscopic imaging using Deep Greedy Network with transfer learning.使用具有迁移学习的深度贪婪网络通过数字显微成像进行疟疾检测。
J Med Imaging (Bellingham). 2021 Sep;8(5):054502. doi: 10.1117/1.JMI.8.5.054502. Epub 2021 Sep 28.

引用本文的文献

1
Antimalarial Drug Repurposing of Epirubicin and Pelitinib in Combination with Artemether and Lumefantrine.表柔比星和培利替尼与蒿甲醚和本芴醇联合用于抗疟药物的重新利用
Pharmaceuticals (Basel). 2025 Jun 25;18(7):956. doi: 10.3390/ph18070956.

本文引用的文献

1
To Assist Oncologists: An Efficient Machine Learning-Based Approach for Anti-Cancer Peptides Classification.辅助肿瘤学家:一种基于机器学习的高效抗癌肽分类方法。
Sensors (Basel). 2022 May 25;22(11):4005. doi: 10.3390/s22114005.
2
An Empirical Evaluation of Convolutional Networks for Malaria Diagnosis.用于疟疾诊断的卷积网络的实证评估
J Imaging. 2022 Mar 7;8(3):66. doi: 10.3390/jimaging8030066.
3
Atrous Convolutions and Residual GRU Based Architecture for Matching Power Demand with Supply.多孔卷积和基于剩余 GRU 的架构,用于匹配供需电力。
Sensors (Basel). 2021 Oct 29;21(21):7191. doi: 10.3390/s21217191.
4
Malaria parasite detection in thick blood smear microscopic images using modified YOLOV3 and YOLOV4 models.使用改进的 YOLOV3 和 YOLOV4 模型检测厚血涂片显微镜图像中的疟原虫。
BMC Bioinformatics. 2021 Mar 8;22(1):112. doi: 10.1186/s12859-021-04036-4.
5
Artificial intelligence and machine learning-aided drug discovery in central nervous system diseases: State-of-the-arts and future directions.人工智能和机器学习辅助中枢神经系统疾病药物发现:现状与未来方向。
Med Res Rev. 2021 May;41(3):1427-1473. doi: 10.1002/med.21764. Epub 2020 Dec 9.
6
Host-Malaria Parasite Interactions and Impacts on Mutual Evolution.宿主-疟原虫相互作用及其对共同进化的影响。
Front Cell Infect Microbiol. 2020 Oct 27;10:587933. doi: 10.3389/fcimb.2020.587933. eCollection 2020.
7
Towards Efficient Building Designing: Heating and Cooling Load Prediction via Multi-Output Model.迈向高效的建筑设计:通过多输出模型进行加热和冷却负荷预测。
Sensors (Basel). 2020 Nov 10;20(22):6419. doi: 10.3390/s20226419.
8
Recognition of Mitochondrial Proteins in Plasmodium Based on the Tripeptide Composition.基于三肽组成对疟原虫线粒体蛋白质的识别
Front Cell Dev Biol. 2020 Sep 16;8:578901. doi: 10.3389/fcell.2020.578901. eCollection 2020.
9
DeepMito: accurate prediction of protein sub-mitochondrial localization using convolutional neural networks.DeepMito:使用卷积神经网络准确预测蛋白质亚线粒体定位
Bioinformatics. 2020 Jan 1;36(1):56-64. doi: 10.1093/bioinformatics/btz512.
10
PredT4SE-Stack: Prediction of Bacterial Type IV Secreted Effectors From Protein Sequences Using a Stacked Ensemble Method.PredT4SE-Stack:使用堆叠集成方法从蛋白质序列预测细菌IV型分泌效应蛋白
Front Microbiol. 2018 Oct 26;9:2571. doi: 10.3389/fmicb.2018.02571. eCollection 2018.