• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

深度必需:一种用于预测微生物必需基因的深度神经网络。

DeeplyEssential: a deep neural network for predicting essential genes in microbes.

机构信息

Department of Computer Science and Engineering, University of California Riverside, 900 University Ave, Riverside, 92507, CA, USA.

出版信息

BMC Bioinformatics. 2020 Sep 30;21(Suppl 14):367. doi: 10.1186/s12859-020-03688-y.

DOI:10.1186/s12859-020-03688-y
PMID:32998698
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7525945/
Abstract

BACKGROUND

Essential genes are those genes that are critical for the survival of an organism. The prediction of essential genes in bacteria can provide targets for the design of novel antibiotic compounds or antimicrobial strategies.

RESULTS

We propose a deep neural network for predicting essential genes in microbes. Our architecture called DEEPLYESSENTIAL makes minimal assumptions about the input data (i.e., it only uses gene primary sequence and the corresponding protein sequence) to carry out the prediction thus maximizing its practical application compared to existing predictors that require structural or topological features which might not be readily available. We also expose and study a hidden performance bias that effected previous classifiers. Extensive results show that DEEPLYESSENTIAL outperform existing classifiers that either employ down-sampling to balance the training set or use clustering to exclude multiple copies of orthologous genes.

CONCLUSION

Deep neural network architectures can efficiently predict whether a microbial gene is essential (or not) using only its sequence information.

摘要

背景

必需基因是指对生物体生存至关重要的基因。预测细菌中的必需基因可以为设计新型抗生素化合物或抗菌策略提供目标。

结果

我们提出了一种用于预测微生物中必需基因的深度神经网络。我们的架构称为 DEEPLYESSENTIAL,对输入数据(即仅使用基因的一级序列和相应的蛋白质序列)做出了最小的假设,从而与需要结构或拓扑特征的现有预测器相比,最大限度地提高了其实用性,这些特征可能不容易获得。我们还揭示并研究了影响以前分类器的隐藏性能偏差。广泛的结果表明,DEEPLLYESSENTIAL 优于现有的分类器,这些分类器要么采用下采样来平衡训练集,要么使用聚类来排除同源基因的多个副本。

结论

深度神经网络架构可以仅使用其序列信息有效地预测微生物基因是否必需(或非必需)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed54/7525945/f5051619359e/12859_2020_3688_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed54/7525945/8e73cd2ce429/12859_2020_3688_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed54/7525945/3388f96c24b8/12859_2020_3688_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed54/7525945/aa324f02993f/12859_2020_3688_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed54/7525945/043f278f3910/12859_2020_3688_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed54/7525945/dccda52bfc55/12859_2020_3688_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed54/7525945/88624deca8f1/12859_2020_3688_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed54/7525945/b438929fd423/12859_2020_3688_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed54/7525945/86dc33631dc1/12859_2020_3688_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed54/7525945/aa006457709b/12859_2020_3688_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed54/7525945/f5051619359e/12859_2020_3688_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed54/7525945/8e73cd2ce429/12859_2020_3688_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed54/7525945/3388f96c24b8/12859_2020_3688_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed54/7525945/aa324f02993f/12859_2020_3688_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed54/7525945/043f278f3910/12859_2020_3688_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed54/7525945/dccda52bfc55/12859_2020_3688_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed54/7525945/88624deca8f1/12859_2020_3688_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed54/7525945/b438929fd423/12859_2020_3688_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed54/7525945/86dc33631dc1/12859_2020_3688_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed54/7525945/aa006457709b/12859_2020_3688_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed54/7525945/f5051619359e/12859_2020_3688_Fig10_HTML.jpg

相似文献

1
DeeplyEssential: a deep neural network for predicting essential genes in microbes.深度必需:一种用于预测微生物必需基因的深度神经网络。
BMC Bioinformatics. 2020 Sep 30;21(Suppl 14):367. doi: 10.1186/s12859-020-03688-y.
2
Prediction of essential genes in prokaryote based on artificial neural network.基于人工神经网络的原核生物必需基因预测。
Genes Genomics. 2020 Jan;42(1):97-106. doi: 10.1007/s13258-019-00884-w. Epub 2019 Nov 17.
3
DeepHE: Accurately predicting human essential genes based on deep learning.DeepHE:基于深度学习的人类必需基因精准预测。
PLoS Comput Biol. 2020 Sep 16;16(9):e1008229. doi: 10.1371/journal.pcbi.1008229. eCollection 2020 Sep.
4
Selection of key sequence-based features for prediction of essential genes in 31 diverse bacterial species.基于关键序列特征的31种不同细菌物种中必需基因预测的选择
PLoS One. 2017 Mar 30;12(3):e0174638. doi: 10.1371/journal.pone.0174638. eCollection 2017.
5
Geptop: a gene essentiality prediction tool for sequenced bacterial genomes based on orthology and phylogeny.Geptop:一种基于直系同源性和系统发育的已测序细菌基因组基因必需性预测工具。
PLoS One. 2013 Aug 15;8(8):e72343. doi: 10.1371/journal.pone.0072343. eCollection 2013.
6
Sequence-based information-theoretic features for gene essentiality prediction.用于基因必需性预测的基于序列的信息论特征。
BMC Bioinformatics. 2017 Nov 9;18(1):473. doi: 10.1186/s12859-017-1884-5.
7
Assessing the utility of deep neural networks in predicting postoperative surgical complications: a retrospective study.评估深度学习神经网络在预测术后手术并发症中的效用:一项回顾性研究。
Lancet Digit Health. 2021 Aug;3(8):e471-e485. doi: 10.1016/S2589-7500(21)00084-4. Epub 2021 Jun 29.
8
MABAL: a Novel Deep-Learning Architecture for Machine-Assisted Bone Age Labeling.MABAL:一种用于机器辅助骨龄标注的新型深度学习架构。
J Digit Imaging. 2018 Aug;31(4):513-519. doi: 10.1007/s10278-018-0053-3.
9
Network-based features enable prediction of essential genes across diverse organisms.基于网络的特征可实现跨多种生物的必需基因预测。
PLoS One. 2018 Dec 13;13(12):e0208722. doi: 10.1371/journal.pone.0208722. eCollection 2018.
10
Predicting essential genes of 37 prokaryotes by combining information-theoretic features.通过结合信息论特征预测37种原核生物的必需基因。
J Microbiol Methods. 2021 Sep;188:106297. doi: 10.1016/j.mimet.2021.106297. Epub 2021 Jul 31.

引用本文的文献

1
A deep ensemble framework for human essential gene prediction by integrating multi-omics data.一种通过整合多组学数据进行人类必需基因预测的深度集成框架。
Sci Rep. 2025 Jul 21;15(1):26407. doi: 10.1038/s41598-025-99164-9.
2
A hybrid machine learning model with attention mechanism and multidimensional multivariate feature coding for essential gene prediction.一种具有注意力机制和多维多变量特征编码的混合机器学习模型用于必需基因预测。
BMC Biol. 2025 Apr 24;23(1):108. doi: 10.1186/s12915-025-02209-8.
3
Recent advances in the characterization of essential genes and development of a database of essential genes.

本文引用的文献

1
Network-based features enable prediction of essential genes across diverse organisms.基于网络的特征可实现跨多种生物的必需基因预测。
PLoS One. 2018 Dec 13;13(12):e0208722. doi: 10.1371/journal.pone.0208722. eCollection 2018.
2
HumanNet v2: human gene networks for disease research.HumanNet v2:用于疾病研究的人类基因网络。
Nucleic Acids Res. 2019 Jan 8;47(D1):D573-D580. doi: 10.1093/nar/gky1126.
3
Sequence-based information-theoretic features for gene essentiality prediction.用于基因必需性预测的基于序列的信息论特征。
必需基因表征的最新进展及必需基因数据库的开发。
Imeta. 2024 Jan 2;3(1):e157. doi: 10.1002/imt2.157. eCollection 2024 Feb.
4
An unsupervised deep learning framework for predicting human essential genes from population and functional genomic data.一种从人群和功能基因组数据中预测人类必需基因的无监督深度学习框架。
BMC Bioinformatics. 2023 Sep 18;24(1):347. doi: 10.1186/s12859-023-05481-z.
5
Evaluation of machine learning classifiers for predicting essential genes in strains.用于预测菌株中必需基因的机器学习分类器评估
Bioinformation. 2022 Dec 31;18(12):1126-1130. doi: 10.6026/973206300181126. eCollection 2022.
6
Genome engineering on size reduction and complexity simplification: A review.基因组工程的规模缩减与复杂性简化:综述。
J Adv Res. 2024 Jun;60:159-171. doi: 10.1016/j.jare.2023.07.006. Epub 2023 Jul 12.
7
DeepCellEss: cell line-specific essential protein prediction with attention-based interpretable deep learning.DeepCellEss:基于注意力机制的可解释深度学习的细胞系特异性必需蛋白预测。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac779.
8
..
J Biosci. 2022;47(2). doi: 10.1007/s12038-022-00253-y.
9
Bacterial genome reductions: Tools, applications, and challenges.细菌基因组缩减:工具、应用及挑战
Front Genome Ed. 2022 Aug 31;4:957289. doi: 10.3389/fgeed.2022.957289. eCollection 2022.
10
Genome-wide functional screens enable the prediction of high activity CRISPR-Cas9 and -Cas12a guides in Yarrowia lipolytica.全基因组功能筛选可预测在解脂耶氏酵母中具有高活性的 CRISPR-Cas9 和 -Cas12a 向导。
Nat Commun. 2022 Feb 17;13(1):922. doi: 10.1038/s41467-022-28540-0.
BMC Bioinformatics. 2017 Nov 9;18(1):473. doi: 10.1186/s12859-017-1884-5.
4
GRAFENE: Graphlet-based alignment-free network approach integrates 3D structural and sequence (residue order) data to improve protein structural comparison.基于图元的无比对网络方法整合了 3D 结构和序列(残基顺序)数据,以改进蛋白质结构比对。
Sci Rep. 2017 Nov 2;7(1):14890. doi: 10.1038/s41598-017-14411-y.
5
Maximum entropy methods for extracting the learned features of deep neural networks.用于提取深度神经网络学习特征的最大熵方法。
PLoS Comput Biol. 2017 Oct 30;13(10):e1005836. doi: 10.1371/journal.pcbi.1005836. eCollection 2017 Oct.
6
Selection of key sequence-based features for prediction of essential genes in 31 diverse bacterial species.基于关键序列特征的31种不同细菌物种中必需基因预测的选择
PLoS One. 2017 Mar 30;12(3):e0174638. doi: 10.1371/journal.pone.0174638. eCollection 2017.
7
Deep Learning in Medical Image Analysis.医学图像分析中的深度学习
Annu Rev Biomed Eng. 2017 Jun 21;19:221-248. doi: 10.1146/annurev-bioeng-071516-044442. Epub 2017 Mar 9.
8
Identifying Bacterial Essential Genes Based on a Feature-Integrated Method.基于特征集成方法的细菌必需基因鉴定。
IEEE/ACM Trans Comput Biol Bioinform. 2019 Jul-Aug;16(4):1274-1279. doi: 10.1109/TCBB.2017.2669968. Epub 2017 Feb 15.
9
Gene essentiality prediction based on fractal features and machine learning.基于分形特征和机器学习的基因必需性预测
Mol Biosyst. 2017 Feb 28;13(3):577-584. doi: 10.1039/c6mb00806b.
10
The STRING database in 2017: quality-controlled protein-protein association networks, made broadly accessible.2017年的STRING数据库:质量可控的蛋白质-蛋白质相互作用网络,广泛可用。
Nucleic Acids Res. 2017 Jan 4;45(D1):D362-D368. doi: 10.1093/nar/gkw937. Epub 2016 Oct 18.