• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

经训练可用于检测病毒和噬菌体结构蛋白的人工神经网络。

Artificial neural networks trained to detect viral and phage structural proteins.

机构信息

Program of Computational Science, San Diego State University, San Diego, California, United States of America.

出版信息

PLoS Comput Biol. 2012;8(8):e1002657. doi: 10.1371/journal.pcbi.1002657. Epub 2012 Aug 23.

DOI:10.1371/journal.pcbi.1002657
PMID:22927809
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3426561/
Abstract

Phages play critical roles in the survival and pathogenicity of their hosts, via lysogenic conversion factors, and in nutrient redistribution, via cell lysis. Analyses of phage- and viral-encoded genes in environmental samples provide insights into the physiological impact of viruses on microbial communities and human health. However, phage ORFs are extremely diverse of which over 70% of them are dissimilar to any genes with annotated functions in GenBank. Better identification of viruses would also aid in better detection and diagnosis of disease, in vaccine development, and generally in better understanding the physiological potential of any environment. In contrast to enzymes, viral structural protein function can be much more challenging to detect from sequence data because of low sequence conservation, few known conserved catalytic sites or sequence domains, and relatively limited experimental data. We have designed a method of predicting phage structural protein sequences that uses Artificial Neural Networks (ANNs). First, we trained ANNs to classify viral structural proteins using amino acid frequency; these correctly classify a large fraction of test cases with a high degree of specificity and sensitivity. Subsequently, we added estimates of protein isoelectric points as a feature to ANNs that classify specialized families of proteins, namely major capsid and tail proteins. As expected, these more specialized ANNs are more accurate than the structural ANNs. To experimentally validate the ANN predictions, several ORFs with no significant similarities to known sequences that are ANN-predicted structural proteins were examined by transmission electron microscopy. Some of these self-assembled into structures strongly resembling virion structures. Thus, our ANNs are new tools for identifying phage and potential prophage structural proteins that are difficult or impossible to detect by other bioinformatic analysis. The networks will be valuable when sequence is available but in vitro propagation of the phage may not be practical or possible.

摘要

噬菌体在其宿主的生存和致病性方面发挥着关键作用,通过溶原性转换因子,以及通过细胞裂解进行营养再分配。对环境样本中噬菌体和病毒编码基因的分析,提供了关于病毒对微生物群落和人类健康的生理影响的见解。然而,噬菌体 ORF 极其多样化,其中超过 70%的 ORF 与 GenBank 中具有注释功能的任何基因都不相似。更好地识别病毒也将有助于更好地检测和诊断疾病,开发疫苗,并更好地了解任何环境的生理潜力。与酶不同,由于序列保守性低、已知的保守催化位点或序列结构域少以及相对有限的实验数据,病毒结构蛋白的功能从序列数据中检测更加具有挑战性。我们设计了一种使用人工神经网络 (ANN) 预测噬菌体结构蛋白序列的方法。首先,我们使用氨基酸频率训练 ANN 来分类病毒结构蛋白;这些 ANN 正确分类了大量测试案例,具有高度的特异性和敏感性。随后,我们添加了蛋白质等电点的估计值作为特征,用于分类专门的蛋白质家族,即主要衣壳和尾部蛋白。正如预期的那样,这些更专门的 ANN 比结构 ANN 更准确。为了通过实验验证 ANN 预测,我们通过透射电子显微镜检查了几个与已知序列没有显著相似性但被 ANN 预测为结构蛋白的 ORF。其中一些自我组装成与病毒结构非常相似的结构。因此,我们的 ANN 是识别噬菌体和潜在噬菌体结构蛋白的新工具,这些结构蛋白通过其他生物信息学分析难以或不可能检测到。当有序列可用但体外繁殖噬菌体不切实际或不可能时,这些网络将非常有价值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d040/3426561/3e3c80ba7f9f/pcbi.1002657.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d040/3426561/f7a1cf6c9cce/pcbi.1002657.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d040/3426561/acc32ca60e7d/pcbi.1002657.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d040/3426561/f25ecfc3a005/pcbi.1002657.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d040/3426561/4cf0bcee3065/pcbi.1002657.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d040/3426561/09d77ff4acb1/pcbi.1002657.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d040/3426561/e87b54499536/pcbi.1002657.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d040/3426561/479879e33ffa/pcbi.1002657.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d040/3426561/ec0547741f15/pcbi.1002657.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d040/3426561/dbcb1c1d68af/pcbi.1002657.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d040/3426561/3e3c80ba7f9f/pcbi.1002657.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d040/3426561/f7a1cf6c9cce/pcbi.1002657.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d040/3426561/acc32ca60e7d/pcbi.1002657.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d040/3426561/f25ecfc3a005/pcbi.1002657.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d040/3426561/4cf0bcee3065/pcbi.1002657.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d040/3426561/09d77ff4acb1/pcbi.1002657.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d040/3426561/e87b54499536/pcbi.1002657.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d040/3426561/479879e33ffa/pcbi.1002657.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d040/3426561/ec0547741f15/pcbi.1002657.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d040/3426561/dbcb1c1d68af/pcbi.1002657.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d040/3426561/3e3c80ba7f9f/pcbi.1002657.g010.jpg

相似文献

1
Artificial neural networks trained to detect viral and phage structural proteins.经训练可用于检测病毒和噬菌体结构蛋白的人工神经网络。
PLoS Comput Biol. 2012;8(8):e1002657. doi: 10.1371/journal.pcbi.1002657. Epub 2012 Aug 23.
2
Functional and comparative genome analysis of novel virulent actinophages belonging to Streptomyces flavovirens.属于黄弗链霉菌的新型烈性放线菌噬菌体的功能和比较基因组分析
BMC Microbiol. 2017 Mar 3;17(1):51. doi: 10.1186/s12866-017-0940-7.
3
Sequence analysis of the genome of the temperate Yersinia enterocolitica phage PY54.温和型小肠结肠炎耶尔森菌噬菌体PY54基因组的序列分析
J Mol Biol. 2003 Aug 15;331(3):605-22. doi: 10.1016/s0022-2836(03)00763-0.
4
Genome organization and characterization of the virulent lactococcal phage 1358 and its similarities to Listeria phages.乳酸菌噬菌体 1358 的基因组组织与特性及其与李斯特菌噬菌体的相似性。
Appl Environ Microbiol. 2010 Mar;76(5):1623-32. doi: 10.1128/AEM.02173-09. Epub 2010 Jan 8.
5
PhANNs, a fast and accurate tool and web server to classify phage structural proteins.PhANNs,一个快速准确的工具和网络服务器,用于分类噬菌体结构蛋白。
PLoS Comput Biol. 2020 Nov 2;16(11):e1007845. doi: 10.1371/journal.pcbi.1007845. eCollection 2020 Nov.
6
Bioinformatic analysis of the Acinetobacter baumannii phage AB1 genome.生物信息学分析鲍曼不动杆菌噬菌体 AB1 基因组。
Gene. 2012 Oct 10;507(2):125-34. doi: 10.1016/j.gene.2012.07.029. Epub 2012 Jul 31.
7
Complete nucleotide sequence, molecular analysis and genome structure of bacteriophage A118 of Listeria monocytogenes: implications for phage evolution.单核细胞增生李斯特菌噬菌体A118的完整核苷酸序列、分子分析及基因组结构:对噬菌体进化的启示
Mol Microbiol. 2000 Jan;35(2):324-40. doi: 10.1046/j.1365-2958.2000.01720.x.
8
Sequence analysis of Leuconostoc mesenteroides bacteriophage Phi1-A4 isolated from an industrial vegetable fermentation.从蔬菜发酵工业中分离的明串珠菌噬菌体 Phi1-A4 的序列分析。
Appl Environ Microbiol. 2010 Mar;76(6):1955-66. doi: 10.1128/AEM.02126-09. Epub 2010 Jan 29.
9
Automated classification of tailed bacteriophages according to their neck organization.根据颈部结构对有尾噬菌体进行自动分类。
BMC Genomics. 2014 Nov 27;15(1):1027. doi: 10.1186/1471-2164-15-1027.
10
Comparative genomic analysis of 142 bacteriophages infecting Salmonella enterica subsp. enterica.比较分析感染肠沙门氏菌亚种的 142 种噬菌体的基因组。
BMC Genomics. 2020 May 26;21(1):374. doi: 10.1186/s12864-020-6765-z.

引用本文的文献

1
Fold first, ask later: structure-informed function annotation of phage proteins.先折叠,后询问:噬菌体蛋白质的结构导向功能注释
bioRxiv. 2025 Jul 20:2025.07.17.665397. doi: 10.1101/2025.07.17.665397.
2
A review of neural networks for metagenomic binning.宏基因组分箱的神经网络综述。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf065.
3
Elucidation of molecular function of phage protein responsible for optimization of host cell lysis.阐明负责优化宿主细胞裂解的噬菌体蛋白的分子功能。

本文引用的文献

1
Gene design, cloning and protein-expression methods for high-value targets at the Seattle Structural Genomics Center for Infectious Disease.西雅图传染病结构基因组学中心针对高价值靶点的基因设计、克隆及蛋白质表达方法。
Acta Crystallogr Sect F Struct Biol Cryst Commun. 2011 Sep 1;67(Pt 9):992-7. doi: 10.1107/S1744309111026698. Epub 2011 Aug 13.
2
Gene Composer in a structural genomics environment.结构基因组学环境中的基因编辑器。
Acta Crystallogr Sect F Struct Biol Cryst Commun. 2011 Sep 1;67(Pt 9):985-91. doi: 10.1107/S1744309111027424. Epub 2011 Aug 13.
3
Characterization of a novel temperate phage originating from a cereulide-producing Bacillus cereus strain.
BMC Microbiol. 2024 Dec 19;24(1):532. doi: 10.1186/s12866-024-03684-9.
4
Recent Applications of Artificial Intelligence in Discovery of New Antibacterial Agents.人工智能在新型抗菌药物发现中的最新应用
Adv Appl Bioinform Chem. 2024 Dec 3;17:139-157. doi: 10.2147/AABC.S484321. eCollection 2024.
5
PhageScanner: a reconfigurable machine learning framework for bacteriophage genomic and metagenomic feature annotation.噬菌体扫描器:一种用于噬菌体基因组和宏基因组特征注释的可重构机器学习框架。
Front Microbiol. 2024 Sep 17;15:1446097. doi: 10.3389/fmicb.2024.1446097. eCollection 2024.
6
Deep learning in structural bioinformatics: current applications and future perspectives.结构生物信息学中的深度学习:当前应用与未来展望。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae042.
7
Antimicrobial resistance crisis: could artificial intelligence be the solution?抗菌药物耐药性危机:人工智能能否成为解决方案?
Mil Med Res. 2024 Jan 23;11(1):7. doi: 10.1186/s40779-024-00510-1.
8
Compounding Phages for Therapeutic Applications.用于治疗应用的噬菌体的组合。
Viruses. 2023 Jul 30;15(8):1665. doi: 10.3390/v15081665.
9
PhaVIP: Phage VIrion Protein classification based on chaos game representation and Vision Transformer.基于混沌游戏表示和 Vision Transformer 的噬菌体衣壳蛋白分类
Bioinformatics. 2023 Jun 30;39(39 Suppl 1):i30-i39. doi: 10.1093/bioinformatics/btad229.
10
PhageTailFinder: A tool for phage tail module detection and annotation.噬菌体尾部查找器:一种用于噬菌体尾部模块检测和注释的工具。
Front Genet. 2023 Jan 23;14:947466. doi: 10.3389/fgene.2023.947466. eCollection 2023.
一种新型温和噬菌体的特性,来源于一株产生呕吐毒素的蜡样芽胞杆菌。
Res Microbiol. 2011 May;162(4):446-59. doi: 10.1016/j.resmic.2011.02.009. Epub 2011 Feb 22.
4
Assembly of infectious hepatitis C virus particles.丙型肝炎病毒颗粒的组装。
Trends Microbiol. 2011 Feb;19(2):95-103. doi: 10.1016/j.tim.2010.11.005. Epub 2010 Dec 14.
5
Morphogenesis of the T4 tail and tail fibers.T4 尾和尾丝的形态发生。
Virol J. 2010 Dec 3;7:355. doi: 10.1186/1743-422X-7-355.
6
The solution structure of the C-terminal Ig-like domain of the bacteriophage λ tail tube protein.噬菌体 λ 尾管蛋白 C 端 Ig 样结构域的溶液结构。
J Mol Biol. 2010 Oct 29;403(3):468-79. doi: 10.1016/j.jmb.2010.08.044. Epub 2010 Sep 6.
7
In vitro assembly of the T=13 procapsid of bacteriophage T5 with its scaffolding domain.噬菌体 T5 T=13 型预制衣壳与其支架域的体外组装。
J Virol. 2010 Sep;84(18):9350-8. doi: 10.1128/JVI.00942-10. Epub 2010 Jun 23.
8
Does bleach processing increase the accuracy of sputum smear microscopy for diagnosing pulmonary tuberculosis?漂白处理是否会提高痰涂片显微镜检查诊断肺结核的准确性?
J Clin Microbiol. 2010 Jul;48(7):2433-9. doi: 10.1128/JCM.00208-10. Epub 2010 Apr 26.
9
Testing the sensitivity and specificity of the fluorescence microscope (Cyscope) for malaria diagnosis.检测荧光显微镜(Cyscope)诊断疟疾的灵敏度和特异性。
Malar J. 2010 Mar 31;9:88. doi: 10.1186/1475-2875-9-88.
10
Roles of viruses in the environment.病毒在环境中的作用。
Environ Microbiol. 2009 Nov;11(11):2771-4. doi: 10.1111/j.1462-2920.2009.02101.x.