• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 Profile HMM 的灵敏且特异序列检测的合理设计:应用于病毒、噬菌体和 Casposons 的案例研究。

Rational Design of Profile HMMs for Sensitive and Specific Sequence Detection with Case Studies Applied to Viruses, Bacteriophages, and Casposons.

机构信息

Department of Parasitology, Instituto de Ciências Biomédicas, Universidade de São Paulo, São Paulo 05508-000, SP, Brazil.

Max Planck Tandem Group in Computational Biology, Department of Biological Sciences, Universidad de los Andes, Bogotá 111711, Colombia.

出版信息

Viruses. 2023 Feb 13;15(2):519. doi: 10.3390/v15020519.

DOI:10.3390/v15020519
PMID:36851733
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9966878/
Abstract

Profile hidden Markov models (HMMs) are a powerful way of modeling biological sequence diversity and constitute a very sensitive approach to detecting divergent sequences. Here, we report the development of protocols for the rational design of profile HMMs. These methods were implemented on TABAJARA, a program that can be used to either detect all biological sequences of a group or discriminate specific groups of sequences. By calculating position-specific information scores along a multiple sequence alignment, TABAJARA automatically identifies the most informative sequence motifs and uses them to construct profile HMMs. As a proof-of-principle, we applied TABAJARA to generate profile HMMs for the detection and classification of two viral groups presenting different evolutionary rates: bacteriophages of the family and viruses of the genus. We obtained conserved models for the generic detection of any or sequence, and profile HMMs that can specifically discriminate subfamilies or species. In another application, we constructed Cas1 endonuclease-derived profile HMMs that can discriminate CRISPRs and casposons, two evolutionarily related transposable elements. We believe that the protocols described here, and implemented on TABAJARA, constitute a generic toolbox for generating profile HMMs for the highly sensitive and specific detection of sequence classes.

摘要

隐马尔可夫模型(HMMs)是一种对生物序列多样性建模的强大方法,也是一种非常敏感的检测差异序列的方法。在这里,我们报告了用于合理设计 HMM 模型的协议的开发。这些方法是在 TABAJARA 上实现的,TABAJARA 可以用于检测一组中的所有生物序列或区分特定的序列组。通过沿多序列比对计算位置特异性信息评分,TABAJARA 自动识别最具信息量的序列基序,并使用它们来构建 HMM 模型。作为原理验证,我们应用 TABAJARA 为具有不同进化率的两个病毒群(科和属)的检测和分类生成 HMM 模型。我们获得了用于通用检测任何 或 序列的保守模型,以及能够特异性区分 亚科或 物种的 HMM 模型。在另一个应用中,我们构建了 Cas1 内切酶衍生的 HMM 模型,用于区分 CRISPRs 和 casposons,这两种进化上相关的转座元件。我们相信,这里描述的协议,以及在 TABAJARA 上实现的协议,构成了用于高度敏感和特异性检测序列类别的生成 HMM 模型的通用工具包。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a82f/9966878/ebf5eddf2919/viruses-15-00519-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a82f/9966878/bf8da293441e/viruses-15-00519-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a82f/9966878/962df29dfd5f/viruses-15-00519-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a82f/9966878/85859e9f26e3/viruses-15-00519-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a82f/9966878/9da4097ef1d8/viruses-15-00519-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a82f/9966878/ebf5eddf2919/viruses-15-00519-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a82f/9966878/bf8da293441e/viruses-15-00519-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a82f/9966878/962df29dfd5f/viruses-15-00519-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a82f/9966878/85859e9f26e3/viruses-15-00519-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a82f/9966878/9da4097ef1d8/viruses-15-00519-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a82f/9966878/ebf5eddf2919/viruses-15-00519-g005.jpg

相似文献

1
Rational Design of Profile HMMs for Sensitive and Specific Sequence Detection with Case Studies Applied to Viruses, Bacteriophages, and Casposons.基于 Profile HMM 的灵敏且特异序列检测的合理设计:应用于病毒、噬菌体和 Casposons 的案例研究。
Viruses. 2023 Feb 13;15(2):519. doi: 10.3390/v15020519.
2
Rational Design of Profile Hidden Markov Models for Viral Classification and Discovery用于病毒分类与发现的轮廓隐马尔可夫模型的合理设计
3
Profile hidden Markov models for the detection of viruses within metagenomic sequence data.用于在宏基因组序列数据中检测病毒的轮廓隐马尔可夫模型。
PLoS One. 2014 Aug 20;9(8):e105067. doi: 10.1371/journal.pone.0105067. eCollection 2014.
4
GenSeed-HMM: A Tool for Progressive Assembly Using Profile HMMs as Seeds and its Application in Alpavirinae Viral Discovery from Metagenomic Data.GenSeed-HMM:一种使用隐马尔可夫模型轮廓作为种子进行渐进式组装的工具及其在从宏基因组数据中发现阿尔帕病毒科病毒中的应用
Front Microbiol. 2016 Mar 4;7:269. doi: 10.3389/fmicb.2016.00269. eCollection 2016.
5
Predicting conserved protein motifs with Sub-HMMs.用 Sub-HMMs 预测保守蛋白基序。
BMC Bioinformatics. 2010 Apr 26;11:205. doi: 10.1186/1471-2105-11-205.
6
Transposon identification using profile HMMs.使用 Profile HMM 进行转座子鉴定。
BMC Genomics. 2010 Feb 10;11 Suppl 1(Suppl 1):S10. doi: 10.1186/1471-2164-11-S1-S10.
7
Fuzzy Hidden Markov Models: a new approach in multiple sequence alignment.模糊隐马尔可夫模型:多序列比对的一种新方法。
Stud Health Technol Inform. 2006;124:99-104.
8
Profile Hidden Markov Models Are Not Identifiable.轮廓隐马尔可夫模型不可识别。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Jan-Feb;18(1):162-172. doi: 10.1109/TCBB.2019.2933821. Epub 2021 Feb 3.
9
A network of SCOP hidden Markov models and its analysis.SCOP 隐马尔可夫模型网络及其分析。
BMC Bioinformatics. 2011 May 23;12:191. doi: 10.1186/1471-2105-12-191.
10
Hidden Markov Models and their Applications in Biological Sequence Analysis.隐马尔可夫模型及其在生物序列分析中的应用。
Curr Genomics. 2009 Sep;10(6):402-15. doi: 10.2174/138920209789177575.

引用本文的文献

1
RdDM-Associated Chromatin Remodelers in Soybean: Evolution and Stress-Induced Expression of CLASSY Genes.大豆中与RNA介导的DNA甲基化相关的染色质重塑因子:CLASSY基因的进化与胁迫诱导表达
Plants (Basel). 2025 Aug 15;14(16):2543. doi: 10.3390/plants14162543.
2
Integrating Sequence- and Structure-Based Similarity Metrics for the Demarcation of Multiple Viral Taxonomic Levels.整合基于序列和结构的相似性度量以划分多个病毒分类水平
Viruses. 2025 Apr 29;17(5):642. doi: 10.3390/v17050642.

本文引用的文献

1
VIRify: An integrated detection, annotation and taxonomic classification pipeline using virus-specific protein profile hidden Markov models.VIRify:一种使用病毒特异性蛋白特征隐马尔可夫模型进行综合检测、注释和分类学分类的管道。
PLoS Comput Biol. 2023 Aug 28;19(8):e1011422. doi: 10.1371/journal.pcbi.1011422. eCollection 2023 Aug.
2
Perspective on taxonomic classification of uncultivated viruses.未培养病毒分类的思考。
Curr Opin Virol. 2021 Dec;51:207-215. doi: 10.1016/j.coviro.2021.10.011. Epub 2021 Nov 12.
3
Current challenges to virus discovery by meta-transcriptomics.
元转录组学发现病毒面临的挑战。
Curr Opin Virol. 2021 Dec;51:48-55. doi: 10.1016/j.coviro.2021.09.007. Epub 2021 Sep 27.
4
PHROG: families of prokaryotic virus proteins clustered using remote homology.PHROG:利用远缘同源性聚类的原核病毒蛋白家族。
NAR Genom Bioinform. 2021 Aug 5;3(3):lqab067. doi: 10.1093/nargab/lqab067. eCollection 2021 Sep.
5
Informative Regions In Viral Genomes.病毒基因组中的信息区域。
Viruses. 2021 Jun 18;13(6):1164. doi: 10.3390/v13061164.
6
efam: an expanded, metaproteome-supported HMM profile database of viral protein families.efam:一个扩展的、基于宏基因组支持的病毒蛋白家族 HMM 谱数据库。
Bioinformatics. 2021 Nov 18;37(22):4202-4208. doi: 10.1093/bioinformatics/btab451.
7
Cenote-Taker 2 democratizes virus discovery and sequence annotation.Cenote-Taker 2使病毒发现和序列注释民主化。
Virus Evol. 2020 Dec 30;7(1):veaa100. doi: 10.1093/ve/veaa100. eCollection 2021 Jan.
8
Characterization of a Novel Mitovirus of the Sand Fly Using Genomic and Virus-Host Interaction Signatures.利用基因组和病毒-宿主相互作用特征对沙蝇新型微小病毒进行鉴定。
Viruses. 2020 Dec 23;13(1):9. doi: 10.3390/v13010009.
9
IMG/VR v3: an integrated ecological and evolutionary framework for interrogating genomes of uncultivated viruses.IMG/VR v3:用于探究未培养病毒基因组的综合生态和进化框架。
Nucleic Acids Res. 2021 Jan 8;49(D1):D764-D775. doi: 10.1093/nar/gkaa946.
10
Pfam: The protein families database in 2021.Pfam:2021 年的蛋白质家族数据库。
Nucleic Acids Res. 2021 Jan 8;49(D1):D412-D419. doi: 10.1093/nar/gkaa913.