• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MacSyFinder:一个用于挖掘基因组中分子系统的程序及其在CRISPR-Cas系统中的应用

MacSyFinder: a program to mine genomes for molecular systems with an application to CRISPR-Cas systems.

作者信息

Abby Sophie S, Néron Bertrand, Ménager Hervé, Touchon Marie, Rocha Eduardo P C

机构信息

Microbial Evolutionary Genomics, Institut Pasteur, Paris, France; UMR3525, CNRS, Paris, France.

Centre d'Informatique pour la Biologie, Institut Pasteur, Paris, France.

出版信息

PLoS One. 2014 Oct 17;9(10):e110726. doi: 10.1371/journal.pone.0110726. eCollection 2014.

DOI:10.1371/journal.pone.0110726
PMID:25330359
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4201578/
Abstract

MOTIVATION

Biologists often wish to use their knowledge on a few experimental models of a given molecular system to identify homologs in genomic data. We developed a generic tool for this purpose.

RESULTS

Macromolecular System Finder (MacSyFinder) provides a flexible framework to model the properties of molecular systems (cellular machinery or pathway) including their components, evolutionary associations with other systems and genetic architecture. Modelled features also include functional analogs, and the multiple uses of a same component by different systems. Models are used to search for molecular systems in complete genomes or in unstructured data like metagenomes. The components of the systems are searched by sequence similarity using Hidden Markov model (HMM) protein profiles. The assignment of hits to a given system is decided based on compliance with the content and organization of the system model. A graphical interface, MacSyView, facilitates the analysis of the results by showing overviews of component content and genomic context. To exemplify the use of MacSyFinder we built models to detect and class CRISPR-Cas systems following a previously established classification. We show that MacSyFinder allows to easily define an accurate "Cas-finder" using publicly available protein profiles.

AVAILABILITY AND IMPLEMENTATION

MacSyFinder is a standalone application implemented in Python. It requires Python 2.7, Hmmer and makeblastdb (version 2.2.28 or higher). It is freely available with its source code under a GPLv3 license at https://github.com/gem-pasteur/macsyfinder. It is compatible with all platforms supporting Python and Hmmer/makeblastdb. The "Cas-finder" (models and HMM profiles) is distributed as a compressed tarball archive as Supporting Information.

摘要

动机

生物学家常常希望利用他们对给定分子系统的少数实验模型的了解,在基因组数据中识别同源物。为此我们开发了一种通用工具。

结果

大分子系统查找器(MacSyFinder)提供了一个灵活的框架,用于对分子系统(细胞机制或途径)的特性进行建模,包括其组成部分、与其他系统的进化关联以及遗传结构。建模的特征还包括功能类似物,以及同一组件在不同系统中的多种用途。模型用于在完整基因组或元基因组等非结构化数据中搜索分子系统。使用隐马尔可夫模型(HMM)蛋白质谱通过序列相似性搜索系统的组成部分。根据与系统模型的内容和组织的符合程度,将命中结果分配给给定系统。图形界面MacSyView通过显示组件内容和基因组背景的概述,便于对结果进行分析。为了举例说明MacSyFinder的使用,我们构建了模型,按照先前建立的分类来检测和分类CRISPR-Cas系统。我们表明,MacSyFinder允许使用公开可用的蛋白质谱轻松定义一个准确的“Cas查找器”。

可用性和实现方式

MacSyFinder是一个用Python实现的独立应用程序。它需要Python 2.7、Hmmer和makeblastdb(版本2.2.28或更高)。它在https://github.com/gem-pasteur/macsyfinder上以GPLv3许可免费提供其源代码。它与所有支持Python和Hmmer/makeblastdb的平台兼容。“Cas查找器”(模型和HMM谱)作为支持信息以压缩的tarball存档形式分发。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a75/4201578/2ebcf929a59a/pone.0110726.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a75/4201578/d297f1bc1af4/pone.0110726.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a75/4201578/1277c007dff3/pone.0110726.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a75/4201578/3e00d47571e9/pone.0110726.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a75/4201578/2e11835026ae/pone.0110726.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a75/4201578/2ebcf929a59a/pone.0110726.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a75/4201578/d297f1bc1af4/pone.0110726.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a75/4201578/1277c007dff3/pone.0110726.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a75/4201578/3e00d47571e9/pone.0110726.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a75/4201578/2e11835026ae/pone.0110726.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a75/4201578/2ebcf929a59a/pone.0110726.g005.jpg

相似文献

1
MacSyFinder: a program to mine genomes for molecular systems with an application to CRISPR-Cas systems.MacSyFinder:一个用于挖掘基因组中分子系统的程序及其在CRISPR-Cas系统中的应用
PLoS One. 2014 Oct 17;9(10):e110726. doi: 10.1371/journal.pone.0110726. eCollection 2014.
2
Identification of Protein Secretion Systems in Bacterial Genomes Using MacSyFinder Version 2.利用 MacSyFinder 版本 2 鉴定细菌基因组中的蛋白分泌系统。
Methods Mol Biol. 2024;2715:1-25. doi: 10.1007/978-1-0716-3445-5_1.
3
Identification of Protein Secretion Systems in Bacterial Genomes Using MacSyFinder.使用MacSyFinder鉴定细菌基因组中的蛋白质分泌系统
Methods Mol Biol. 2017;1615:1-21. doi: 10.1007/978-1-4939-7033-9_1.
4
Genome mining for anti-CRISPR operons using machine learning.基于机器学习的抗 CRISPR 操纵子基因组挖掘。
Bioinformatics. 2023 May 4;39(5). doi: 10.1093/bioinformatics/btad309.
5
DFAST: a flexible prokaryotic genome annotation pipeline for faster genome publication.DFAST:一个灵活的原核生物基因组注释管道,用于更快地发布基因组。
Bioinformatics. 2018 Mar 15;34(6):1037-1039. doi: 10.1093/bioinformatics/btx713.
6
Casboundary: automated definition of integral Cas cassettes.Cas 边界:整 Cas 盒的自动定义。
Bioinformatics. 2021 Jun 16;37(10):1352-1359. doi: 10.1093/bioinformatics/btaa984.
7
Identifying Conjugative Plasmids and Integrative Conjugative Elements with CONJscan.使用 CONJscan 识别可接合质粒和整合接合元件。
Methods Mol Biol. 2020;2075:265-283. doi: 10.1007/978-1-4939-9877-7_19.
8
Survey of clustered regularly interspaced short palindromic repeats and their associated Cas proteins (CRISPR/Cas) systems in multiple sequenced strains of Klebsiella pneumoniae.肺炎克雷伯菌多个测序菌株中规律成簇间隔短回文重复序列及其相关Cas蛋白(CRISPR/Cas)系统的调查
BMC Res Notes. 2015 Aug 4;8:332. doi: 10.1186/s13104-015-1285-7.
9
PyHMMER: a Python library binding to HMMER for efficient sequence analysis.PyHMMER:一个绑定到 HMMER 的 Python 库,用于高效的序列分析。
Bioinformatics. 2023 May 4;39(5). doi: 10.1093/bioinformatics/btad214.
10
Genetic characteristics and phylogenetic analysis of Brazilian clinical strains of Pseudomonas aeruginosa harboring CRISPR/Cas systems.巴西临床分离铜绿假单胞菌中 CRISPR/Cas 系统的遗传特征及系统进化分析。
Curr Genet. 2021 Aug;67(4):663-672. doi: 10.1007/s00294-021-01173-4. Epub 2021 Mar 22.

引用本文的文献

1
Lineage-specific defence systems of pandemic .大流行的谱系特异性防御系统
Philos Trans R Soc Lond B Biol Sci. 2025 Sep 4;380(1934):20240076. doi: 10.1098/rstb.2024.0076.
2
A DNA nicking Class 1 OLD family nuclease mediates phage defense in Vibrio cholerae and is countered by a phage-encoded inhibitor.一种DNA切口1类OLD家族核酸酶介导霍乱弧菌中的噬菌体防御,并被噬菌体编码的抑制剂所对抗。
Nucleic Acids Res. 2025 Aug 27;53(16). doi: 10.1093/nar/gkaf728.
3
Discovery of CRISPR-Cas12a clades using a large language model.使用大语言模型发现CRISPR-Cas12a进化枝

本文引用的文献

1
Development and applications of CRISPR-Cas9 for genome engineering.用于基因组工程的CRISPR-Cas9技术的开发与应用。
Cell. 2014 Jun 5;157(6):1262-1278. doi: 10.1016/j.cell.2014.05.010.
2
RNA events. Cas9 targeting and the CRISPR revolution.RNA相关事件。Cas9靶向作用与CRISPR革命。
Science. 2014 May 16;344(6185):707-8. doi: 10.1126/science.1252964.
3
CRISPR-Cas systems: Prokaryotes upgrade to adaptive immunity.CRISPR-Cas 系统:原核生物升级获得适应性免疫。
Nat Commun. 2025 Aug 23;16(1):7877. doi: 10.1038/s41467-025-63160-4.
4
Design of function-regulating RNA via deep learning and AlphaFold 3.通过深度学习和AlphaFold 3设计功能调控RNA。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf419.
5
Impact of Natural Transformation on the Acquisition of Novel Genes in Bacteria.自然转化对细菌中新基因获得的影响。
Mol Biol Evol. 2025 Jul 30;42(8). doi: 10.1093/molbev/msaf192.
6
Identification and Pathogenicity Analysis of Qf-1 in Mink ().水貂中Qf-1的鉴定及致病性分析()。 (括号部分原文缺失具体内容)
Microorganisms. 2025 Jul 8;13(7):1604. doi: 10.3390/microorganisms13071604.
7
Cyanophage Infections in a Sponge Intracellular Cyanobacterial Symbiont.海绵细胞内蓝藻共生体中的噬蓝藻体感染
Environ Microbiol. 2025 Jul;27(7):e70155. doi: 10.1111/1462-2920.70155.
8
Structures reveal how the Cas1-2/3 integrase captures, delivers, and integrates foreign DNA into CRISPR loci.结构揭示了Cas1-2/3整合酶如何捕获、传递外源DNA并将其整合到CRISPR基因座中。
bioRxiv. 2025 Jun 11:2025.06.10.658980. doi: 10.1101/2025.06.10.658980.
9
Landscape of mobile genetic elements and their functional cargo across the gastrointestinal tract microbiomes in ruminants.反刍动物胃肠道微生物群中可移动遗传元件及其功能性负载的概况。
Microbiome. 2025 Jul 12;13(1):162. doi: 10.1186/s40168-025-02139-1.
10
Distribution of Genetic Determinants Associated with CRISPR-Cas Systems and Resistance to Antibiotics in the Genomes of Archaea and Bacteria.古菌和细菌基因组中与CRISPR-Cas系统及抗生素抗性相关的遗传决定因素分布
Microorganisms. 2025 Jun 6;13(6):1321. doi: 10.3390/microorganisms13061321.
Mol Cell. 2014 Apr 24;54(2):234-44. doi: 10.1016/j.molcel.2014.03.011.
4
To acquire or resist: the complex biological effects of CRISPR-Cas systems.获取或抵抗:CRISPR-Cas 系统的复杂生物学效应。
Trends Microbiol. 2014 Apr;22(4):218-25. doi: 10.1016/j.tim.2014.01.007. Epub 2014 Feb 26.
5
TIGRFAMs and Genome Properties in 2013.TIGRFAMs 和 2013 年的基因组特性。
Nucleic Acids Res. 2013 Jan;41(Database issue):D387-95. doi: 10.1093/nar/gks1234. Epub 2012 Nov 28.
6
The non-flagellar type III secretion system evolved from the bacterial flagellum and diversified into host-cell adapted systems.非鞭毛型 III 型分泌系统由细菌鞭毛进化而来,并多样化为适应宿主细胞的系统。
PLoS Genet. 2012 Sep;8(9):e1002983. doi: 10.1371/journal.pgen.1002983. Epub 2012 Sep 27.
7
Accelerated Profile HMM Searches.加速轮廓隐马尔可夫模型搜索。
PLoS Comput Biol. 2011 Oct;7(10):e1002195. doi: 10.1371/journal.pcbi.1002195. Epub 2011 Oct 20.
8
Evolution and classification of the CRISPR-Cas systems.CRISPR-Cas 系统的进化与分类。
Nat Rev Microbiol. 2011 Jun;9(6):467-77. doi: 10.1038/nrmicro2577. Epub 2011 May 9.
9
The small, slow and specialized CRISPR and anti-CRISPR of Escherichia and Salmonella.短小、缓慢且专一的 Escherichia 和 Salmonella 的 CRISPR 与抗 CRISPR。
PLoS One. 2010 Jun 15;5(6):e11126. doi: 10.1371/journal.pone.0011126.
10
BLAST+: architecture and applications.BLAST+:体系结构与应用。
BMC Bioinformatics. 2009 Dec 15;10:421. doi: 10.1186/1471-2105-10-421.