• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

巨嘴鸟:一种真菌生物合成基因簇发现框架

TOUCAN: a framework for fungal biosynthetic gene cluster discovery.

作者信息

Almeida Hayda, Palys Sylvester, Tsang Adrian, Diallo Abdoulaye Baniré

机构信息

Departement d'Informatique, UQAM, Montréal, QC, H2X 3Y7, Canada.

Centre for Structural and Functional Genomics, Concordia University, Montréal, QC, H4B 1R6, Canada.

出版信息

NAR Genom Bioinform. 2020 Nov 27;2(4):lqaa098. doi: 10.1093/nargab/lqaa098. eCollection 2020 Dec.

DOI:10.1093/nargab/lqaa098
PMID:33575642
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7694738/
Abstract

Fungal secondary metabolites (SMs) are an important source of numerous bioactive compounds largely applied in the pharmaceutical industry, as in the production of antibiotics and anticancer medications. The discovery of novel fungal SMs can potentially benefit human health. Identifying biosynthetic gene clusters (BGCs) involved in the biosynthesis of SMs can be a costly and complex task, especially due to the genomic diversity of fungal BGCs. Previous studies on fungal BGC discovery present limited scope and can restrict the discovery of new BGCs. In this work, we introduce TOUCAN, a supervised learning framework for fungal BGC discovery. Unlike previous methods, TOUCAN is capable of predicting BGCs on amino acid sequences, facilitating its use on newly sequenced and not yet curated data. It relies on three main pillars: rigorous selection of datasets by BGC experts; combination of functional, evolutionary and compositional features coupled with outperforming classifiers; and robust post-processing methods. TOUCAN best-performing model yields 0.982 -measure on BGC regions in the genome. Overall results show that TOUCAN outperforms previous approaches. TOUCAN focuses on fungal BGCs but can be easily adapted to expand its scope to process other species or include new features.

摘要

真菌次级代谢产物(SMs)是众多生物活性化合物的重要来源,在制药行业有广泛应用,比如抗生素和抗癌药物的生产。新型真菌SMs的发现可能有益于人类健康。识别参与SMs生物合成的生物合成基因簇(BGCs)可能是一项成本高昂且复杂的任务,尤其是由于真菌BGCs的基因组多样性。先前关于真菌BGC发现的研究范围有限,可能会限制新BGCs的发现。在这项工作中,我们引入了TOUCAN,这是一个用于真菌BGC发现的监督学习框架。与先前的方法不同,TOUCAN能够基于氨基酸序列预测BGCs,便于在新测序且尚未整理的数据上使用。它依赖于三个主要支柱:由BGC专家严格选择数据集;结合功能、进化和组成特征以及表现出色的分类器;以及强大的后处理方法。TOUCAN在基因组中的BGC区域上的F1值为0.982。总体结果表明,TOUCAN优于先前的方法。TOUCAN专注于真菌BGCs,但可以轻松调整以扩大其范围,用于处理其他物种或纳入新特征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ad6/7694738/2ecf084281f9/lqaa098fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ad6/7694738/b6bbe28f31d6/lqaa098fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ad6/7694738/2ecf084281f9/lqaa098fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ad6/7694738/b6bbe28f31d6/lqaa098fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ad6/7694738/2ecf084281f9/lqaa098fig2.jpg

相似文献

1
TOUCAN: a framework for fungal biosynthetic gene cluster discovery.巨嘴鸟:一种真菌生物合成基因簇发现框架
NAR Genom Bioinform. 2020 Nov 27;2(4):lqaa098. doi: 10.1093/nargab/lqaa098. eCollection 2020 Dec.
2
Improving candidate Biosynthetic Gene Clusters in fungi through reinforcement learning.通过强化学习改进真菌中的候选生物合成基因簇。
Bioinformatics. 2022 Aug 10;38(16):3984-3991. doi: 10.1093/bioinformatics/btac420.
3
Transcription Factor Repurposing Offers Insights into Evolution of Biosynthetic Gene Cluster Regulation.转录因子再利用为生物合成基因簇调控的进化提供了新视角。
mBio. 2021 Aug 31;12(4):e0139921. doi: 10.1128/mBio.01399-21. Epub 2021 Jul 20.
4
Predicting fungal secondary metabolite activity from biosynthetic gene cluster data using machine learning.基于生物合成基因簇数据利用机器学习预测真菌次生代谢物活性。
Microbiol Spectr. 2024 Feb 6;12(2):e0340023. doi: 10.1128/spectrum.03400-23. Epub 2024 Jan 9.
5
Predicting fungal secondary metabolite activity from biosynthetic gene cluster data using machine learning.利用机器学习从生物合成基因簇数据预测真菌次生代谢产物活性
bioRxiv. 2023 Sep 12:2023.09.12.557468. doi: 10.1101/2023.09.12.557468.
6
Fungal Isocyanide Synthases and Xanthocillin Biosynthesis in Aspergillus fumigatus.烟曲霉中的真菌异氰酸合酶与黄蓝素生物合成。
mBio. 2018 May 29;9(3):e00785-18. doi: 10.1128/mBio.00785-18.
7
Advances and Challenges in CRISPR/Cas-Based Fungal Genome Engineering for Secondary Metabolite Production: A Review.基于CRISPR/Cas的真菌基因组工程在次级代谢产物生产中的进展与挑战:综述
J Fungi (Basel). 2023 Mar 15;9(3):362. doi: 10.3390/jof9030362.
8
FunOrder: A robust and semi-automated method for the identification of essential biosynthetic genes through computational molecular co-evolution.功能订单:一种通过计算分子共进化来鉴定必需生物合成基因的强大且半自动的方法。
PLoS Comput Biol. 2021 Sep 27;17(9):e1009372. doi: 10.1371/journal.pcbi.1009372. eCollection 2021 Sep.
9
Strategy for efficient cloning of biosynthetic gene clusters from fungi.真菌生物合成基因簇高效克隆策略。
Sci China Life Sci. 2019 Aug;62(8):1087-1095. doi: 10.1007/s11427-018-9511-7. Epub 2019 Jun 14.
10
The Architecture of Metabolism Maximizes Biosynthetic Diversity in the Largest Class of Fungi.代谢的结构最大限度地提高了最大类真菌的生物合成多样性。
Mol Biol Evol. 2020 Oct 1;37(10):2838-2856. doi: 10.1093/molbev/msaa122.

引用本文的文献

1
Global survey of secondary metabolism in via activation of specific transcription factors.通过特定转录因子的激活对[具体对象]中次生代谢进行的全球调查。 (注:原文中“in via”表述有误,推测可能是“in [具体对象] via”,这里按纠正后的意思翻译,若原文有准确信息可进一步完善)
PNAS Nexus. 2025 Aug 8;4(8):pgaf249. doi: 10.1093/pnasnexus/pgaf249. eCollection 2025 Aug.
2
Advances in the discovery and study of natural products for biological control applications.用于生物防治应用的天然产物的发现与研究进展。
Nat Prod Rep. 2025 Jun 6. doi: 10.1039/d5np00017c.
3
Metabolic engineering approaches for the biosynthesis of antibiotics.

本文引用的文献

1
A comparative genomics study of 23 Aspergillus species from section Flavi.23 种黄曲霉属真菌的比较基因组学研究。
Nat Commun. 2020 Feb 27;11(1):1106. doi: 10.1038/s41467-019-14051-y.
2
MIBiG 2.0: a repository for biosynthetic gene clusters of known function.MIBiG 2.0:已知功能的生物合成基因簇的存储库。
Nucleic Acids Res. 2020 Jan 8;48(D1):D454-D458. doi: 10.1093/nar/gkz882.
3
A deep learning genome-mining strategy for biosynthetic gene cluster prediction.深度学习基因组挖掘策略用于生物合成基因簇预测。
用于抗生素生物合成的代谢工程方法。
Microb Cell Fact. 2025 Jan 31;24(1):35. doi: 10.1186/s12934-024-02628-2.
4
Microbial secondary metabolites: advancements to accelerate discovery towards application.微生物次级代谢产物:加速从发现到应用进程的进展
Nat Rev Microbiol. 2025 Jun;23(6):338-354. doi: 10.1038/s41579-024-01141-y. Epub 2025 Jan 17.
5
Discovery of fungal onoceroid triterpenoids through domainless enzyme-targeted global genome mining.通过无结构域酶靶向的全基因组挖掘发现真菌角鲨烯三萜。
Nat Commun. 2024 May 21;15(1):4312. doi: 10.1038/s41467-024-48771-7.
6
Fungal BGCs for Production of Secondary Metabolites: Main Types, Central Roles in Strain Improvement, and Regulation According to the Piano Principle.真菌生物合成基因簇用于次生代谢产物的生产:主要类型、在菌株改良中的核心作用,以及根据钢琴原理进行的调控。
Int J Mol Sci. 2023 Jul 6;24(13):11184. doi: 10.3390/ijms241311184.
7
Segregation of the genus (Hypoxylaceae, Xylariales) from by a polyphasic taxonomic approach.通过多相分类学方法将 属(炭角菌科,炭角菌目)从 中分离出来。
MycoKeys. 2023 Feb 20;95:131-162. doi: 10.3897/mycokeys.95.98125. eCollection 2023.
8
Advanced Methods for Natural Products Discovery: Bioactivity Screening, Dereplication, Metabolomics Profiling, Genomic Sequencing, Databases and Informatic Tools, and Structure Elucidation.天然产物发现的先进方法:生物活性筛选、去重复、代谢组学分析、基因组测序、数据库和信息工具以及结构解析。
Mar Drugs. 2023 May 19;21(5):308. doi: 10.3390/md21050308.
9
FunARTS, the Fungal bioActive compound Resistant Target Seeker, an exploration engine for target-directed genome mining in fungi.真菌生物活性化合物抗性靶标搜索器 FunARTS,一种针对真菌中靶标定向基因组挖掘的探索引擎。
Nucleic Acids Res. 2023 Jul 5;51(W1):W191-W197. doi: 10.1093/nar/gkad386.
10
Unlocking the magic in mycelium: Using synthetic biology to optimize filamentous fungi for biomanufacturing and sustainability.揭开菌丝体的神奇之处:利用合成生物学优化丝状真菌用于生物制造和可持续发展。
Mater Today Bio. 2023 Jan 21;19:100560. doi: 10.1016/j.mtbio.2023.100560. eCollection 2023 Apr.
Nucleic Acids Res. 2019 Oct 10;47(18):e110. doi: 10.1093/nar/gkz654.
4
Fungal secondary metabolism: regulation, function and drug discovery.真菌次生代谢:调控、功能与药物发现。
Nat Rev Microbiol. 2019 Mar;17(3):167-180. doi: 10.1038/s41579-018-0121-1.
5
UniProt: a worldwide hub of protein knowledge.UniProt:蛋白质知识的全球枢纽。
Nucleic Acids Res. 2019 Jan 8;47(D1):D506-D515. doi: 10.1093/nar/gky1049.
6
OrthoDB v10: sampling the diversity of animal, plant, fungal, protist, bacterial and viral genomes for evolutionary and functional annotations of orthologs.OrthoDB v10:从动物、植物、真菌、原生生物、细菌和病毒基因组中采样,以进行同源基因的进化和功能注释。
Nucleic Acids Res. 2019 Jan 8;47(D1):D807-D811. doi: 10.1093/nar/gky1053.
7
The Pfam protein families database in 2019.2019 年 Pfam 蛋白质家族数据库。
Nucleic Acids Res. 2019 Jan 8;47(D1):D427-D432. doi: 10.1093/nar/gky995.
8
FunGeneClusterS: Predicting fungal gene clusters from genome and transcriptome data.FunGeneClusterS:从基因组和转录组数据预测真菌基因簇
Synth Syst Biotechnol. 2016 Feb 23;1(2):122-129. doi: 10.1016/j.synbio.2016.01.002. eCollection 2016 Jun.
9
RiPPMiner: a bioinformatics resource for deciphering chemical structures of RiPPs based on prediction of cleavage and cross-links.RiPPMiner:一种基于裂解和交联预测的破译 RiPPs 化学结构的生物信息学资源。
Nucleic Acids Res. 2017 Jul 3;45(W1):W80-W88. doi: 10.1093/nar/gkx408.
10
antiSMASH 4.0-improvements in chemistry prediction and gene cluster boundary identification.antiSMASH 4.0——化学预测和基因簇边界识别的改进。
Nucleic Acids Res. 2017 Jul 3;45(W1):W36-W41. doi: 10.1093/nar/gkx319.