• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过强化学习改进真菌中的候选生物合成基因簇。

Improving candidate Biosynthetic Gene Clusters in fungi through reinforcement learning.

机构信息

Departement d'Informatique, UQAM, Montréal, QC H2X 3Y7, Canada.

Centre for Structural and Functional Genomics, Concordia University, Montréal, QC H4B 1R6, Canada.

出版信息

Bioinformatics. 2022 Aug 10;38(16):3984-3991. doi: 10.1093/bioinformatics/btac420.

DOI:10.1093/bioinformatics/btac420
PMID:35762945
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9364373/
Abstract

MOTIVATION

Precise identification of Biosynthetic Gene Clusters (BGCs) is a challenging task. Performance of BGC discovery tools is limited by their capacity to accurately predict components belonging to candidate BGCs, often overestimating cluster boundaries. To support optimizing the composition and boundaries of candidate BGCs, we propose reinforcement learning approach relying on protein domains and functional annotations from expert curated BGCs.

RESULTS

The proposed reinforcement learning method aims to improve candidate BGCs obtained with state-of-the-art tools. It was evaluated on candidate BGCs obtained for two fungal genomes, Aspergillus niger and Aspergillus nidulans. The results highlight an improvement of the gene precision by above 15% for TOUCAN, fungiSMASH and DeepBGC; and cluster precision by above 25% for fungiSMASH and DeepBCG, allowing these tools to obtain almost perfect precision in cluster prediction. This can pave the way of optimizing current prediction of candidate BGCs in fungi, while minimizing the curation effort required by domain experts.

AVAILABILITY AND IMPLEMENTATION

https://github.com/bioinfoUQAM/RL-bgc-components.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

精确识别生物合成基因簇 (BGC) 是一项具有挑战性的任务。BGC 发现工具的性能受到其准确预测属于候选 BGC 的成分的能力的限制,通常会高估簇边界。为了支持优化候选 BGC 的组成和边界,我们提出了一种依赖于专家 curated BGC 中的蛋白质结构域和功能注释的强化学习方法。

结果

所提出的强化学习方法旨在改进使用最先进工具获得的候选 BGC。它在两种真菌基因组(黑曲霉和构巢曲霉)的候选 BGC 上进行了评估。结果突出了 TOUCAN、fungiSMASH 和 DeepBGC 的基因精度提高了 15%以上;fungiSMASH 和 DeepBCG 的簇精度提高了 25%以上,使这些工具能够几乎完美地预测簇。这为优化真菌中当前的候选 BGC 预测铺平了道路,同时最大限度地减少了领域专家所需的注释工作。

可用性和实现

https://github.com/bioinfoUQAM/RL-bgc-components。

补充信息

补充数据可在生物信息学在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b7/9364373/b1de3fb28ab5/btac420f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b7/9364373/2a5c12f16770/btac420f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b7/9364373/2e44e3021ac7/btac420f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b7/9364373/b1de3fb28ab5/btac420f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b7/9364373/2a5c12f16770/btac420f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b7/9364373/2e44e3021ac7/btac420f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b7/9364373/b1de3fb28ab5/btac420f3.jpg

相似文献

1
Improving candidate Biosynthetic Gene Clusters in fungi through reinforcement learning.通过强化学习改进真菌中的候选生物合成基因簇。
Bioinformatics. 2022 Aug 10;38(16):3984-3991. doi: 10.1093/bioinformatics/btac420.
2
Deep Learning to Predict the Biosynthetic Gene Clusters in Bacterial Genomes.深度学习预测细菌基因组中的生物合成基因簇。
J Mol Biol. 2022 Aug 15;434(15):167597. doi: 10.1016/j.jmb.2022.167597. Epub 2022 May 6.
3
Comprehensive curation and analysis of fungal biosynthetic gene clusters of published natural products.已发表天然产物的真菌生物合成基因簇的综合整理与分析
Fungal Genet Biol. 2016 Apr;89:18-28. doi: 10.1016/j.fgb.2016.01.012. Epub 2016 Jan 22.
4
Predicting fungal secondary metabolite activity from biosynthetic gene cluster data using machine learning.基于生物合成基因簇数据利用机器学习预测真菌次生代谢物活性。
Microbiol Spectr. 2024 Feb 6;12(2):e0340023. doi: 10.1128/spectrum.03400-23. Epub 2024 Jan 9.
5
Transcription Factor Repurposing Offers Insights into Evolution of Biosynthetic Gene Cluster Regulation.转录因子再利用为生物合成基因簇调控的进化提供了新视角。
mBio. 2021 Aug 31;12(4):e0139921. doi: 10.1128/mBio.01399-21. Epub 2021 Jul 20.
6
A deep learning genome-mining strategy for biosynthetic gene cluster prediction.深度学习基因组挖掘策略用于生物合成基因簇预测。
Nucleic Acids Res. 2019 Oct 10;47(18):e110. doi: 10.1093/nar/gkz654.
7
TOUCAN: a framework for fungal biosynthetic gene cluster discovery.巨嘴鸟:一种真菌生物合成基因簇发现框架
NAR Genom Bioinform. 2020 Nov 27;2(4):lqaa098. doi: 10.1093/nargab/lqaa098. eCollection 2020 Dec.
8
FunARTS, the Fungal bioActive compound Resistant Target Seeker, an exploration engine for target-directed genome mining in fungi.真菌生物活性化合物抗性靶标搜索器 FunARTS,一种针对真菌中靶标定向基因组挖掘的探索引擎。
Nucleic Acids Res. 2023 Jul 5;51(W1):W191-W197. doi: 10.1093/nar/gkad386.
9
Are there conserved biosynthetic genes in lichens? Genome-wide assessment of terpene biosynthetic genes suggests ubiquitous distribution of the squalene synthase cluster.地衣中是否存在保守的生物合成基因?萜类生物合成基因的全基因组评估表明角鲨烯合酶簇的普遍分布。
BMC Genomics. 2024 Oct 7;25(1):936. doi: 10.1186/s12864-024-10806-0.
10
BiG-SLiCE: A highly scalable tool maps the diversity of 1.2 million biosynthetic gene clusters.BiG-SLiCE:一个高度可扩展的工具,可绘制 120 万个生物合成基因簇的多样性图谱。
Gigascience. 2021 Jan 13;10(1). doi: 10.1093/gigascience/giaa154.

引用本文的文献

1
Analyzing the Challenges and Opportunities Associated With Harnessing New Antibiotics From the Fungal Microbiome.分析从真菌微生物组中获取新型抗生素所面临的挑战与机遇。
Microbiologyopen. 2025 Aug;14(4):e70034. doi: 10.1002/mbo3.70034.
2
Advances in the discovery and study of natural products for biological control applications.用于生物防治应用的天然产物的发现与研究进展。
Nat Prod Rep. 2025 Jun 6. doi: 10.1039/d5np00017c.
3
Exploring the secrets of marine microorganisms: Unveiling secondary metabolites through metagenomics.探索海洋微生物的奥秘:通过宏基因组学揭示次生代谢产物。

本文引用的文献

1
Identification of a Novel Biosynthetic Gene Cluster in Using Comparative Genomics.利用比较基因组学鉴定[具体对象]中的一个新型生物合成基因簇。
J Fungi (Basel). 2021 May 11;7(5):374. doi: 10.3390/jof7050374.
2
antiSMASH 6.0: improving cluster detection and comparison capabilities.antiSMASH 6.0:提高簇检测和比较能力。
Nucleic Acids Res. 2021 Jul 2;49(W1):W29-W35. doi: 10.1093/nar/gkab335.
3
TOUCAN: a framework for fungal biosynthetic gene cluster discovery.巨嘴鸟:一种真菌生物合成基因簇发现框架
Microb Biotechnol. 2024 Aug;17(8):e14533. doi: 10.1111/1751-7915.14533.
4
Enhancing chemical and biological diversity by co-cultivation.通过共培养增强化学和生物多样性。
Front Microbiol. 2023 Feb 1;14:1117559. doi: 10.3389/fmicb.2023.1117559. eCollection 2023.
NAR Genom Bioinform. 2020 Nov 27;2(4):lqaa098. doi: 10.1093/nargab/lqaa098. eCollection 2020 Dec.
4
Diversity of Secondary Metabolism in Aspergillus nidulans Clinical Isolates.《土曲霉临床分离株的次生代谢多样性》。
mSphere. 2020 Apr 8;5(2):e00156-20. doi: 10.1128/mSphere.00156-20.
5
A comparative genomics study of 23 Aspergillus species from section Flavi.23 种黄曲霉属真菌的比较基因组学研究。
Nat Commun. 2020 Feb 27;11(1):1106. doi: 10.1038/s41467-019-14051-y.
6
MIBiG 2.0: a repository for biosynthetic gene clusters of known function.MIBiG 2.0:已知功能的生物合成基因簇的存储库。
Nucleic Acids Res. 2020 Jan 8;48(D1):D454-D458. doi: 10.1093/nar/gkz882.
7
A deep learning genome-mining strategy for biosynthetic gene cluster prediction.深度学习基因组挖掘策略用于生物合成基因簇预测。
Nucleic Acids Res. 2019 Oct 10;47(18):e110. doi: 10.1093/nar/gkz654.
8
Strategies to establish the link between biosynthetic gene clusters and secondary metabolites.建立生物合成基因簇与次生代谢产物之间联系的策略。
Fungal Genet Biol. 2019 Sep;130:107-121. doi: 10.1016/j.fgb.2019.06.001. Epub 2019 Jun 10.
9
Unlocking the trove of metabolic treasures: activating silent biosynthetic gene clusters in bacteria and fungi.解锁代谢宝藏:激活细菌和真菌中沉默的生物合成基因簇。
Curr Opin Microbiol. 2019 Oct;51:9-15. doi: 10.1016/j.mib.2019.03.003. Epub 2019 Apr 15.
10
Fungal secondary metabolism: regulation, function and drug discovery.真菌次生代谢:调控、功能与药物发现。
Nat Rev Microbiol. 2019 Mar;17(3):167-180. doi: 10.1038/s41579-018-0121-1.