• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用迁移学习对植物特殊代谢基因进行种内和跨物种预测。

Within- and cross-species predictions of plant specialized metabolism genes using transfer learning.

作者信息

Moore Bethany M, Wang Peipei, Fan Pengxiang, Lee Aaron, Leong Bryan, Lou Yann-Ru, Schenck Craig A, Sugimoto Koichi, Last Robert, Lehti-Shiu Melissa D, Barry Cornelius S, Shiu Shin-Han

机构信息

Department of Plant Biology, Michigan State University, East Lansing, MI, USA.

Ecology, Evolutionary Biology, and Behavior Program, Michigan State University, East Lansing, MI, USA.

出版信息

In Silico Plants. 2020;2(1):diaa005. doi: 10.1093/insilicoplants/diaa005. Epub 2020 Jul 30.

DOI:10.1093/insilicoplants/diaa005
PMID:33344884
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7731531/
Abstract

Plant specialized metabolites mediate interactions between plants and the environment and have significant agronomical/pharmaceutical value. Most genes involved in specialized metabolism (SM) are unknown because of the large number of metabolites and the challenge in differentiating SM genes from general metabolism (GM) genes. Plant models like have extensive, experimentally derived annotations, whereas many non-model species do not. Here we employed a machine learning strategy, transfer learning, where knowledge from is transferred to predict gene functions in cultivated tomato with fewer experimentally annotated genes. The first tomato SM/GM prediction model using only tomato data performs well (-measure = 0.74, compared with 0.5 for random and 1.0 for perfect predictions), but from manually curating 88 SM/GM genes, we found many mis-predicted entries were likely mis-annotated. When the SM/GM prediction models built with data were used to filter out genes where the based model predictions disagreed with tomato annotations, the new tomato model trained with filtered data improved significantly (-measure = 0.92). Our study demonstrates that SM/GM genes can be better predicted by leveraging cross-species information. Additionally, our findings provide an example for transfer learning in genomics where knowledge can be transferred from an information-rich species to an information-poor one.

摘要

植物特化代谢产物介导植物与环境之间的相互作用,具有重要的农学/药学价值。由于代谢产物数量众多,且难以将特化代谢(SM)基因与一般代谢(GM)基因区分开来,参与特化代谢的大多数基因尚不清楚。像[具体植物模型名称未给出]这样的植物模型有广泛的、通过实验得出的注释,而许多非模式物种则没有。在这里,我们采用了一种机器学习策略——迁移学习,即将来自[具体植物模型名称未给出]的知识进行迁移,以预测栽培番茄中基因的功能,而栽培番茄中通过实验注释的基因较少。第一个仅使用番茄数据的番茄SM/GM预测模型表现良好(F值 = 0.74,随机预测为0.5,完美预测为1.0),但通过人工筛选88个SM/GM基因,我们发现许多错误预测的条目可能是注释错误。当使用基于[具体植物模型名称未给出]数据构建的SM/GM预测模型来筛选出基于该模型的预测与番茄注释不一致的基因时,用筛选后的数据训练的新番茄模型有了显著改进(F值 = 0.92)。我们的研究表明,利用跨物种信息可以更好地预测SM/GM基因。此外,我们的研究结果为基因组学中的迁移学习提供了一个范例,即知识可以从信息丰富的物种转移到信息匮乏的物种。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02eb/7731531/c2968d1b4d63/diaa005_fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02eb/7731531/3c1a099145f6/diaa005_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02eb/7731531/82b8edbd3e9b/diaa005_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02eb/7731531/41d9a083ca04/diaa005_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02eb/7731531/2a0be865e4ec/diaa005_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02eb/7731531/2350909d7f48/diaa005_fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02eb/7731531/8daef1adced9/diaa005_fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02eb/7731531/c2968d1b4d63/diaa005_fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02eb/7731531/3c1a099145f6/diaa005_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02eb/7731531/82b8edbd3e9b/diaa005_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02eb/7731531/41d9a083ca04/diaa005_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02eb/7731531/2a0be865e4ec/diaa005_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02eb/7731531/2350909d7f48/diaa005_fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02eb/7731531/8daef1adced9/diaa005_fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02eb/7731531/c2968d1b4d63/diaa005_fig7.jpg

相似文献

1
Within- and cross-species predictions of plant specialized metabolism genes using transfer learning.利用迁移学习对植物特殊代谢基因进行种内和跨物种预测。
In Silico Plants. 2020;2(1):diaa005. doi: 10.1093/insilicoplants/diaa005. Epub 2020 Jul 30.
2
Robust predictions of specialized metabolism genes through machine learning.通过机器学习对特殊代谢基因进行可靠预测。
Proc Natl Acad Sci U S A. 2019 Feb 5;116(6):2344-2353. doi: 10.1073/pnas.1817074116. Epub 2019 Jan 23.
3
Machine learning assists prediction of genes responsible for plant specialized metabolite biosynthesis by integrating multi-omics data.机器学习通过整合多组学数据来辅助预测负责植物特化代谢物生物合成的基因。
BMC Genomics. 2024 Apr 29;25(1):418. doi: 10.1186/s12864-024-10258-6.
4
Genetic variation, environment and demography intersect to shape Arabidopsis defense metabolite variation across Europe.遗传变异、环境和人口统计学因素相互作用,塑造了欧洲拟南芥防御代谢物的变异。
Elife. 2021 May 5;10:e67784. doi: 10.7554/eLife.67784.
5
Prediction of plant-level tomato biomass and yield using machine learning with unmanned aerial vehicle imagery.利用机器学习和无人机图像预测单株番茄生物量和产量
Plant Methods. 2021 Jul 15;17(1):77. doi: 10.1186/s13007-021-00761-2.
6
7
Safety and nutritional assessment of GM plants and derived food and feed: the role of animal feeding trials.转基因植物及其衍生食品和饲料的安全性与营养评估:动物饲养试验的作用
Food Chem Toxicol. 2008 Mar;46 Suppl 1:S2-70. doi: 10.1016/j.fct.2008.02.008. Epub 2008 Feb 13.
8
Functional knowledge transfer for high-accuracy prediction of under-studied biological processes.功能知识转移可实现对研究不足的生物过程的高精度预测。
PLoS Comput Biol. 2013;9(3):e1002957. doi: 10.1371/journal.pcbi.1002957. Epub 2013 Mar 14.
9
Synergistic use of plant-prokaryote comparative genomics for functional annotations.利用植物-原核生物比较基因组学进行功能注释的协同作用。
BMC Genomics. 2011 Jun 15;12 Suppl 1(Suppl 1):S2. doi: 10.1186/1471-2164-12-S1-S2.
10
A Global Coexpression Network Approach for Connecting Genes to Specialized Metabolic Pathways in Plants.一种用于将植物基因与特定代谢途径相连接的全球共表达网络方法。
Plant Cell. 2017 May;29(5):944-959. doi: 10.1105/tpc.17.00009. Epub 2017 Apr 13.

引用本文的文献

1
Gaining insights into epigenetic memories through artificial intelligence and omics science in plants.通过人工智能和植物组学科学深入了解表观遗传记忆。
J Integr Plant Biol. 2025 Sep;67(9):2320-2349. doi: 10.1111/jipb.13953. Epub 2025 Jun 24.
2
Improving plant breeding through AI-supported data integration.通过人工智能支持的数据整合改进植物育种。
Theor Appl Genet. 2025 Jun 2;138(6):132. doi: 10.1007/s00122-025-04910-2.
3
Application of machine learning and genomics for orphan crop improvement.机器学习与基因组学在小众作物改良中的应用。

本文引用的文献

1
Tip of the trichome: evolution of acylsugar metabolic diversity in Solanaceae.毛状体尖端:茄科酰基糖代谢多样性的进化。
Curr Opin Plant Biol. 2019 Jun;49:8-16. doi: 10.1016/j.pbi.2019.03.005. Epub 2019 Apr 19.
2
Robust predictions of specialized metabolism genes through machine learning.通过机器学习对特殊代谢基因进行可靠预测。
Proc Natl Acad Sci U S A. 2019 Feb 5;116(6):2344-2353. doi: 10.1073/pnas.1817074116. Epub 2019 Jan 23.
3
Factors Influencing Gene Family Size Variation Among Related Species in a Plant Family, Solanaceae.
Nat Commun. 2025 Jan 24;16(1):982. doi: 10.1038/s41467-025-56330-x.
4
Integrating machine learning and genome editing for crop improvement.整合机器学习与基因组编辑技术以改良作物。
aBIOTECH. 2024 Feb 29;5(2):262-277. doi: 10.1007/s42994-023-00133-5. eCollection 2024 Jun.
5
A review of artificial intelligence-assisted omics techniques in plant defense: current trends and future directions.植物防御中人工智能辅助组学技术综述:当前趋势与未来方向
Front Plant Sci. 2024 Mar 5;15:1292054. doi: 10.3389/fpls.2024.1292054. eCollection 2024.
6
Arabidopsis as a model for translational research.拟南芥作为转化研究的模型。
Plant Cell. 2025 May 9;37(5). doi: 10.1093/plcell/koae065.
7
Fruity, sticky, stinky, spicy, bitter, addictive, and deadly: evolutionary signatures of metabolic complexity in the Solanaceae.香甜、黏糊、恶臭、辛辣、苦涩、上瘾、致命:茄科植物代谢复杂性的进化特征。
Nat Prod Rep. 2022 Jul 20;39(7):1438-1464. doi: 10.1039/d2np00003b.
8
Using interdisciplinary, phylogeny-guided approaches to understand the evolution of plant metabolism.利用跨学科、系统发育指导的方法来理解植物代谢的演化。
Plant Mol Biol. 2022 Jul;109(4-5):355-367. doi: 10.1007/s11103-021-01220-1. Epub 2021 Nov 23.
9
Machine learning in plant science and plant breeding.植物科学与植物育种中的机器学习
iScience. 2020 Dec 5;24(1):101890. doi: 10.1016/j.isci.2020.101890. eCollection 2021 Jan 22.
10
Evolution of a plant gene cluster in Solanaceae and emergence of metabolic diversity.茄科植物基因簇的进化和代谢多样性的出现。
Elife. 2020 Jul 2;9:e56717. doi: 10.7554/eLife.56717.
影响茄科植物相关物种基因家族大小变异的因素。
Genome Biol Evol. 2018 Oct 1;10(10):2596-2613. doi: 10.1093/gbe/evy193.
4
Parallelization of MAFFT for large-scale multiple sequence alignments.并行化 MAFFT 进行大规模多序列比对。
Bioinformatics. 2018 Jul 15;34(14):2490-2492. doi: 10.1093/bioinformatics/bty121.
5
A Global Coexpression Network Approach for Connecting Genes to Specialized Metabolic Pathways in Plants.一种用于将植物基因与特定代谢途径相连接的全球共表达网络方法。
Plant Cell. 2017 May;29(5):944-959. doi: 10.1105/tpc.17.00009. Epub 2017 Apr 13.
6
Mini tomato genotypes resistant to the silverleaf whitefly and to two-spotted spider mites.对烟粉虱和二斑叶螨具有抗性的微型番茄基因型。
Genet Mol Res. 2017 Mar 22;16(1):gmr-16-01-gmr.16019539. doi: 10.4238/gmr16019539.
7
Genome-Wide Prediction of Metabolic Enzymes, Pathways, and Gene Clusters in Plants.植物中代谢酶、代谢途径和基因簇的全基因组预测
Plant Physiol. 2017 Apr;173(4):2041-2059. doi: 10.1104/pp.16.01942. Epub 2017 Feb 22.
8
Repellent effects of various cherry tomato accessions on the two-spotted spider mite Tetranychus urticae Koch (Acari: Tetranychidae).不同樱桃番茄品种对二斑叶螨(Tetranychus urticae Koch,蜱螨目:叶螨科)的驱避作用
Genet Mol Res. 2016 Mar 24;15(1):gmr7736. doi: 10.4238/gmr.15017736.
9
Single-Cell Metabolite Profiling of Stalk and Glandular Cells of Intact Trichomes with Internal Electrode Capillary Pressure Probe Electrospray Ionization Mass Spectrometry.采用内电极毛细管压力探针电喷雾电离质谱法对完整毛状体的茎细胞和腺细胞进行单细胞代谢物分析。
Anal Chem. 2016 Mar 15;88(6):3049-57. doi: 10.1021/acs.analchem.5b03366. Epub 2016 Feb 29.
10
OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy.OrthoFinder:解决全基因组比较中的基本偏差可显著提高直系同源组推断准确性。
Genome Biol. 2015 Aug 6;16(1):157. doi: 10.1186/s13059-015-0721-2.