• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于识别泛基因组中核心基因和非必需基因的机器学习方法。

Machine learning approaches to identify core and dispensable genes in pangenomes.

作者信息

Yocca Alan E, Edger Patrick P

机构信息

Dep. of Plant Biology, Michigan State Univ., East Lansing, MI, 48824, USA.

Dep. of Horticulture, Michigan State Univ., East Lansing, MI, 48824, USA.

出版信息

Plant Genome. 2022 Mar;15(1):e20135. doi: 10.1002/tpg2.20135. Epub 2021 Sep 17.

DOI:10.1002/tpg2.20135
PMID:34533282
Abstract

A gene in a given taxonomic group is either present in every individual (core) or absent in at least a single individual (dispensable). Previous pangenomic studies have identified certain functional differences between core and dispensable genes. However, identifying if a gene belongs to the core or dispensable portion of the genome requires the construction of a pangenome, which involves sequencing the genomes of many individuals. Here we aim to leverage the previously characterized core and dispensable gene content for two grass species [Brachypodium distachyon (L.) P. Beauv. and Oryza sativa L.] to construct a machine learning model capable of accurately classifying genes as core or dispensable using only a single annotated reference genome. Such a model may mitigate the need for pangenome construction, an expensive hurdle especially in orphan crops, which often lack the adequate genomic resources.

摘要

在给定的分类群中,一个基因要么存在于每个个体中(核心基因),要么至少在一个个体中不存在(可有可无基因)。先前的泛基因组研究已经确定了核心基因和可有可无基因之间的某些功能差异。然而,要确定一个基因属于基因组的核心部分还是可有可无部分,需要构建一个泛基因组,这涉及对许多个体的基因组进行测序。在这里,我们旨在利用先前表征的两种禾本科植物[短柄草(Brachypodium distachyon (L.) P. Beauv.)和水稻(Oryza sativa L.)]的核心基因和可有可无基因内容,构建一个机器学习模型,该模型仅使用单个注释参考基因组就能准确地将基因分类为核心基因或可有可无基因。这样一个模型可能会减少构建泛基因组的需求,泛基因组构建是一个昂贵的障碍,尤其是在孤儿作物中,这些作物往往缺乏足够的基因组资源。

相似文献

1
Machine learning approaches to identify core and dispensable genes in pangenomes.用于识别泛基因组中核心基因和非必需基因的机器学习方法。
Plant Genome. 2022 Mar;15(1):e20135. doi: 10.1002/tpg2.20135. Epub 2021 Sep 17.
2
Pangenomics Comes of Age: From Bacteria to Plant and Animal Applications.泛基因组学时代的到来:从细菌到动植物的应用。
Trends Genet. 2020 Feb;36(2):132-145. doi: 10.1016/j.tig.2019.11.006. Epub 2019 Dec 24.
3
Crop pangenomes.作物泛基因组
Vavilovskii Zhurnal Genet Selektsii. 2021 Feb;25(1):57-63. doi: 10.18699/VJ21.007.
4
Brachypodium genomics.短柄草基因组学
Int J Plant Genomics. 2008;2008:536104. doi: 10.1155/2008/536104.
5
Pangenomics in crop improvement-from coding structural variations to finding regulatory variants with pangenome graphs.作物改良中的泛基因组学——从编码结构变异到利用泛基因组图谱寻找调控变异
Plant Genome. 2022 Mar;15(1):e20177. doi: 10.1002/tpg2.20177. Epub 2021 Dec 13.
6
A BAC-based physical map of Brachypodium distachyon and its comparative analysis with rice and wheat.拟南芥基物理图谱的构建及其与水稻和小麦的比较分析。
BMC Genomics. 2009 Oct 27;10:496. doi: 10.1186/1471-2164-10-496.
7
Prospects of pan-genomics in barley.大麦泛基因组学的前景。
Theor Appl Genet. 2019 Mar;132(3):785-796. doi: 10.1007/s00122-018-3234-z. Epub 2018 Nov 16.
8
Genome-wide scans of selection highlight the impact of biotic and abiotic constraints in natural populations of the model grass Brachypodium distachyon.全基因组选择扫描突出了生物和非生物限制因素对模式草短柄草自然种群的影响。
Plant J. 2018 Oct;96(2):438-451. doi: 10.1111/tpj.14042. Epub 2018 Sep 1.
9
Gene space dynamics during the evolution of Aegilops tauschii, Brachypodium distachyon, Oryza sativa, and Sorghum bicolor genomes.小麦族物种粗山羊草、二穗短柄草、水稻和高粱基因组进化过程中的基因空间动态。
Mol Biol Evol. 2011 Sep;28(9):2537-47. doi: 10.1093/molbev/msr080. Epub 2011 Apr 6.
10
Comparison of orthologous loci from small grass genomes Brachypodium and rice: implications for wheat genomics and grass genome annotation.小草基因组短柄草属与水稻直系同源基因座的比较:对小麦基因组学和禾本科基因组注释的启示
Plant J. 2007 Feb;49(4):704-17. doi: 10.1111/j.1365-313X.2006.02991.x. Epub 2007 Jan 18.

引用本文的文献

1
Patchy Phylogenetic Distribution and Poor Translational Adaptation of a Nested ORF in the Mammalian Mitochondrial Gene.哺乳动物线粒体基因中一个嵌套开放阅读框的斑驳系统发育分布及较差的翻译适应性
Genes (Basel). 2025 Jul 17;16(7):833. doi: 10.3390/genes16070833.
2
Genomic, Phylogenetic and Physiological Characterization of the PAH-Degrading Strain 135.多环芳烃降解菌株135的基因组、系统发育和生理学特征
Biology (Basel). 2024 May 13;13(5):339. doi: 10.3390/biology13050339.
3
Blueberry and cranberry pangenomes as a resource for future genetic studies and breeding efforts.
蓝莓和蔓越莓泛基因组作为未来基因研究和育种工作的资源。
Hortic Res. 2023 Oct 10;10(11):uhad202. doi: 10.1093/hr/uhad202. eCollection 2023 Nov.
4
Maize Feature Store: A centralized resource to manage and analyze curated maize multi-omics features for machine learning applications.玉米特征库:一个集中式资源,用于管理和分析经过策展的玉米多组学特征,以用于机器学习应用。
Database (Oxford). 2023 Nov 6;2023. doi: 10.1093/database/baad078.
5
The Progression in Developing Genomic Resources for Crop Improvement.用于作物改良的基因组资源开发进展
Life (Basel). 2023 Jul 31;13(8):1668. doi: 10.3390/life13081668.
6
Application of crop wild relatives in modern breeding: An overview of resources, experimental and computational methodologies.作物野生近缘种在现代育种中的应用:资源、实验及计算方法概述
Front Plant Sci. 2022 Nov 17;13:1008904. doi: 10.3389/fpls.2022.1008904. eCollection 2022.
7
and gene family in barley: Identification, characterization and expression analyses.大麦中的基因家族:鉴定、特征分析及表达分析。
Front Plant Sci. 2022 Aug 19;13:964888. doi: 10.3389/fpls.2022.964888. eCollection 2022.