Suppr超能文献

泛基因组分析(Pandagma):一种用于识别泛基因集和基因家族的工具,可以在期望的进化深度下使用,并适应全基因组复制。

Pandagma: a tool for identifying pan-gene sets and gene families at desired evolutionary depths and accommodating whole-genome duplications.

机构信息

USDA-Agricultural Research Service, Corn Insects and Crop Genetics Research Unit, 819 Wallace Rd., Ames, IA 50011, United States.

ORISE Fellow, USDA-Agricultural Research Service, Corn Insects and Crop Genetics Research Unit, 819 Wallace Rd., Ames, IA 50011, United States.

出版信息

Bioinformatics. 2024 Sep 2;40(9). doi: 10.1093/bioinformatics/btae526.

Abstract

SUMMARY

Identification of allelic or corresponding genes (pan-genes) within a species or genus is important for discovery of biologically significant genetic conservation and variation. Similarly, identification of orthologs (gene families) across wider evolutionary distances is important for understanding the genetic basis for similar or differing traits. Especially in plants, several complications make identification of pan-genes and gene families challenging, including whole-genome duplications, evolutionary rate differences among lineages, and varying qualities of assemblies and annotations. Here, we document and distribute a set of workflows that we have used to address these problems.

RESULTS

Pandagma is a set of configurable workflows for identifying and comparing pan-gene sets and gene families for annotation sets from eukaryotic genomes, using a combination of homology, synteny, and expected rates of synonymous change in coding sequence.

AVAILABILITY AND IMPLEMENTATION

The Pandagma workflows, example configurations, implementation details, and scripts for retrieving public datasets, are available at https://github.com/legumeinfo/pandagma.

摘要

摘要

在一个物种或属内鉴定等位基因或相应基因(泛基因)对于发现具有生物学意义的遗传保守性和变异性很重要。同样,在更广泛的进化距离上鉴定同源基因(基因家族)对于理解相似或不同特征的遗传基础也很重要。特别是在植物中,由于全基因组加倍、谱系间进化率差异以及组装和注释质量的不同,使得泛基因和基因家族的鉴定变得具有挑战性。在这里,我们记录并分发了一组我们用于解决这些问题的工作流程。

结果

Pandagma 是一组可配置的工作流程,用于使用同源性、共线性和编码序列中同义突变的预期速率的组合,识别和比较真核基因组注释集的泛基因集和基因家族。

可用性和实现

Pandagma 工作流程、示例配置、实现细节以及用于检索公共数据集的脚本可在 https://github.com/legumeinfo/pandagma 上获得。

相似文献

3
COGNATE: comparative gene annotation characterizer.同源物:比较基因注释特征分析器。
BMC Genomics. 2017 Jul 17;18(1):535. doi: 10.1186/s12864-017-3870-8.
9
zDB: bacterial comparative genomics made easy.zDB:轻松进行细菌比较基因组学研究。
mSystems. 2024 Jul 23;9(7):e0047324. doi: 10.1128/msystems.00473-24. Epub 2024 Jun 28.

本文引用的文献

5
Plant genome sequences: past, present, future.植物基因组序列:过去、现在和未来。
Curr Opin Plant Biol. 2019 Apr;48:1-8. doi: 10.1016/j.pbi.2018.11.001. Epub 2018 Dec 19.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验