• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于对二倍体和多倍体物种的测序基因分型数据进行无参考基因组多样性分析的强大且高效的软件。

Robust and efficient software for reference-free genomic diversity analysis of genotyping-by-sequencing data on diploid and polyploid species.

作者信息

Parra-Salazar Andrea, Gomez Jorge, Lozano-Arce Daniela, Reyes-Herrera Paula H, Duitama Jorge

机构信息

Department of Systems and Computing Engineering, Universidad de los Andes, Bogotá, Colombia.

Corporación Colombiana de Investigación Agropecuaria (AGROSAVIA), Bogotá, Colombia.

出版信息

Mol Ecol Resour. 2022 Jan;22(1):439-454. doi: 10.1111/1755-0998.13477. Epub 2021 Jul 29.

DOI:10.1111/1755-0998.13477
PMID:34288487
Abstract

Genotyping-by-sequencing (GBS) is a widely used and cost-effective technique for obtaining large numbers of genetic markers from populations by sequencing regions adjacent to restriction cut sites. Although a standard reference-based pipeline can be followed to analyse GBS reads, a reference genome is still not available for a large number of species. Hence, reference-free approaches are required to generate the genetic variability information that can be obtained from a GBS experiment. Unfortunately, available tools to perform de novo analysis of GBS reads face issues of usability, accuracy and performance. Furthermore, few available tools are suitable for analysing data sets from polyploid species. In this manuscript, we describe a novel algorithm to perform reference-free variant detection and genotyping from GBS reads. Nonexact searches on a dynamic hash table of consensus sequences allow for efficient read clustering and sorting. This algorithm was integrated in the Next Generation Sequencing Experience Platform (NGSEP) to integrate the state-of-the-art variant detector already implemented in this tool. We performed benchmark experiments with three different empirical data sets of plants and animals with different population structures and ploidies, and sequenced with different GBS protocols at different read depths. These experiments show that NGSEP has comparable and in some cases better accuracy and always better computational efficiency compared to existing solutions. We expect that this new development will be useful for many research groups conducting population genetic studies in a wide variety of species.

摘要

简化基因组测序(GBS)是一种广泛使用且经济高效的技术,可通过对限制性酶切位点附近区域进行测序,从群体中获取大量遗传标记。尽管可以遵循基于标准参考基因组的流程来分析GBS读数,但仍有大量物种没有可用的参考基因组。因此,需要采用无参考基因组的方法来生成可从GBS实验中获得的遗传变异信息。不幸的是,现有的用于对GBS读数进行从头分析的工具存在可用性、准确性和性能方面的问题。此外,很少有可用工具适用于分析多倍体物种的数据集。在本论文中,我们描述了一种用于从GBS读数中进行无参考基因组变异检测和基因分型的新算法。在共有序列的动态哈希表上进行非精确搜索可实现高效的读数聚类和排序。该算法已集成到下一代测序体验平台(NGSEP)中,以整合该工具中已实现的最先进的变异检测器。我们使用了具有不同群体结构和倍性的植物和动物的三个不同实证数据集进行基准实验,并在不同的读数深度下采用不同的GBS方案进行测序。这些实验表明,与现有解决方案相比,NGSEP具有相当的准确性,在某些情况下准确性更高,并且计算效率始终更高。我们期望这一新进展将对许多在各种物种中进行群体遗传学研究的研究小组有用。

相似文献

1
Robust and efficient software for reference-free genomic diversity analysis of genotyping-by-sequencing data on diploid and polyploid species.用于对二倍体和多倍体物种的测序基因分型数据进行无参考基因组多样性分析的强大且高效的软件。
Mol Ecol Resour. 2022 Jan;22(1):439-454. doi: 10.1111/1755-0998.13477. Epub 2021 Jul 29.
2
Bioinformatic analysis of genotype by sequencing (GBS) data with NGSEP.使用NGSEP对测序基因分型(GBS)数据进行生物信息学分析。
BMC Genomics. 2016 Aug 31;17 Suppl 5(Suppl 5):498. doi: 10.1186/s12864-016-2827-7.
3
UGbS-Flex, a novel bioinformatics pipeline for imputation-free SNP discovery in polyploids without a reference genome: finger millet as a case study.UGbS-Flex,一种新型的生物信息学管道,用于在没有参考基因组的情况下对多倍体进行无插补 SNP 发现:以手指小米为例。
BMC Plant Biol. 2018 Jun 15;18(1):117. doi: 10.1186/s12870-018-1316-3.
4
A comparison of genotyping-by-sequencing analysis methods on low-coverage crop datasets shows advantages of a new workflow, GB-eaSy.对低覆盖作物数据集的测序分析方法的比较表明,一种新的工作流程 GB-eaSy 具有优势。
BMC Bioinformatics. 2017 Dec 28;18(1):586. doi: 10.1186/s12859-017-2000-6.
5
NGSEP3: accurate variant calling across species and sequencing protocols.NGSEP3:跨物种和测序协议的准确变异调用。
Bioinformatics. 2019 Nov 1;35(22):4716-4723. doi: 10.1093/bioinformatics/btz275.
6
GBS-SNP-CROP: a reference-optional pipeline for SNP discovery and plant germplasm characterization using variable length, paired-end genotyping-by-sequencing data.GBS-SNP-CROP:一种用于单核苷酸多态性(SNP)发现和植物种质特征分析的无参考序列流程,使用可变长度的双端测序基因分型数据。
BMC Bioinformatics. 2016 Jan 12;17:29. doi: 10.1186/s12859-016-0879-y.
7
Sequence coverage required for accurate genotyping by sequencing in polyploid species.多倍体物种中通过测序进行准确基因分型所需的序列覆盖度。
Mol Ecol Resour. 2022 May;22(4):1417-1426. doi: 10.1111/1755-0998.13558. Epub 2021 Dec 20.
8
Haplotag: Software for Haplotype-Based Genotyping-by-Sequencing Analysis.Haplotag:用于基于单倍型的测序基因分型分析的软件。
G3 (Bethesda). 2016 Apr 7;6(4):857-63. doi: 10.1534/g3.115.024596.
9
Mining sequence variations in representative polyploid sugarcane germplasm accessions.挖掘代表性多倍体甘蔗种质资源中的序列变异。
BMC Genomics. 2017 Aug 9;18(1):594. doi: 10.1186/s12864-017-3980-3.
10
One Size Doesn't Fit All - RefEditor: Building Personalized Diploid Reference Genome to Improve Read Mapping and Genotype Calling in Next Generation Sequencing Studies.一刀切并不适用——RefEditor:构建个性化二倍体参考基因组以改善下一代测序研究中的读段映射和基因型调用
PLoS Comput Biol. 2015 Aug 12;11(8):e1004448. doi: 10.1371/journal.pcbi.1004448. eCollection 2015 Aug.

引用本文的文献

1
Data reuse in agricultural genomics research: challenges and recommendations.农业基因组学研究中的数据重用:挑战与建议。
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giae106.
2
Genetic diversity of Anadara tuberculosa in two localities of the Colombian Pacific Coast.哥伦比亚太平洋沿岸两个地区的泥蚶遗传多样性。
Sci Rep. 2024 Nov 18;14(1):28467. doi: 10.1038/s41598-024-78869-3.
3
Phylogenetic relationships and genetic diversity of the Korean endemic Phedimus latiovalifolius (Crassulaceae) and its close relatives.朝鲜特有种宽叶瓦松(景天科)及其近缘种的系统发育关系和遗传多样性。
Sci Rep. 2024 Jul 15;14(1):16255. doi: 10.1038/s41598-024-63272-9.
4
Coral microbiomes are structured by environmental gradients in deep waters.珊瑚微生物群落在深水中由环境梯度构成。
Environ Microbiome. 2024 Jun 10;19(1):38. doi: 10.1186/s40793-024-00579-0.
5
Whole-genome resequencing-based characterization of a durum wheat landrace showing similarity to 'Senatore Cappelli'.基于全基因组重测序的杜伦小麦地方品种的特征分析,该品种与“Senatore Cappelli”相似。
PLoS One. 2023 Sep 21;18(9):e0291430. doi: 10.1371/journal.pone.0291430. eCollection 2023.
6
MultiGWAS: An integrative tool for Genome Wide Association Studies in tetraploid organisms.MultiGWAS:一种用于四倍体生物全基因组关联研究的整合工具。
Ecol Evol. 2021 May 12;11(12):7411-7426. doi: 10.1002/ece3.7572. eCollection 2021 Jun.