• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GAEP:一个全面的基因组组装评估管道。

GAEP: a comprehensive genome assembly evaluating pipeline.

机构信息

Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong 518120, China.

State Key Laboratory of Rice Biology and Breeding, China National Rice Research Institute, Chinese Academy of Agricultural Sciences, Hangzhou, Zhejiang 311401, China.

出版信息

J Genet Genomics. 2023 Oct;50(10):747-754. doi: 10.1016/j.jgg.2023.05.009. Epub 2023 May 26.

DOI:10.1016/j.jgg.2023.05.009
PMID:37245652
Abstract

With the rapid development of sequencing technologies, especially the maturity of third-generation sequencing technologies, there has been a significant increase in the number and quality of published genome assemblies. The emergence of these high-quality genomes has raised higher requirements for genome evaluation. Although numerous computational methods have been developed to evaluate assembly quality from various perspectives, the selective use of these evaluation methods can be arbitrary and inconvenient for fairly comparing the assembly quality. To address this issue, we have developed the Genome Assembly Evaluating Pipeline (GAEP), which provides a comprehensive assessment pipeline for evaluating genome quality from multiple perspectives, including continuity, completeness, and correctness. Additionally, GAEP includes new functions for detecting misassemblies and evaluating the assembly redundancy, which performs well in our testing. GAEP is publicly available at https://github.com/zy-optimistic/GAEP under the GPL3.0 License. With GAEP, users can quickly obtain accurate and reliable evaluation results, facilitating the comparison and selection of high-quality genome assemblies.

摘要

随着测序技术的快速发展,特别是第三代测序技术的成熟,已发布的基因组组装数量和质量都有了显著提高。这些高质量基因组的出现对基因组评估提出了更高的要求。虽然已经开发了许多从不同角度评估组装质量的计算方法,但这些评估方法的选择性使用可能是任意的,并且不方便对组装质量进行公平比较。为了解决这个问题,我们开发了基因组组装评估管道(GAEP),它提供了一个从多个角度评估基因组质量的综合评估管道,包括连续性、完整性和正确性。此外,GAEP 还包括用于检测组装错误和评估组装冗余的新功能,在我们的测试中表现良好。GAEP 可在 https://github.com/zy-optimistic/GAEP 上以 GPL3.0 许可证获得。使用 GAEP,用户可以快速获得准确可靠的评估结果,方便对高质量基因组组装进行比较和选择。

相似文献

1
GAEP: a comprehensive genome assembly evaluating pipeline.GAEP:一个全面的基因组组装评估管道。
J Genet Genomics. 2023 Oct;50(10):747-754. doi: 10.1016/j.jgg.2023.05.009. Epub 2023 May 26.
2
AssemblyQC: a Nextflow pipeline for reproducible reporting of assembly quality.组装质量控制 (AssemblyQC):用于可重复报告组装质量的 Nextflow 管道。
Bioinformatics. 2024 Aug 2;40(8). doi: 10.1093/bioinformatics/btae477.
3
scanPAV: a pipeline for extracting presence-absence variations in genome pairs.scanPAV:用于提取基因组对中存在-缺失变异的管道。
Bioinformatics. 2018 Sep 1;34(17):3022-3024. doi: 10.1093/bioinformatics/bty189.
4
GenomeQC: a quality assessment tool for genome assemblies and gene structure annotations.基因组 QC:基因组组装和基因结构注释的质量评估工具。
BMC Genomics. 2020 Mar 2;21(1):193. doi: 10.1186/s12864-020-6568-2.
5
Subset selection of high-depth next generation sequencing reads for de novo genome assembly using MapReduce framework.使用MapReduce框架进行从头基因组组装时对高深度下一代测序读数的子集选择。
BMC Genomics. 2015;16 Suppl 12(Suppl 12):S9. doi: 10.1186/1471-2164-16-S12-S9. Epub 2015 Dec 9.
6
HaploMerger2: rebuilding both haploid sub-assemblies from high-heterozygosity diploid genome assembly.HaploMerger2:从高杂合度二倍体基因组组装中重建两个单倍体亚组装体。
Bioinformatics. 2017 Aug 15;33(16):2577-2579. doi: 10.1093/bioinformatics/btx220.
7
GCI: a continuity inspector for complete genome assembly.GCI:用于完整基因组组装的连续性检查器。
Bioinformatics. 2024 Nov 1;40(11). doi: 10.1093/bioinformatics/btae633.
8
SQUAT: a Sequencing Quality Assessment Tool for data quality assessments of genome assemblies.SQUAT:用于基因组组装数据质量评估的测序质量评估工具。
BMC Genomics. 2019 Apr 18;19(Suppl 9):238. doi: 10.1186/s12864-019-5445-3.
9
Genome sequence assembly algorithms and misassembly identification methods.基因组序列组装算法和错误组装识别方法。
Mol Biol Rep. 2022 Nov;49(11):11133-11148. doi: 10.1007/s11033-022-07919-8. Epub 2022 Sep 23.
10
ARAMIS: From systematic errors of NGS long reads to accurate assemblies.ARAMIS:从 NGS 长读的系统误差到精确组装。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab170.

引用本文的文献

1
Genome Evaluation Pipeline (GEP): a fully automated quality control tool for parallel evaluation of genome assemblies.基因组评估流程(GEP):一种用于并行评估基因组组装的全自动质量控制工具。
Bioinform Adv. 2025 Jun 26;5(1):vbaf147. doi: 10.1093/bioadv/vbaf147. eCollection 2025.
2
A draft genome assembly for the dart-poison frog .箭毒蛙的基因组组装草图。
GigaByte. 2025 Jun 20;2025:gigabyte157. doi: 10.46471/gigabyte.157. eCollection 2025.
3
CloseRead: a tool for assessing assembly errors in immunoglobulin loci applied to vertebrate long-read genome assemblies.
CloseRead:一种用于评估免疫球蛋白基因座装配错误的工具,应用于脊椎动物长读长基因组装配。
Genome Biol. 2025 May 20;26(1):131. doi: 10.1186/s13059-025-03594-7.
4
Chromosome-scale assemblies of three Ormosia species: repetitive sequences distribution and structural rearrangement.三种红豆属植物的染色体水平组装:重复序列分布与结构重排
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf047.
5
Klumpy: A tool to evaluate the integrity of long-read genome assemblies and illusive sequence motifs.Klumpy:一种评估长读长基因组组装完整性和难以捉摸的序列基序的工具。
Mol Ecol Resour. 2025 Jan;25(1):e13982. doi: 10.1111/1755-0998.13982. Epub 2024 May 27.
6
Genome assembly of autotetraploid Actinidia arguta highlights adaptive evolution and enables dissection of important economic traits.猕猴桃基因组组装揭示了其适应进化,并为重要经济性状的解析提供了可能。
Plant Commun. 2024 Jun 10;5(6):100856. doi: 10.1016/j.xplc.2024.100856. Epub 2024 Mar 2.
7
Genomic insights into biased allele loss and increased gene numbers after genome duplication in autotetraploid Cyclocarya paliurus.基因组分析揭示了同源四倍体青钱柳在基因组加倍后偏性等位基因丢失和基因数量增加的机制。
BMC Biol. 2023 Aug 8;21(1):168. doi: 10.1186/s12915-023-01668-1.