• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Assemblathon2:在三个脊椎动物物种中评估从头组装基因组方法。

Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species.

机构信息

Genome Center, UC, Davis, CA 95616, USA.

出版信息

Gigascience. 2013 Jul 22;2(1):10. doi: 10.1186/2047-217X-2-10.

DOI:10.1186/2047-217X-2-10
PMID:23870653
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3844414/
Abstract

BACKGROUND

The process of generating raw genome sequence data continues to become cheaper, faster, and more accurate. However, assembly of such data into high-quality, finished genome sequences remains challenging. Many genome assembly tools are available, but they differ greatly in terms of their performance (speed, scalability, hardware requirements, acceptance of newer read technologies) and in their final output (composition of assembled sequence). More importantly, it remains largely unclear how to best assess the quality of assembled genome sequences. The Assemblathon competitions are intended to assess current state-of-the-art methods in genome assembly.

RESULTS

In Assemblathon 2, we provided a variety of sequence data to be assembled for three vertebrate species (a bird, a fish, and snake). This resulted in a total of 43 submitted assemblies from 21 participating teams. We evaluated these assemblies using a combination of optical map data, Fosmid sequences, and several statistical methods. From over 100 different metrics, we chose ten key measures by which to assess the overall quality of the assemblies.

CONCLUSIONS

Many current genome assemblers produced useful assemblies, containing a significant representation of their genes and overall genome structure. However, the high degree of variability between the entries suggests that there is still much room for improvement in the field of genome assembly and that approaches which work well in assembling the genome of one species may not necessarily work well for another.

摘要

背景

生成原始基因组序列数据的过程继续变得更便宜、更快、更准确。然而,将这些数据组装成高质量的完成基因组序列仍然具有挑战性。有许多基因组组装工具可用,但它们在性能(速度、可扩展性、硬件要求、对较新技术的接受程度)和最终输出(组装序列的组成)方面有很大的不同。更重要的是,如何最好地评估组装基因组序列的质量在很大程度上仍不清楚。Assemblathon 竞赛旨在评估基因组组装的当前最先进方法。

结果

在 Assemblathon 2 中,我们提供了多种序列数据供三种脊椎动物物种(鸟类、鱼类和蛇类)进行组装。这总共产生了来自 21 个参赛团队的 43 个提交的组装结果。我们使用光学图谱数据、Fosmid 序列和几种统计方法来评估这些组装结果。在 100 多种不同的指标中,我们选择了十个关键指标来评估组装的整体质量。

结论

许多当前的基因组组装器生成了有用的组装结果,其中包含了其基因和整体基因组结构的重要代表。然而,这些条目之间高度的可变性表明,在基因组组装领域仍有很大的改进空间,并且在组装一种物种的基因组方面表现良好的方法不一定在另一种物种上也能很好地发挥作用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/287b91b1d41f/2047-217X-2-10-21.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/bef3791d5503/2047-217X-2-10-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/6a7c8f76a07b/2047-217X-2-10-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/b8c8ca0ad65c/2047-217X-2-10-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/3654ee7ac901/2047-217X-2-10-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/a271f7a273ad/2047-217X-2-10-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/cab1077a98c4/2047-217X-2-10-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/c753204bc207/2047-217X-2-10-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/5c9174965e28/2047-217X-2-10-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/5e5846749f80/2047-217X-2-10-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/ad58dda544df/2047-217X-2-10-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/e1e9025e2627/2047-217X-2-10-11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/2de4b1e6917e/2047-217X-2-10-12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/9a17c661033c/2047-217X-2-10-13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/c122593d64a4/2047-217X-2-10-14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/896c254de24d/2047-217X-2-10-15.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/f5ad6146ee1c/2047-217X-2-10-16.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/ae9112692be0/2047-217X-2-10-17.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/ce322a712b1e/2047-217X-2-10-18.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/66861e0b86a7/2047-217X-2-10-19.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/e04d9515beb6/2047-217X-2-10-20.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/287b91b1d41f/2047-217X-2-10-21.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/bef3791d5503/2047-217X-2-10-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/6a7c8f76a07b/2047-217X-2-10-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/b8c8ca0ad65c/2047-217X-2-10-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/3654ee7ac901/2047-217X-2-10-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/a271f7a273ad/2047-217X-2-10-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/cab1077a98c4/2047-217X-2-10-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/c753204bc207/2047-217X-2-10-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/5c9174965e28/2047-217X-2-10-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/5e5846749f80/2047-217X-2-10-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/ad58dda544df/2047-217X-2-10-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/e1e9025e2627/2047-217X-2-10-11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/2de4b1e6917e/2047-217X-2-10-12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/9a17c661033c/2047-217X-2-10-13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/c122593d64a4/2047-217X-2-10-14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/896c254de24d/2047-217X-2-10-15.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/f5ad6146ee1c/2047-217X-2-10-16.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/ae9112692be0/2047-217X-2-10-17.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/ce322a712b1e/2047-217X-2-10-18.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/66861e0b86a7/2047-217X-2-10-19.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/e04d9515beb6/2047-217X-2-10-20.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec0/3844414/287b91b1d41f/2047-217X-2-10-21.jpg

相似文献

1
Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species.Assemblathon2:在三个脊椎动物物种中评估从头组装基因组方法。
Gigascience. 2013 Jul 22;2(1):10. doi: 10.1186/2047-217X-2-10.
2
De novo likelihood-based measures for comparing genome assemblies.用于比较基因组组装的基于从头似然性的度量
BMC Res Notes. 2013 Aug 22;6:334. doi: 10.1186/1756-0500-6-334.
3
Assemblathon 1: a competitive assessment of de novo short read assembly methods.Assemblathon 1:从头开始的短读序列组装方法的竞争性评估。
Genome Res. 2011 Dec;21(12):2224-41. doi: 10.1101/gr.126599.111. Epub 2011 Sep 16.
4
Fragmentation and Coverage Variation in Viral Metagenome Assemblies, and Their Effect in Diversity Calculations.病毒宏基因组组装中的碎片化和覆盖度变化,及其对多样性计算的影响。
Front Bioeng Biotechnol. 2015 Sep 17;3:141. doi: 10.3389/fbioe.2015.00141. eCollection 2015.
5
Evaluation of genome scaffolding tools using pooled clone sequencing.使用混合克隆测序评估基因组支架工具。
Turk J Biol. 2018 Dec 10;42(6):471-476. doi: 10.3906/biy-1805-42. eCollection 2018.
6
High-coverage sequencing and annotated assemblies of the budgerigar genome.高覆盖度测序和虎皮鹦鹉基因组注释组装。
Gigascience. 2014 Jul 8;3:11. doi: 10.1186/2047-217X-3-11. eCollection 2014.
7
The complex task of choosing a de novo assembly: lessons from fungal genomes.选择从头组装的复杂任务:来自真菌基因组的经验教训。
Comput Biol Chem. 2014 Dec;53 Pt A:97-107. doi: 10.1016/j.compbiolchem.2014.08.014. Epub 2014 Aug 29.
8
Identifying wrong assemblies in de novo short read primary sequence assembly contigs.在从头短读长初级序列组装重叠群中识别错误的组装。
J Biosci. 2016 Sep;41(3):455-74. doi: 10.1007/s12038-016-9630-0.
9
Benchmarking of de novo assembly algorithms for Nanopore data reveals optimal performance of OLC approaches.用于纳米孔数据的从头组装算法基准测试揭示了重叠布局一致(OLC)方法的最佳性能。
BMC Genomics. 2016 Aug 22;17 Suppl 7(Suppl 7):507. doi: 10.1186/s12864-016-2895-8.
10
Positional bias in variant calls against draft reference assemblies.针对草图参考基因组组装的变异位点调用中的位置偏差。
BMC Genomics. 2017 Mar 28;18(1):263. doi: 10.1186/s12864-017-3637-2.

引用本文的文献

1
Chromosome-level assembly of the 400-year-old Goethe's Palm (Chamaerops humilis L.).400岁歌德棕(矮棕)的染色体水平组装
Sci Data. 2025 Sep 10;12(1):1542. doi: 10.1038/s41597-025-05673-7.
2
Using reinforcement learning in genome assembly: in-depth analysis of a Q-learning assembler.基因组组装中强化学习的应用:Q学习组装器的深入分析
Front Bioinform. 2025 Aug 20;5:1633623. doi: 10.3389/fbinf.2025.1633623. eCollection 2025.
3
New Genome Assemblies for Poeciliidae: A Foundation for Adaptation Studies.花鳉科的新基因组组装:适应性研究的基础

本文引用的文献

1
High-coverage sequencing and annotated assemblies of the budgerigar genome.高覆盖度测序和虎皮鹦鹉基因组注释组装。
Gigascience. 2014 Jul 8;3:11. doi: 10.1186/2047-217X-3-11. eCollection 2014.
2
Why assembling plant genome sequences is so challenging.为什么组装植物基因组序列如此具有挑战性。
Biology (Basel). 2012 Sep 18;1(2):439-59. doi: 10.3390/biology1020439.
3
REAPR: a universal tool for genome assembly evaluation.REAPR:一种用于基因组组装评估的通用工具。
Genome Biol Evol. 2025 May 30;17(6). doi: 10.1093/gbe/evaf111.
4
Characterisation of the Gillenia S-locus provides insight into evolution of the nonself-recognition self-incompatibility system in apple.吉列尼花S位点的特征分析为苹果非自我识别自交不亲和系统的进化提供了见解。
Sci Rep. 2025 Apr 26;15(1):14630. doi: 10.1038/s41598-025-99335-8.
5
Parameterizing Pantherinae: De Novo Mutation Rate Estimates from Panthera and Neofelis Pedigrees.豹亚科的参数化:基于豹属和云豹属谱系的新生突变率估计
Genome Biol Evol. 2025 Apr 3;17(4). doi: 10.1093/gbe/evaf060.
6
Mechanisms underlying the loss of migratory behaviour in a long-lived bird.一种长寿鸟类迁徙行为丧失的潜在机制。
J Anim Ecol. 2025 May;94(5):1061-1075. doi: 10.1111/1365-2656.70035. Epub 2025 Apr 2.
7
A detailed guide to assessing genome assembly based on long-read sequencing data using Inspector.使用Inspector基于长读长测序数据评估基因组组装的详细指南。
Nat Protoc. 2025 Mar 26. doi: 10.1038/s41596-025-01149-5.
8
Haplotype-resolved genome assembly of Ficus carica L. reveals allele-specific expression in the fruit.无花果单倍型解析基因组组装揭示果实中的等位基因特异性表达。
Plant J. 2025 Feb;121(4):e70012. doi: 10.1111/tpj.70012.
9
Selection on the vascular-remodeling gene is associated with altitudinal adaptation in an insular lizard.对血管重塑基因的选择与一种岛屿蜥蜴的海拔适应性相关。
Evol Lett. 2024 Sep 22;9(1):41-50. doi: 10.1093/evlett/qrae047. eCollection 2025 Feb.
10
Maptcha: an efficient parallel workflow for hybrid genome scaffolding.Maptcha:一种用于混合基因组支架构建的高效并行工作流程。
BMC Bioinformatics. 2024 Aug 8;25(1):263. doi: 10.1186/s12859-024-05878-4.
Genome Biol. 2013 May 27;14(5):R47. doi: 10.1186/gb-2013-14-5-r47.
4
SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler.SOAPdenovo2:一种经验丰富的、内存效率高的短读长从头组装器。
Gigascience. 2012 Dec 27;1(1):18. doi: 10.1186/2047-217X-1-18.
5
Reevaluating assembly evaluations with feature response curves: GAGE and assemblathons.重新评估装配评估的特征响应曲线:GAGE 和装配竞赛。
PLoS One. 2012;7(12):e52210. doi: 10.1371/journal.pone.0052210. Epub 2012 Dec 28.
6
Scaffolding low quality genomes using orthologous protein sequences.利用直系同源蛋白序列构建低质量基因组支架。
Bioinformatics. 2013 Jan 15;29(2):160-5. doi: 10.1093/bioinformatics/bts661. Epub 2012 Nov 18.
7
Normalized N50 assembly metric using gap-restricted co-linear chaining.使用带间隙限制的共线性链接进行标准化 N50 组装度量。
BMC Bioinformatics. 2012 Oct 3;13:255. doi: 10.1186/1471-2105-13-255.
8
Telescoper: de novo assembly of highly repetitive regions.望远镜:高度重复区域的从头组装。
Bioinformatics. 2012 Sep 15;28(18):i311-i317. doi: 10.1093/bioinformatics/bts399.
9
Hybrid error correction and de novo assembly of single-molecule sequencing reads.单分子测序reads 的混合纠错与从头组装。
Nat Biotechnol. 2012 Jul 1;30(7):693-700. doi: 10.1038/nbt.2280.
10
Next-generation sequencing and large genome assemblies.下一代测序和大型基因组组装。
Pharmacogenomics. 2012 Jun;13(8):901-15. doi: 10.2217/pgs.12.72.