• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

下一代基因组序列组装的局限性。

Limitations of next-generation genome sequence assembly.

机构信息

Department of Genome Sciences, University of Washington School of Medicine and Howard Hughes Medical Institute, Seattle, Washington, USA.

出版信息

Nat Methods. 2011 Jan;8(1):61-5. doi: 10.1038/nmeth.1527. Epub 2010 Nov 21.

DOI:10.1038/nmeth.1527
PMID:21102452
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3115693/
Abstract

High-throughput sequencing technologies promise to transform the fields of genetics and comparative biology by delivering tens of thousands of genomes in the near future. Although it is feasible to construct de novo genome assemblies in a few months, there has been relatively little attention to what is lost by sole application of short sequence reads. We compared the recent de novo assemblies using the short oligonucleotide analysis package (SOAP), generated from the genomes of a Han Chinese individual and a Yoruban individual, to experimentally validated genomic features. We found that de novo assemblies were 16.2% shorter than the reference genome and that 420.2 megabase pairs of common repeats and 99.1% of validated duplicated sequences were missing from the genome. Consequently, over 2,377 coding exons were completely missing. We conclude that high-quality sequencing approaches must be considered in conjunction with high-throughput sequencing for comparative genomics analyses and studies of genome evolution.

摘要

高通量测序技术有望在不久的将来为数万基因组的测序提供便利,从而彻底改变遗传学和比较生物学领域。虽然在短短几个月内构建从头基因组组装是可行的,但人们对单纯应用短序列读段所导致的信息缺失关注甚少。我们将基于短寡核苷酸分析包(SOAP)生成的中国汉族个体和非洲约鲁巴个体的全基因组从头组装与实验验证的基因组特征进行了比较。我们发现,从头组装的基因组比参考基因组短 16.2%,并且有 420.2 Mb 的常见重复序列和 99.1%的已验证重复序列缺失。结果,超过 2377 个编码外显子完全缺失。我们得出的结论是,在进行比较基因组学分析和研究基因组进化时,必须结合高质量测序方法考虑高通量测序。

相似文献

1
Limitations of next-generation genome sequence assembly.下一代基因组序列组装的局限性。
Nat Methods. 2011 Jan;8(1):61-5. doi: 10.1038/nmeth.1527. Epub 2010 Nov 21.
2
The complete genome of an individual by massively parallel DNA sequencing.通过大规模平行DNA测序获得个体的完整基因组。
Nature. 2008 Apr 17;452(7189):872-6. doi: 10.1038/nature06884.
3
Personal genomes: Standard and pores.个人基因组:标准与孔隙。
Nature. 2008 Nov 6;456(7218):23-5. doi: 10.1038/456023a.
4
Anchored pseudo-de novo assembly of human genomes identifies extensive sequence variation from unmapped sequence reads.人类基因组的锚定伪从头组装可从未映射的序列读取中识别出广泛的序列变异。
Hum Genet. 2016 Jul;135(7):727-40. doi: 10.1007/s00439-016-1667-5. Epub 2016 Apr 9.
5
Highly accurate long reads are crucial for realizing the potential of biodiversity genomics.高质量的长读长序列对于实现生物多样性基因组学的潜力至关重要。
BMC Genomics. 2023 Mar 16;24(1):117. doi: 10.1186/s12864-023-09193-9.
6
Misassembly of long reads undermines de novo-assembled ethnicity-specific genomes: validation in a Chinese Han population.长读段的组装错误会破坏从头组装的特定族群基因组:在中国汉族人群中的验证。
Hum Genet. 2019 Jul;138(7):757-769. doi: 10.1007/s00439-019-02032-6. Epub 2019 Jun 5.
7
China buys US sequencing firm.中国收购美国测序公司。
Nature. 2012 Sep 27;489(7417):485-6. doi: 10.1038/489485a.
8
AlignGraph: algorithm for secondary de novo genome assembly guided by closely related references.AlignGraph:一种基于密切相关参考序列指导的二级从头基因组组装算法。
Bioinformatics. 2014 Jun 15;30(12):i319-i328. doi: 10.1093/bioinformatics/btu291.
9
Technology: The $1,000 genome.技术:千美元基因组。
Nature. 2014 Mar 20;507(7492):294-5. doi: 10.1038/507294a.
10
Genetic variation and the de novo assembly of human genomes.人类基因组的遗传变异与从头组装
Nat Rev Genet. 2015 Nov;16(11):627-40. doi: 10.1038/nrg3933. Epub 2015 Oct 7.

引用本文的文献

1
A Multi-Omics Perspective on : From Genomics to Future Directions.关于:从基因组学到未来方向的多组学视角。
Int J Mol Sci. 2025 Aug 28;26(17):8343. doi: 10.3390/ijms26178343.
2
Chromosome level de Novo hybrid assembly of Asian honeybee, Apis cerana Koreana.亚洲蜜蜂(Apis cerana Koreana)的染色体水平从头杂交组装
Sci Rep. 2025 Jul 24;15(1):26912. doi: 10.1038/s41598-025-12338-3.
3
CGC1, a new reference genome for .CGC1,一个用于……的新参考基因组。 (原文不完整,翻译只能到这里)

本文引用的文献

1
Assembly of large genomes using second-generation sequencing.使用第二代测序技术进行大基因组组装。
Genome Res. 2010 Sep;20(9):1165-73. doi: 10.1101/gr.101360.109. Epub 2010 May 27.
2
Characterization of missing human genome sequences and copy-number polymorphic insertions.人类基因组序列缺失特征及拷贝数多态性插入分析。
Nat Methods. 2010 May;7(5):365-71. doi: 10.1038/nmeth.1451.
3
Genome assembly quality: assessment and improvement using the neutral indel model.基因组组装质量:使用中性插入缺失模型进行评估和改进。
Genome Res. 2025 Aug 1;35(8):1902-1918. doi: 10.1101/gr.280274.124.
4
Comparative analysis of the mobilome yields new insights into its diversity, dynamics and evolution in parasites of the Trypanosomatidae family.动质体组的比较分析为锥虫科寄生虫的多样性、动态变化及进化提供了新见解。
Parasitology. 2025 May;152(6):602-617. doi: 10.1017/S0031182025100231.
5
New gene annotations for three pea aphid genome assemblies allow comparative analyses of genes and gene family evolution.三个豌豆蚜基因组组装的新基因注释允许对基因和基因家族进化进行比较分析。
bioRxiv. 2025 May 13:2025.05.08.652899. doi: 10.1101/2025.05.08.652899.
6
The complete chloroplast genomes of four Aspidopterys species and a comparison with other Malpighiaceae species.四种星毛藤属植物的完整叶绿体基因组及其与其他金虎尾科植物的比较。
Sci Rep. 2025 May 23;15(1):17893. doi: 10.1038/s41598-025-01724-6.
7
Comprehensive genome analysis of two (Cytosporaceae, Diaporthales) species associated with canker disease of spruce: and sp. nov.与云杉溃疡病相关的两种(座囊孢科,间座壳目)真菌的全基因组分析:[具体物种1]和新物种[具体物种2]
MycoKeys. 2025 May 5;117:89-119. doi: 10.3897/mycokeys.117.145445. eCollection 2025.
8
Efficient Detection and Characterization of Targets of Natural Selection Using Transfer Learning.利用迁移学习对自然选择目标进行高效检测与特征描述
Mol Biol Evol. 2025 Apr 30;42(5). doi: 10.1093/molbev/msaf094.
9
Genetic and Phenotypic Investigations of Viral Subpopulations Detected in Different Tissues of Laying Hens Following Infectious Bronchitis Virus Infection.传染性支气管炎病毒感染后蛋鸡不同组织中检测到的病毒亚群的遗传和表型研究
Viruses. 2025 Apr 4;17(4):527. doi: 10.3390/v17040527.
10
Kastor: a reference-based comparative approach for assessment and correction of gene-fragmenting errors in long-read assemblies of small genomes.卡斯特:一种基于参考的比较方法,用于评估和纠正小基因组长读长组装中的基因片段化错误。
BMC Genomics. 2025 Apr 18;26(1):388. doi: 10.1186/s12864-025-11569-y.
Genome Res. 2010 May;20(5):675-84. doi: 10.1101/gr.096966.109. Epub 2010 Mar 19.
4
Complete Khoisan and Bantu genomes from southern Africa.完成来自南非的科伊桑和班图人的全基因组。
Nature. 2010 Feb 18;463(7283):943-7. doi: 10.1038/nature08795.
5
Genetics: Decoding a national treasure.遗传学:解码一件国宝。
Nature. 2010 Jan 21;463(7279):303-4. doi: 10.1038/463303a.
6
De novo assembly of human genomes with massively parallel short read sequencing.利用大规模平行短读测序进行人类基因组从头组装。
Genome Res. 2010 Feb;20(2):265-72. doi: 10.1101/gr.097261.109. Epub 2009 Dec 17.
7
The sequence and de novo assembly of the giant panda genome.大熊猫基因组的序列与从头组装。
Nature. 2010 Jan 21;463(7279):311-7. doi: 10.1038/nature08696. Epub 2009 Dec 13.
8
Building the sequence map of the human pan-genome.构建人类泛基因组的序列图谱。
Nat Biotechnol. 2010 Jan;28(1):57-63. doi: 10.1038/nbt.1596. Epub 2009 Dec 7.
9
Genome 10K: a proposal to obtain whole-genome sequence for 10,000 vertebrate species.基因组 10K:获取 10000 种脊椎动物全基因组序列的提案。
J Hered. 2009 Nov-Dec;100(6):659-74. doi: 10.1093/jhered/esp086. Epub 2009 Nov 5.
10
The genome of the cucumber, Cucumis sativus L.黄瓜基因组, Cucumis sativus L.
Nat Genet. 2009 Dec;41(12):1275-81. doi: 10.1038/ng.475. Epub 2009 Nov 1.