• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种从极短读段获得更好基因组组装的新策略。

A new strategy for better genome assembly from very short reads.

机构信息

Bioinformatics Center, Key Laboratory of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, PR China.

出版信息

BMC Bioinformatics. 2011 Dec 30;12:493. doi: 10.1186/1471-2105-12-493.

DOI:10.1186/1471-2105-12-493
PMID:22208765
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3268122/
Abstract

BACKGROUND

With the rapid development of the next generation sequencing (NGS) technology, large quantities of genome sequencing data have been generated. Because of repetitive regions of genomes and some other factors, assembly of very short reads is still a challenging issue.

RESULTS

A novel strategy for improving genome assembly from very short reads is proposed. It can increase accuracies of assemblies by integrating de novo contigs, and produce comparative contigs by allowing multiple references without limiting to genomes of closely related strains. Comparative contigs are used to scaffold de novo contigs. Using simulated and real datasets, it is shown that our strategy can effectively improve qualities of assemblies of isolated microbial genomes and metagenomes.

CONCLUSIONS

With more and more reference genomes available, our strategy will be useful to improve qualities of genome assemblies from very short reads. Some scripts are provided to make our strategy applicable at http://code.google.com/p/cd-hybrid/.

摘要

背景

随着下一代测序(NGS)技术的飞速发展,产生了大量的基因组测序数据。由于基因组的重复区域和其他一些因素,非常短的读取序列的组装仍然是一个具有挑战性的问题。

结果

提出了一种改进非常短读序列组装的新策略。通过整合从头组装的序列,它可以提高组装的准确性,并通过允许多个参考序列而不是限制在密切相关的菌株的基因组上,产生比较组装的序列。比较组装的序列用于支架从头组装的序列。使用模拟和真实数据集,结果表明,我们的策略可以有效地提高微生物基因组和宏基因组的组装质量。

结论

随着越来越多的参考基因组的出现,我们的策略将有助于提高非常短读序列组装的基因组质量。一些脚本可以在 http://code.google.com/p/cd-hybrid/ 上使用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a59e/3268122/e396c2e10012/1471-2105-12-493-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a59e/3268122/afe375d3acc8/1471-2105-12-493-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a59e/3268122/5439f8fe84a0/1471-2105-12-493-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a59e/3268122/f4e42017e429/1471-2105-12-493-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a59e/3268122/80a4eb2262af/1471-2105-12-493-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a59e/3268122/da530cd4d771/1471-2105-12-493-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a59e/3268122/672853df80f4/1471-2105-12-493-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a59e/3268122/e396c2e10012/1471-2105-12-493-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a59e/3268122/afe375d3acc8/1471-2105-12-493-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a59e/3268122/5439f8fe84a0/1471-2105-12-493-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a59e/3268122/f4e42017e429/1471-2105-12-493-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a59e/3268122/80a4eb2262af/1471-2105-12-493-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a59e/3268122/da530cd4d771/1471-2105-12-493-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a59e/3268122/672853df80f4/1471-2105-12-493-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a59e/3268122/e396c2e10012/1471-2105-12-493-7.jpg

相似文献

1
A new strategy for better genome assembly from very short reads.一种从极短读段获得更好基因组组装的新策略。
BMC Bioinformatics. 2011 Dec 30;12:493. doi: 10.1186/1471-2105-12-493.
2
LR_Gapcloser: a tiling path-based gap closer that uses long reads to complete genome assembly.LR_Gapcloser:一种基于平铺路径的缺口闭合器,它使用长读长来完成基因组组装。
Gigascience. 2019 Jan 1;8(1):giy157. doi: 10.1093/gigascience/giy157.
3
Evaluation and Validation of Assembling Corrected PacBio Long Reads for Microbial Genome Completion via Hybrid Approaches.通过混合方法组装校正后的PacBio长读段以完成微生物基因组的评估与验证
PLoS One. 2015 Dec 7;10(12):e0144305. doi: 10.1371/journal.pone.0144305. eCollection 2015.
4
Evaluation of short read metagenomic assembly.短读宏基因组组装评估。
BMC Genomics. 2011;12 Suppl 2(Suppl 2):S8. doi: 10.1186/1471-2164-12-S2-S8. Epub 2011 Jul 27.
5
SOPRA: Scaffolding algorithm for paired reads via statistical optimization.SOPRA:基于统计优化的配对读取支架算法。
BMC Bioinformatics. 2010 Jun 24;11:345. doi: 10.1186/1471-2105-11-345.
6
AlignGraph: algorithm for secondary de novo genome assembly guided by closely related references.AlignGraph:一种基于密切相关参考序列指导的二级从头基因组组装算法。
Bioinformatics. 2014 Jun 15;30(12):i319-i328. doi: 10.1093/bioinformatics/btu291.
7
Hybrid assembly of ultra-long Nanopore reads augmented with 10x-Genomics contigs: Demonstrated with a human genome.超长纳米孔读长的混合组装,扩增 10x 基因组图谱:用人基因组展示。
Genomics. 2019 Dec;111(6):1896-1901. doi: 10.1016/j.ygeno.2018.12.013. Epub 2018 Dec 27.
8
Multi-CAR: a tool of contig scaffolding using multiple references.多连续片段比对组装工具(Multi-CAR):一种使用多个参考序列进行重叠群搭建的工具。
BMC Bioinformatics. 2016 Dec 23;17(Suppl 17):469. doi: 10.1186/s12859-016-1328-7.
9
Fragmentation and Coverage Variation in Viral Metagenome Assemblies, and Their Effect in Diversity Calculations.病毒宏基因组组装中的碎片化和覆盖度变化,及其对多样性计算的影响。
Front Bioeng Biotechnol. 2015 Sep 17;3:141. doi: 10.3389/fbioe.2015.00141. eCollection 2015.
10
U: A New Metric for Measuring Assembly Output Based on Non-Overlapping, Target-Specific Contigs.U:一种基于非重叠、特定目标重叠群测量装配输出的新指标。
J Comput Biol. 2017 Nov;24(11):1071-1080. doi: 10.1089/cmb.2017.0013. Epub 2017 Apr 18.

引用本文的文献

1
Genome-Wide Development and Characterization of Microsatellite Markers in the Great Web-Spinning Sawfly .织网叶蜂微卫星标记的全基因组开发与特征分析
Ecol Evol. 2024 Nov 4;14(11):e70500. doi: 10.1002/ece3.70500. eCollection 2024 Nov.
2
Chromatin structure analysis enables detection of DNA insertions into the mammalian nuclear genome.染色质结构分析能够检测DNA插入到哺乳动物核基因组的情况。
Biochem Biophys Rep. 2015 Jun 10;2:143-152. doi: 10.1016/j.bbrep.2015.06.002. eCollection 2015 Jul.
3
A scalable and accurate targeted gene assembly tool (SAT-Assembler) for next-generation sequencing data.

本文引用的文献

1
Evaluating the fidelity of de novo short read metagenomic assembly using simulated data.使用模拟数据评估从头开始的短读宏基因组组装的保真度。
PLoS One. 2011;6(5):e19984. doi: 10.1371/journal.pone.0019984. Epub 2011 May 23.
2
Evaluation of next-generation sequencing software in mapping and assembly.下一代测序软件在图谱绘制和组装方面的评估。
J Hum Genet. 2011 Jun;56(6):406-14. doi: 10.1038/jhg.2011.43. Epub 2011 Apr 28.
3
Next generation sequence assembly with AMOS.使用AMOS进行下一代序列组装。
一种用于下一代测序数据的可扩展且准确的靶向基因组装工具(SAT组装器)。
PLoS Comput Biol. 2014 Aug 14;10(8):e1003737. doi: 10.1371/journal.pcbi.1003737. eCollection 2014 Aug.
4
Metagenomic identification of novel enteric viruses in urban wild rats and genome characterization of a group A rotavirus.城市野生大鼠中新型肠道病毒的宏基因组鉴定及一组A组轮状病毒的基因组特征分析
J Gen Virol. 2014 Dec;95(Pt 12):2734-2747. doi: 10.1099/vir.0.070029-0. Epub 2014 Aug 13.
5
A base composition analysis of natural patterns for the preprocessing of metagenome sequences.对宏基因组序列进行预处理的自然模式的碱基组成分析。
BMC Bioinformatics. 2013;14 Suppl 11(Suppl 11):S5. doi: 10.1186/1471-2105-14-S11-S5. Epub 2013 Nov 4.
6
Microevolution of Renibacterium salmoninarum: evidence for intercontinental dissemination associated with fish movements.鲑鱼传染性造血器官坏死病病毒的微观进化:与鱼类活动相关的洲际传播证据。
ISME J. 2014 Apr;8(4):746-56. doi: 10.1038/ismej.2013.186. Epub 2013 Oct 31.
Curr Protoc Bioinformatics. 2011 Mar;Chapter 11:Unit 11.8. doi: 10.1002/0471250953.bi1108s33.
4
Limitations of next-generation genome sequence assembly.下一代基因组序列组装的局限性。
Nat Methods. 2011 Jan;8(1):61-5. doi: 10.1038/nmeth.1527. Epub 2010 Nov 21.
5
De novo assembly of short sequence reads.从头组装短序列读段。
Brief Bioinform. 2010 Sep;11(5):457-72. doi: 10.1093/bib/bbq020. Epub 2010 Aug 19.
6
Whole genome assembly of a natto production strain Bacillus subtilis natto from very short read data.纳豆生产菌株枯草芽孢杆菌纳豆的全基因组组装来自非常短的读取数据。
BMC Genomics. 2010 Apr 16;11:243. doi: 10.1186/1471-2164-11-243.
7
Improving draft assemblies by iterative mapping and assembly of short reads to eliminate gaps.通过将短读段迭代映射和组装来消除缺口,从而改进草案组装。
Genome Biol. 2010;11(4):R41. doi: 10.1186/gb-2010-11-4-r41. Epub 2010 Apr 13.
8
De novo assembly of a 40 Mb eukaryotic genome from short sequence reads: Sordaria macrospora, a model organism for fungal morphogenesis.从头组装 40Mb 真核生物基因组的短序列读取:Sordaria macrospora,一种真菌形态发生的模式生物。
PLoS Genet. 2010 Apr 8;6(4):e1000891. doi: 10.1371/journal.pgen.1000891.
9
A primer on metagenomics.元基因组学简介。
PLoS Comput Biol. 2010 Feb 26;6(2):e1000667. doi: 10.1371/journal.pcbi.1000667.
10
Assembling genomes using short-read sequencing technology.使用短读测序技术组装基因组。
Genome Biol. 2010 Jan 28;11(1):202. doi: 10.1186/gb-2010-11-1-202.