• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

扩展短DNA序列的组装以处理错误。

Extending assembly of short DNA sequences to handle error.

作者信息

Jeck William R, Reinhardt Josephine A, Baltrus David A, Hickenbotham Matthew T, Magrini Vincent, Mardis Elaine R, Dangl Jeffery L, Jones Corbin D

机构信息

Department of Biology, University of Carolina-Chapel Hill, Chapel Hill, NC 27599, USA.

出版信息

Bioinformatics. 2007 Nov 1;23(21):2942-4. doi: 10.1093/bioinformatics/btm451. Epub 2007 Sep 24.

DOI:10.1093/bioinformatics/btm451
PMID:17893086
Abstract

UNLABELLED

Inexpensive de novo genome sequencing, particularly in organisms with small genomes, is now possible using several new sequencing technologies. Some of these technologies such as that from Illumina's Solexa Sequencing, produce high genomic coverage by generating a very large number of small reads ( approximately 30 bp). While prior work shows that partial assembly can be performed by k-mer extension in error-free reads, this algorithm is unsuccessful with the sequencing error rates found in practice. We present VCAKE (Verified Consensus Assembly by K-mer Extension), a modification of simple k-mer extension that overcomes error by using high depth coverage. Though it is a simple modification of a previous approach, we show significant improvements in assembly results on simulated and experimental datasets that include error.

AVAILABILITY

http://152.2.15.114/~labweb/VCAKE

摘要

未标注

利用几种新的测序技术,现在可以进行低成本的从头基因组测序,尤其是对于基因组较小的生物。其中一些技术,如Illumina公司的Solexa测序技术,通过生成大量短读段(约30个碱基对)实现高基因组覆盖度。虽然先前的工作表明在无错误读段中可通过k-mer扩展进行部分组装,但该算法在实际测序错误率情况下并不成功。我们提出了VCAKE(通过k-mer扩展进行验证的一致性组装),它是对简单k-mer扩展的一种改进,通过使用高深度覆盖来克服错误。尽管它是对先前方法的简单修改,但我们在包含错误的模拟和实验数据集上的组装结果有显著改进。

可用性

http://152.2.15.114/~labweb/VCAKE

相似文献

1
Extending assembly of short DNA sequences to handle error.扩展短DNA序列的组装以处理错误。
Bioinformatics. 2007 Nov 1;23(21):2942-4. doi: 10.1093/bioinformatics/btm451. Epub 2007 Sep 24.
2
Assembling millions of short DNA sequences using SSAKE.使用SSAKE组装数百万条短DNA序列。
Bioinformatics. 2007 Feb 15;23(4):500-1. doi: 10.1093/bioinformatics/btl629. Epub 2006 Dec 8.
3
Fragment assembly with short reads.使用短读段进行片段组装。
Bioinformatics. 2004 Sep 1;20(13):2067-74. doi: 10.1093/bioinformatics/bth205. Epub 2004 Apr 1.
4
A sensitive repeat identification framework based on short and long reads.基于短读长读的敏感重复序列识别框架。
Nucleic Acids Res. 2021 Sep 27;49(17):e100. doi: 10.1093/nar/gkab563.
5
HISEA: HIerarchical SEed Aligner for PacBio data.HISEA:用于PacBio数据的分层种子比对器。
BMC Bioinformatics. 2017 Dec 19;18(1):564. doi: 10.1186/s12859-017-1953-9.
6
Illumina error correction near highly repetitive DNA regions improves de novo genome assembly.Illumina 纠错技术在高度重复 DNA 区域的应用提高了从头基因组组装的质量。
BMC Bioinformatics. 2019 Jun 3;20(1):298. doi: 10.1186/s12859-019-2906-2.
7
Quantification of rare allelic variants from pooled genomic DNA.从混合基因组DNA中对罕见等位基因变异进行定量分析。
Nat Methods. 2009 Apr;6(4):263-5. doi: 10.1038/nmeth.1307. Epub 2009 Mar 1.
8
De novo sequencing and variant calling with nanopores using PoreSeq.使用PoreSeq通过纳米孔进行从头测序和变异检测。
Nat Biotechnol. 2015 Oct;33(10):1087-91. doi: 10.1038/nbt.3360. Epub 2015 Sep 9.
9
De novo assembly of the Pseudomonas syringae pv. syringae B728a genome using Illumina/Solexa short sequence reads.利用Illumina/Solexa短序列 reads 对丁香假单胞菌丁香致病变种B728a基因组进行从头组装。
FEMS Microbiol Lett. 2009 Feb;291(1):103-11. doi: 10.1111/j.1574-6968.2008.01441.x. Epub 2008 Dec 9.
10
Benchmarking of de novo assembly algorithms for Nanopore data reveals optimal performance of OLC approaches.用于纳米孔数据的从头组装算法基准测试揭示了重叠布局一致(OLC)方法的最佳性能。
BMC Genomics. 2016 Aug 22;17 Suppl 7(Suppl 7):507. doi: 10.1186/s12864-016-2895-8.

引用本文的文献

1
Conventional and Omics Approaches for Understanding the Abiotic Stress Response in Cereal Crops-An Updated Overview.用于理解谷类作物非生物胁迫响应的传统方法和组学方法——最新综述
Plants (Basel). 2022 Oct 26;11(21):2852. doi: 10.3390/plants11212852.
2
Era of Molecular Diagnostics Techniques before and after the COVID-19 Pandemic.新冠疫情前后的分子诊断技术时代
Curr Issues Mol Biol. 2022 Oct 11;44(10):4769-4789. doi: 10.3390/cimb44100325.
3
Empirical evaluation of methods for genome assembly.基因组组装方法的实证评估。
PeerJ Comput Sci. 2021 Jul 9;7:e636. doi: 10.7717/peerj-cs.636. eCollection 2021.
4
Features of sRNA biogenesis in rice revealed by genetic dissection of sRNA expression level.通过对小RNA表达水平的遗传剖析揭示水稻中小RNA生物合成的特征
Comput Struct Biotechnol J. 2020 Oct 23;18:3207-3216. doi: 10.1016/j.csbj.2020.10.012. eCollection 2020.
5
Exon sequencing mutation detection algorithm based on PCR matching.基于聚合酶链式反应匹配的外显子测序突变检测算法
PLoS One. 2020 Aug 13;15(8):e0236709. doi: 10.1371/journal.pone.0236709. eCollection 2020.
6
GAAP: A Genome Assembly + Annotation Pipeline.GAAP:基因组组装与注释流水线。
Biomed Res Int. 2019 Jun 26;2019:4767354. doi: 10.1155/2019/4767354. eCollection 2019.
7
NanoARG: a web service for detecting and contextualizing antimicrobial resistance genes from nanopore-derived metagenomes.NanoARG:一个从纳米孔宏基因组中检测和分析抗菌药物耐药基因的网络服务。
Microbiome. 2019 Jun 7;7(1):88. doi: 10.1186/s40168-019-0703-9.
8
GMASS: a novel measure for genome assembly structural similarity.GMASS:一种用于基因组组装结构相似性的新度量。
BMC Bioinformatics. 2019 Mar 18;20(1):147. doi: 10.1186/s12859-019-2710-z.
9
TraRECo: a greedy approach based de novo transcriptome assembler with read error correction using consensus matrix.TraRECo:一种基于贪心策略的从头转录组组装方法,使用一致矩阵进行读错误校正。
BMC Genomics. 2018 Sep 4;19(1):653. doi: 10.1186/s12864-018-5034-x.
10
Survey of gene splicing algorithms based on reads.基于读取的基因剪接算法调查。
Bioengineered. 2017 Nov 2;8(6):750-758. doi: 10.1080/21655979.2017.1373538. Epub 2017 Sep 21.