Clum Alicia
United States Department of Energy Joint Genome Institute, Walnut Creek, CA, USA.
Methods Mol Biol. 2018;1775:141-153. doi: 10.1007/978-1-4939-7804-5_13.
Genome assembly uses sequence similarity to go from sequencing reads to longer contiguous sequences (contigs). Scaffolds are contigs linked together by gaps where the order and orientation of the contigs is known but the exact sequence connecting two contigs is unknown, represented by Ns which estimate the gap length. Here we describe recommendations for genome assembly for different sequencing technologies, describe organelle assembly, and review how to perform assembly quality control.
基因组组装利用序列相似性,从测序读段生成更长的连续序列(重叠群)。支架是通过间隙连接在一起的重叠群,其中重叠群的顺序和方向已知,但连接两个重叠群的精确序列未知,用N表示,N用于估计间隙长度。在这里,我们描述了针对不同测序技术的基因组组装建议,描述了细胞器组装,并回顾了如何进行组装质量控制。