Department of Multimedia Engineering, Dongguk University, Seoul 04620, Korea.
Department of Information and Communication Engineering, Myongji University, Yongin-si 17058, Korea.
Int J Mol Sci. 2022 Sep 29;23(19):11484. doi: 10.3390/ijms231911484.
Advances in the next-generation sequencing technology have led to a dramatic decrease in read-generation cost and an increase in read output. Reconstruction of short DNA sequence reads generated by next-generation sequencing requires a read alignment method that reconstructs a reference genome. In addition, it is essential to analyze the results of read alignments for a biologically meaningful inference. However, read alignment from vast amounts of genomic data from various organisms is challenging in that it involves repeated automatic and manual analysis steps. We, here, devised cPlot software for read alignment of nucleotide sequences, with automated read alignment and position analysis, which allows visual assessment of the analysis results by the user. cPlot compares sequence similarity of reads by performing multiple read alignments, with FASTA format files as the input. This application provides a web-based interface for the user for facile implementation, without the need for a dedicated computing environment. cPlot identifies the location and order of the sequencing reads by comparing the sequence to a genetically close reference sequence in a way that is effective for visualizing the assembly of short reads generated by NGS and rapid gene map construction.
下一代测序技术的进步使得读取生成成本大幅降低,读取输出量增加。通过下一代测序生成的短 DNA 序列读取的重建需要一种读取对齐方法,该方法可以重建参考基因组。此外,对读取对齐的结果进行生物意义上的推断也很重要。然而,从各种生物体的大量基因组数据中进行读取对齐具有挑战性,因为它涉及到重复的自动和手动分析步骤。我们在这里设计了 cPlot 软件,用于核苷酸序列的读取对齐,具有自动化的读取对齐和位置分析功能,允许用户通过视觉评估分析结果。cPlot 通过执行多次读取对齐来比较读取的序列相似性,其输入为 FASTA 格式的文件。该应用程序为用户提供了基于网络的界面,便于实施,无需专用计算环境。cPlot 通过将序列与遗传上接近的参考序列进行比较来识别测序读取的位置和顺序,这种方法对于可视化 NGS 生成的短读取的组装和快速基因图谱构建非常有效。