Ono Yukiteru, Hamada Michiaki, Asai Kiyoshi
Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa 277-8561, Japan.
Department of Electrical Engineering and Bioscience, Faculty of Science and Engineering, Waseda University, 55N-06-10, 3-4-1, Okubo, Shinjuku-ku, Tokyo 169-8555, Japan.
NAR Genom Bioinform. 2022 Dec 1;4(4):lqac092. doi: 10.1093/nargab/lqac092. eCollection 2022 Dec.
Long-read sequencers, such as Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT) sequencers, have improved their read length and accuracy, thereby opening up unprecedented research. Many tools and algorithms have been developed to analyze long reads, and rapid progress in PacBio and ONT has further accelerated their development. Together with the development of high-throughput sequencing technologies and their analysis tools, many read simulators have been developed and effectively utilized. PBSIM is one of the popular long-read simulators. In this study, we developed PBSIM3 with three new functions: error models for long reads, multi-pass sequencing for high-fidelity read simulation and transcriptome sequencing simulation. Therefore, PBSIM3 is now able to meet a wide range of long-read simulation requirements.
长读长测序仪,如太平洋生物科学公司(PacBio)和牛津纳米孔技术公司(ONT)的测序仪,已经提高了它们的读长和准确性,从而开启了前所未有的研究。已经开发了许多工具和算法来分析长读长,PacBio和ONT的快速发展进一步加速了它们的开发。随着高通量测序技术及其分析工具的发展,许多读长模拟器已经被开发并得到有效利用。PBSIM是最受欢迎的长读长模拟器之一。在本研究中,我们开发了具有三项新功能的PBSIM3:长读长错误模型、用于高保真读长模拟的多次测序和转录组测序模拟。因此,PBSIM3现在能够满足广泛的长读长模拟需求。