Molecular Cancer Research Center, School of Medicine, Shenzhen Campus of Sun Yat-sen University, Sun Yat-sen University, Shenzhen, 518107, China.
Institute of Cancer Research, Shenzhen Bay Laboratory, Shenzhen, 518132, China.
Commun Biol. 2024 Jun 1;7(1):675. doi: 10.1038/s42003-024-06382-4.
The three-dimensional (3D) organization of the genome is fundamental to cell biology. To explore the 3D genome, emerging high-throughput approaches have produced billions of sequencing reads, which are challenging and time-consuming to analyze. Here we present Microcket, a package for mapping and extracting interacting pairs from 3D genomics data, including Hi-C, Micro-C, and derivative protocols. Microcket uses a unique read-stitch strategy that takes advantage of the long read cycles of modern DNA sequencers. Benchmark evaluations show that Microcket runs much faster than current tools while also improving mapping efficiency, and it therefore shows high potential for accelerating and enhancing biological investigations of the 3D genome. Microcket is freely available at https://github.com/hellosunking/Microcket .
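The abstract does not describe how the read-stitch step works internally. As a rough illustration only, the sketch below assumes that "stitching" means merging the two mates of a paired-end read into one longer sequence when their ends overlap, which becomes more likely as read cycles get longer. The function names `reverse_complement` and `stitch_pair`, and the parameters `min_overlap` and `max_mismatch_rate`, are hypothetical and are not taken from Microcket.

```python
from typing import Optional

# Complement table for DNA bases; N (unknown base) maps to itself.
_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA sequence."""
    return "".join(_COMPLEMENT[base] for base in reversed(seq))


def stitch_pair(
    read1: str,
    read2: str,
    min_overlap: int = 10,
    max_mismatch_rate: float = 0.1,
) -> Optional[str]:
    """Merge two paired-end mates into one sequence if their ends overlap.

    Hypothetical sketch, not Microcket's implementation. It assumes read2
    comes from the opposite strand, and it only handles fragments that are
    at least as long as each read (no adapter read-through).
    """
    if min_overlap < 1:
        raise ValueError("min_overlap must be at least 1")

    # Put read2 on the same strand as read1.
    mate = reverse_complement(read2)

    # Try the longest candidate overlap first, then shrink.
    for overlap in range(min(len(read1), len(mate)), min_overlap - 1, -1):
        tail = read1[-overlap:]   # 3' end of read1
        head = mate[:overlap]     # 5' end of the re-oriented read2
        mismatches = sum(a != b for a, b in zip(tail, head))
        if mismatches <= max_mismatch_rate * overlap:
            # Keep read1 whole and append the non-overlapping part of the mate.
            return read1 + mate[overlap:]

    # No acceptable overlap: the mates would have to be handled separately.
    return None
```

Under this assumption, a pair that stitches successfully can be treated as a single long read, while a pair for which `stitch_pair` returns `None` is handled as two separate mates. A real tool would also need to consider base qualities and adapter read-through, both of which this sketch ignores.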