Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, United States.
Department of Genetics, The Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine, St Louis, MO 63110, United States.
Bioinformatics. 2024 Jul 1;40(7). doi: 10.1093/bioinformatics/btae404.
With single-cell DNA methylation studies yielding vast datasets, existing data formats struggle with the unique challenges of storage and efficient operations, highlighting a need for improved solutions.
BAllC (Binary All Cytosines) emerges as a tailored format for methylation data, addressing these challenges. BAllCools, its complementary software toolkit, enhances parsing, indexing, and querying capabilities, promising superior operational speeds and reduced storage needs.
单细胞 DNA 甲基化研究产生了大量数据集,现有数据格式在存储和高效操作方面面临独特的挑战,这凸显了对改进解决方案的需求。
BAllC(二进制全胞嘧啶)作为一种针对甲基化数据的定制格式出现,解决了这些挑战。BAllCools 是其配套的软件工具包,增强了解析、索引和查询功能,有望实现更高的操作速度和更低的存储需求。