Department of Data Science, Dana-Faber Cancer Institute, Boston, MA 02215, USA.
Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02215, USA.
Bioinformatics. 2021 Jun 9;37(9):1315-1316. doi: 10.1093/bioinformatics/btaa827.
We present bedtk, a new toolkit for manipulating genomic intervals in the BED format. It supports sorting, merging, intersection, subtraction and the calculation of the breadth of coverage. Bedtk uses implicit interval tree, a data structure for fast interval overlap queries. It is several to tens of times faster than existing tools and tends to use less memory.
The source code is available at https://github.com/lh3/bedtk.
我们介绍了 bedtk,这是一个用于操作 BED 格式基因组区间的新工具包。它支持排序、合并、交集、差集和覆盖度计算。bedtk 使用隐式区间树,这是一种用于快速区间重叠查询的数据结构。它比现有工具快几倍到几十倍,并且倾向于使用更少的内存。
源代码可在 https://github.com/lh3/bedtk 获得。