Chen Rui, Sodergren Erica, Weinstock George M, Gibbs Richard A
Department of Molecular and Human Genetics, Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA.
Genome Res. 2004 Apr;14(4):679-84. doi: 10.1101/gr.2171704.
CLONEPICKER is a software pipeline that integrates sequence data with BAC clone fingerprints to dynamically select a minimal overlapping clone set covering the whole genome. In the Rat Genome Sequencing Project (RGSP), a hybrid strategy of "clone by clone" and "whole genome shotgun" approaches was used to maximize the merits of both approaches. Like the "clone by clone" method, one key challenge for this strategy was to select a low-redundancy clone set that covered the whole genome while the sequencing is in progress. The CLONEPICKER pipeline met this challenge using restriction enzyme fingerprint data, BAC end sequence data, and sequences generated from individual BAC clones as well as WGS reads. In the RGSP, an average of 7.5 clones was identified from each side of a seed clone, and the minimal overlapping clones were reliably selected. Combined with the assembled BAC fingerprint map, a set of BAC clones that covered >97% of the genome was identified and used in the RGSP.
CLONEPICKER是一个软件流程,它将序列数据与BAC克隆指纹相结合,以动态选择覆盖整个基因组的最小重叠克隆集。在大鼠基因组测序计划(RGSP)中,采用了“逐个克隆”和“全基因组鸟枪法”相结合的混合策略,以充分发挥两种方法的优点。与“逐个克隆”方法一样,该策略面临的一个关键挑战是在测序过程中选择一个覆盖整个基因组的低冗余克隆集。CLONEPICKER流程利用限制性内切酶指纹数据、BAC末端序列数据、单个BAC克隆产生的序列以及全基因组鸟枪法测序读段来应对这一挑战。在RGSP中,从一个种子克隆的每一侧平均鉴定出7.5个克隆,并可靠地选择了最小重叠克隆。结合组装好的BAC指纹图谱,鉴定出一组覆盖基因组97%以上的BAC克隆,并用于RGSP。