Robertson Hugh M, Reese Justin T, Milshina Natalia V, Agarwala Richa, Solignac Michel, Walden Kimberly K O, Elsik Christine G
Department of Entomology, University of Illinois at Urbana-Champaign, Urbana 61801, USA.
Insect Mol Biol. 2007 Aug;16(4):401-10. doi: 10.1111/j.1365-2583.2007.00738.x. Epub 2007 May 16.
The euchromatic arms of the five smallest telocentric chromosomes in the honey bee genome draft Assembly v4 were manually connected into superscaffolds. This effort reduced chromosomes 12-16 from 30, 21, 25, 42, and 21 mapped scaffolds to five, four, five, six, and five superscaffolds, respectively, and incorporated 178 unmapped contigs and scaffolds totalling 2.6 Mb, a 6.4% increase in length. The superscaffolds extend from the genetically mapped location of the centromere to their identified distal telomeres on the long arms. Only two major misassemblies, involving sections of 146 kb and 65 kb, were identified in this 23% of the mapped assembly. Nine duplicate gene models on chromosomes 15 and 16 were made redundant, while another 15 gene models were improved, most spectacularly the MAD (MAX dimerization protein) gene, which extends across 11 scaffolds for at least 400 kb.
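The counts quoted in the abstract can be tallied with a short sketch. The per-chromosome numbers and the 2.6 Mb and 6.4% figures come from the abstract; the variable names are illustrative, and the implied pre-existing length is only a back-of-envelope estimate that assumes the 6.4% increase is measured against the mapped length of these five chromosomes before the unmapped sequence was added.

```python
# Mapped scaffolds per chromosome before manual superscaffolding (from the abstract).
scaffolds_before = {12: 30, 13: 21, 14: 25, 15: 42, 16: 21}
# Superscaffolds per chromosome afterwards (from the abstract).
superscaffolds_after = {12: 5, 13: 4, 14: 5, 15: 6, 16: 5}

total_before = sum(scaffolds_before.values())
total_after = sum(superscaffolds_after.values())

# Unmapped sequence incorporated, and the relative increase it represents.
added_mb = 2.6
increase_fraction = 0.064

# Assumption: the increase is relative to the prior mapped length of these
# chromosomes. Both inputs are rounded, so the result is approximate.
implied_prior_mb = added_mb / increase_fraction
implied_final_mb = implied_prior_mb + added_mb

print(f"scaffolds: {total_before} -> {total_after} superscaffolds")
print(f"implied prior length: ~{implied_prior_mb:.1f} Mb")
print(f"implied final length: ~{implied_final_mb:.1f} Mb")
```

Under these assumptions, 139 mapped scaffolds collapse into 25 superscaffolds, and the five chromosome arms would have spanned roughly 40 Mb before the additions.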