Douglas Susan E, Knickle Leah C, Kimball Jennifer, Reith Michael E
Institute for Marine Biosciences,Halifax, Nova Scotia, Canada.
BMC Genomics. 2007 Jun 4;8:144. doi: 10.1186/1471-2164-8-144.
An essential first step in the genomic characterisation of a new species, in this case Atlantic halibut (Hippoglossus hippoglossus), is the generation of EST information. This forms the basis for subsequent microarray design, SNP detection and the placement of novel markers on genetic linkage maps.
Normalised directional cDNA libraries were constructed from five different larval stages (hatching, mouth-opening, midway to metamorphosis, premetamorphosis, and post-metamorphosis) and eight different adult tissues (testis, ovary, liver, head kidney, spleen, skin, gill, and intestine). Recombination efficiency of the libraries ranged from 91-98% and insert size averaged 1.4 kb. Approximately 1000 clones were sequenced from the 5'-end of each library and after trimming, 12675 good sequences were obtained. Redundancy within each library was very low and assembly of the entire EST collection into contigs resulted in 7738 unique sequences of which 6722 (87%) had matches in Genbank. Removal of ESTs and contigs that originated from bacteria or food organisms resulted in a total of 7710 unique halibut sequences.
A Unigene collection of 7710 functionally annotated ESTs has been assembled from Atlantic halibut. These have been incorporated into a publicly available, searchable database and form the basis for an oligonucleotide microarray that can be used as a tool to study gene expression in this economically important aquacultured fish.
对新物种(在此为大西洋庸鲽,Hippoglossus hippoglossus)进行基因组特征分析的关键第一步是生成EST信息。这为后续的微阵列设计、单核苷酸多态性(SNP)检测以及在遗传连锁图谱上定位新标记奠定了基础。
从五个不同的幼体阶段(孵化期、开口期、变态中期、变态前期和变态后期)以及八个不同的成体组织(睾丸、卵巢、肝脏、头肾、脾脏、皮肤、鳃和肠道)构建了标准化的定向cDNA文库。文库的重组效率在91%至98%之间,插入片段大小平均为1.4 kb。从每个文库的5'端对大约1000个克隆进行测序,经过修剪后,获得了12675条高质量序列。每个文库内的冗余度非常低,将整个EST集合组装成重叠群后得到7738个独特序列,其中6722个(87%)在Genbank中有匹配项。去除源自细菌或食物生物体的EST和重叠群后,总共得到7710个独特的庸鲽序列。
已从大西洋庸鲽组装了一个包含7710个功能注释EST的单基因集。这些已被纳入一个可公开访问、可搜索的数据库,并构成了一个寡核苷酸微阵列的基础,该微阵列可作为研究这种经济上重要的养殖鱼类基因表达的工具。