Kim Tae-Hun, Kim Nam-Soon, Lim Dajeong, Lee Kyung-Tai, Oh Jung-Hwa, Park Hye-Sook, Jang Gil-Won, Kim Hyung-Yong, Jeon Mina, Choi Bong-Hwan, Lee Hae-Young, Chung H Y, Kim Heebal
Division of Animal Genomics & Bioinformatics, National Livestock Research Institute, Rural Development Administration, Omokchun-dong 564, Suwon, Korea.
BMC Genomics. 2006 Feb 27;7:36. doi: 10.1186/1471-2164-7-36.
Genome research in farm animals will expand our basic knowledge of the genetic control of complex traits, and the results will be applied in the livestock industry to improve meat quality and productivity, as well as to reduce the incidence of disease. A combination of quantitative trait locus mapping and microarray analysis is a useful approach to reduce the overall effort needed to identify genes associated with quantitative traits of interest.
We constructed a full-length enriched cDNA library from porcine backfat tissue. The estimated average size of the cDNA inserts was 1.7 kb, and the cDNA fullness ratio was 70%. In total, we deposited 16,110 high-quality sequences in the dbEST division of GenBank (accession numbers: DT319652-DT335761). For all the expressed sequence tags (ESTs), approximately 10.9 Mb of porcine sequence were generated with an average length of 674 bp per EST (range: 200-952 bp). Clustering and assembly of these ESTs resulted in a total of 5,008 unique sequences with 1,776 contigs (35.46%) and 3,232 singleton (65.54%) ESTs. From a total of 5,008 unique sequences, 3,154 (62.98%) were similar to other sequences, and 1,854 (37.02%) were identified as having no hit or low identity (<95%) and 60% coverage in The Institute for Genomic Research (TIGR) gene index of Sus scrofa. Gene ontology (GO) annotation of unique sequences showed that approximately 31.7, 32.3, and 30.8% were assigned molecular function, biological process, and cellular component GO terms, respectively. A total of 1,854 putative novel transcripts resulted after comparison and filtering with the TIGR SsGI; these included a large percentage of singletons (80.64%) and a small proportion of contigs (13.36%).
The sequence data generated in this study will provide valuable information for studying expression profiles using EST-based microarrays and assist in the condensation of current pig TCs into clusters representing longer stretches of cDNA sequences. The isolation of genes expressed in backfat tissue is the first step toward a better understanding of backfat tissue on a genomic basis.
家畜基因组研究将扩展我们对复杂性状遗传控制的基础知识,研究结果将应用于畜牧业,以提高肉质和生产力,并降低疾病发生率。数量性状基因座定位和微阵列分析相结合是一种有用的方法,可减少鉴定与感兴趣的数量性状相关基因所需的总体工作量。
我们构建了一个来自猪背膘组织的全长富集cDNA文库。cDNA插入片段的估计平均大小为1.7 kb,cDNA完整性比率为70%。我们总共向GenBank的dbEST部门提交了16,110条高质量序列(登录号:DT319652 - DT335761)。对于所有表达序列标签(EST),共生成了约10.9 Mb的猪序列,每个EST的平均长度为674 bp(范围:200 - 952 bp)。这些EST的聚类和组装产生了总共5,008个独特序列,其中有1,776个重叠群(35.46%)和3,232个单拷贝(65.54%)EST。在总共5,008个独特序列中,3,154个(62.98%)与其他序列相似,1,854个(37.02%)在猪的基因组研究所(TIGR)基因索引中被鉴定为无匹配或低同一性(<95%)且覆盖率<60%。独特序列的基因本体(GO)注释显示,分别约有31.7%、32.3%和30.8%被分配了分子功能、生物学过程和细胞成分GO术语。与TIGR SsGI进行比较和筛选后,共产生了1,854个推定的新转录本;其中包括很大比例的单拷贝(80.64%)和一小部分重叠群(13.36%)。
本研究产生的序列数据将为使用基于EST的微阵列研究表达谱提供有价值的信息,并有助于将当前猪的转录本聚类浓缩为代表更长cDNA序列片段的簇。分离在背膘组织中表达的基因是在基因组基础上更好地理解背膘组织的第一步。