Department of Animal Biotechnology, College of Veterinary Science & Animal Husbandry, Anand Agricultural University, Anand 388 001, India.
Genomics Proteomics Bioinformatics. 2012 Apr;10(2):107-13. doi: 10.1016/j.gpb.2012.05.005. Epub 2012 Jun 12.
Whole genome sequencing of buffalo is yet to be completed, and in the near future it may not be possible to identify an exome (coding region of genome) through bioinformatics for designing probes to capture it. In the present study, we employed in solution hybridization to sequence tissue specific temporal exomes (TST exome) in buffalo. We utilized cDNA prepared from buffalo muscle tissue as a probe to capture TST exomes from the buffalo genome. This resulted in a prominent reduction of repeat sequences (up to 40%) and an enrichment of coding sequences (up to 60%). Enriched targets were sequenced on a 454 pyro-sequencing platform, generating 101,244 reads containing 24,127,779 high quality bases. The data revealed 40,100 variations, of which 403 were indels and 39,218 SNPs containing 195 nonsynonymous candidate SNPs in protein-coding regions. The study has indicated that 80% of the total genes identified from capture data were expressed in muscle tissue. The present study is the first of its kind to sequence TST exomes captured by use of cDNA molecules for SNPs found in the coding region without any prior sequence information of targeted molecules.
水牛的全基因组测序尚未完成,在不久的将来,通过生物信息学来识别外显子(基因组的编码区域)以设计捕获它的探针可能是不可能的。在本研究中,我们采用溶液杂交技术对水牛的组织特异性时间外显子(TST 外显子)进行测序。我们利用从水牛肌肉组织中制备的 cDNA 作为探针,从水牛基因组中捕获 TST 外显子。这导致重复序列(高达 40%)显著减少,编码序列(高达 60%)富集。富集的靶标在 454 焦磷酸测序平台上进行测序,生成包含 24,127,779 个高质量碱基的 101,244 条reads。该数据揭示了 40,100 个变异,其中 403 个是插入缺失,39,218 个 SNPs 包含 195 个蛋白编码区中的非同义候选 SNPs。该研究表明,从捕获数据中鉴定的总基因的 80%在肌肉组织中表达。本研究首次在没有靶向分子的任何先前序列信息的情况下,利用 cDNA 分子对编码区中发现的 SNPs 进行 TST 外显子的测序。