Department of Chemical Engineering and Materials Science, University of Minnesota, 421 Washington Avenue SE, Minneapolis, Minnesota.
Biotechnol Bioeng. 2014 Apr;111(4):770-81. doi: 10.1002/bit.25135. Epub 2013 Nov 19.
Baby Hamster Kidney (BHK) cell lines are used in the production of veterinary vaccines and recombinant proteins. To facilitate transcriptome analysis of BHK cell lines, we embarked on an effort to sequence, assemble, and annotate transcript sequences from a recombinant BHK cell line and Syrian hamster liver and brain. RNA-seq data were supplemented with 6,170 Sanger ESTs from parental and recombinant BHK lines to generate 221,583 contigs. Annotation by homology to other species, primarily mouse, yielded more than 15,000 unique Ensembl mouse gene IDs with high coverage of KEGG canonical pathways. High coverage of enzymes and isoforms was seen for cell metabolism and N-glycosylation pathways, areas of highest interest for biopharmaceutical production. With the high sequencing depth in RNA-seq data, we set out to identify single-nucleotide variants in the transcripts. A majority of the high-confidence variants detected in both hamster tissue libraries occurred at a frequency of 50%, indicating their origin as heterozygous germline variants. In contrast, the cell line libraries' variants showed a wide range of occurrence frequency, indicating the presence of a heterogeneous population in cultured cells. The extremely high coverage of transcripts of highly abundant genes in RNA-seq enabled us to identify low-frequency variants. Experimental verification through Sanger sequencing confirmed the presence of two variants in the cDNA of a highly expressed gene in the BHK cell line. Furthermore, we detected seven potential missense mutations in the genes of the growth signaling pathways that may have arisen during the cell line derivation process. The development and characterization of a BHK reference transcriptome will facilitate future efforts to understand, monitor, and manipulate BHK cells. Our study on sequencing variants is crucial for improved understanding of the errors inherent in high-throughput sequencing and to increase the accuracy of variant calling in BHK or other systems.
仓鼠肾(BHK)细胞系用于兽用疫苗和重组蛋白的生产。为了促进 BHK 细胞系的转录组分析,我们着手对重组 BHK 细胞系和叙利亚仓鼠肝脏和大脑的转录序列进行测序、组装和注释。RNA-seq 数据辅以来自亲本和重组 BHK 系的 6170 个 Sanger EST,生成了 221583 个重叠群。通过与其他物种(主要是小鼠)的同源性注释,获得了超过 15000 个具有高覆盖度的 KEGG 经典途径的独特 Ensembl 小鼠基因 ID。细胞代谢和 N-糖基化途径的酶和同工酶具有高覆盖率,这是生物制药生产最感兴趣的领域。在 RNA-seq 数据的高测序深度下,我们着手鉴定转录本中的单核苷酸变异。在两种仓鼠组织文库中检测到的高可信度变异中的大多数,其出现频率为 50%,表明它们是杂合性种系变异的起源。相比之下,细胞系文库中的变异则显示出广泛的出现频率,表明培养细胞中存在异质群体。高丰度基因在 RNA-seq 中转录本的极高覆盖率使我们能够鉴定低频变异。通过 Sanger 测序进行的实验验证证实了在 BHK 细胞系中高度表达基因的 cDNA 中存在两种变异。此外,我们在细胞信号转导途径的基因中检测到了七个可能的错义突变,这些突变可能是在细胞系衍生过程中产生的。BHK 参考转录组的开发和特征将有助于未来理解、监测和操纵 BHK 细胞。我们对测序变异的研究对于提高对高通量测序固有错误的理解以及提高 BHK 或其他系统中变异调用的准确性至关重要。