Chen Wei-Hua, Wang Xue-Xia, Lin Wei, He Xiao-Wei, Wu Zhen-Qiang, Lin Ying, Hu Song-Nian, Wang Xiao-Ning
Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China.
BMC Genomics. 2006 Apr 18;7:82. doi: 10.1186/1471-2164-7-82.
The cynomolgus monkey (Macaca fascicularis) is one of the most widely used surrogate animal models for an increasing number of human diseases and vaccines, especially those related to the immune system. To better understand the gene expression background of its immunogenetics, we constructed a cDNA library from Epstein-Barr virus (EBV)-transformed B lymphocytes of a cynomolgus monkey and sequenced 10,000 randomly picked clones.
After processing, 8,312 high-quality expressed sequence tags (ESTs) were generated and assembled into 3,728 unigenes. Annotation of these unique transcripts showed that, of the 2,524 open reading frame (ORF)-positive unigenes (mitochondrial and ribosomal sequences excluded), 98.8% shared significant similarity (E-value less than 1e-10) with sequences in the NCBI nucleotide (nt) database, while only 67.7% (E-value less than 1e-5) did so with the NCBI non-redundant protein (nr) database. Further analysis revealed that 90.0% of the unigenes with no similarity to the nr database could be assigned to human chromosomes, of which 75 did not significantly match any cynomolgus monkey or human EST. The regions where these unigenes mapped relative to known human genes on the human genome were described in detail. Protein family and domain analysis revealed that the first, second and fourth most abundantly expressed protein families were all immunoglobulin and major histocompatibility complex (MHC)-related proteins. The expression profiles of these genes were compared with those of homologous genes in human blood, lymph nodes and a RAMOS cell line, which demonstrated expression changes after transformation with EBV. The degree of sequence similarity of the MHC class I and II genes to the human reference sequences was also evaluated. Class I molecules showed relatively low amino acid identity (<90%), while class II molecules showed slightly higher identity.
These results indicate that genes expressed in the cynomolgus monkey could be used to identify novel protein-coding genes and to revise incomplete or incorrect annotations in the human genome by comparative methods, since Old World monkeys and humans share high similarity at the molecular level, especially within coding regions. The identification of multiple genes involved in the immune response, their sequence variation relative to the human homologues, and their responses to EBV infection could provide useful information for improving our understanding of the cynomolgus monkey immune system.