College of Biology, Hunan Provincial Key Laboratory of Medical Virology, Hunan University, Changsha, China.
Hunan Yuelu mountain data science and Technology Research Institute Co., Ltd.
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa380.
African swine fever virus (ASFV) poses serious threats to the pig industry. The multigene family (MGF) proteins are extensively distributed in ASFVs and are generally classified into five families, including MGF-100, MGF-110, MGF-300, MGF-360 and MGF-505. Most MGF proteins, however, have not been well characterized and classified within each family. To bridge this gap, this study first classified MGF proteins into 31 groups based on protein sequence homology and network clustering. A web server for classifying MGF proteins was established and kept available for free at http://www.computationalbiology.cn/MGF/home.html. Results showed that MGF groups of the same family were most similar to each other and had conserved sequence motifs; the genetic diversity of MGF groups varied widely, mainly due to the occurrence of indels. In addition, the MGF proteins were predicted to have large structural and functional diversity, and MGF proteins of the same MGF family tended to have similar structure, location and function. Reconstruction of the ancestral states of MGF groups along the ASFV phylogeny showed that most MGF groups experienced either the copy number variations or the gain-or-loss changes, and most of these changes happened within strains of the same genotype. It is found that the copy number decrease and the loss of MGF groups were much larger than the copy number increase and the gain of MGF groups, respectively, suggesting the ASFV tended to lose MGF proteins in the evolution. Overall, the work provides a detailed classification for MGF proteins and would facilitate further research on MGF proteins.
非洲猪瘟病毒(ASFV)对养猪业构成严重威胁。多基因家族(MGF)蛋白广泛分布于 ASFVs 中,通常分为五个家族,包括 MGF-100、MGF-110、MGF-300、MGF-360 和 MGF-505。然而,大多数 MGF 蛋白尚未得到很好的特征描述和分类。为了弥补这一空白,本研究首先根据蛋白质序列同源性和网络聚类将 MGF 蛋白分为 31 组。建立了一个用于分类 MGF 蛋白的网络服务器,并在 http://www.computationalbiology.cn/MGF/home.html 上免费提供。结果表明,同一家族的 MGF 组彼此最为相似,具有保守的序列基序;MGF 组的遗传多样性差异很大,主要是由于插入缺失的发生。此外,预测 MGF 蛋白具有很大的结构和功能多样性,同一 MGF 家族的 MGF 蛋白往往具有相似的结构、位置和功能。根据 ASFV 系统发育沿 MGF 组重建祖先状态表明,大多数 MGF 组经历了拷贝数变化或获得或丢失的变化,这些变化大多发生在同一基因型的菌株内。发现 MGF 组的拷贝数减少和丢失比 MGF 组的拷贝数增加和获得要大得多,这表明 ASFV 在进化过程中倾向于失去 MGF 蛋白。总的来说,这项工作为 MGF 蛋白提供了详细的分类,将有助于进一步研究 MGF 蛋白。