Lapp S A, Geraldo J A, Chien J-T, Ay F, Pakala S B, Batugedara G, Humphrey J, DeBARRY J D, Le Roch K G, Galinski M R, Kissinger J C
Emory Vaccine Center,Yerkes National Primate Research Center,Emory University,Atlanta, GA,USA.
Federal University of Minas Gerais,Belo Horizonte, MG,Brazil.
Parasitology. 2018 Jan;145(1):71-84. doi: 10.1017/S0031182017001329. Epub 2017 Jul 19.
Plasmodium knowlesi has risen in importance as a zoonotic parasite that has been causing regular episodes of malaria throughout South East Asia. The P. knowlesi genome sequence generated in 2008 highlighted and confirmed many similarities and differences in Plasmodium species, including a global view of several multigene families, such as the large SICAvar multigene family encoding the variant antigens known as the schizont-infected cell agglutination proteins. However, repetitive DNA sequences are the bane of any genome project, and this and other Plasmodium genome projects have not been immune to the gaps, rearrangements and other pitfalls created by these genomic features. Today, long-read PacBio and chromatin conformation technologies are overcoming such obstacles. Here, based on the use of these technologies, we present a highly refined de novo P. knowlesi genome sequence of the Pk1(A+) clone. This sequence and annotation, referred to as the 'MaHPIC Pk genome sequence', includes manual annotation of the SICAvar gene family with 136 full-length members categorized as type I or II. This sequence provides a framework that will permit a better understanding of the SICAvar repertoire, selective pressures acting on this gene family and mechanisms of antigenic variation in this species and other pathogens.
诺氏疟原虫作为一种人畜共患寄生虫,其重要性日益凸显,在东南亚地区不断引发疟疾疫情。2008年生成的诺氏疟原虫基因组序列突出并证实了疟原虫物种之间的许多异同,包括对几个多基因家族的整体认识,例如编码被称为裂殖体感染细胞凝集蛋白的可变抗原的大型SICAvar多基因家族。然而,重复DNA序列是所有基因组项目的难题,该项目以及其他疟原虫基因组项目也未能幸免这些基因组特征所造成的缺口、重排及其他问题。如今,长读长PacBio和染色质构象技术正在克服这些障碍。在此,基于这些技术的应用,我们展示了Pk1(A+)克隆的高度精细的诺氏疟原虫从头基因组序列。这个序列及注释,即“MaHPIC Pk基因组序列”,包括对SICAvar基因家族的人工注释,其中136个全长成员被归类为I型或II型。该序列提供了一个框架,有助于更好地理解SICAvar基因库、作用于该基因家族的选择压力以及该物种和其他病原体的抗原变异机制。