Beghain Johann, Langlois Anne-Claire, Legrand Eric, Grange Laura, Khim Nimol, Witkowski Benoit, Duru Valentine, Ma Laurence, Bouchier Christiane, Ménard Didier, Paul Richard E, Ariey Frédéric
Institut Pasteur, Génome et Génomique des Insectes Vecteurs, Paris, France.
Institut Pasteur du Cambodge, Epidémiologie Moléculaire du Paludisme, Phnom Penh, Cambodia.
Malar J. 2016 Apr 12;15:206. doi: 10.1186/s12936-016-1258-x.
In eukaryotic genomes, deletion or amplification rates have been estimated to be a thousand more frequent than single nucleotide variation. In Plasmodium falciparum, relatively few transcription factors have been identified, and the regulation of transcription is seemingly largely influenced by gene amplification events. Thus copy number variation (CNV) is a major mechanism enabling parasite genomes to adapt to new environmental changes.
Currently, the detection of CNVs is based on quantitative PCR (qPCR), which is significantly limited by the relatively small number of genes that can be analysed at any one time. Technological advances that facilitate whole-genome sequencing, such as next generation sequencing (NGS) enable deeper analyses of the genomic variation to be performed. Because the characteristics of Plasmodium CNVs need special consideration in algorithms and strategies for which classical CNV detection programs are not suited a dedicated algorithm to detect CNVs across the entire exome of P. falciparum was developed. This algorithm is based on a custom read depth strategy through NGS data and called PlasmoCNVScan.
The analysis of CNV identification on three genes known to have different levels of amplification and which are located either in the nuclear, apicoplast or mitochondrial genomes is presented. The results are correlated with the qPCR experiments, usually used for identification of locus specific amplification/deletion.
This tool will facilitate the study of P. falciparum genomic adaptation in response to ecological changes: drug pressure, decreased transmission, reduction of the parasite population size (transition to pre-elimination endemic area).
在真核生物基因组中,缺失或扩增率估计比单核苷酸变异频繁一千倍。在恶性疟原虫中,已鉴定出的转录因子相对较少,转录调控似乎在很大程度上受基因扩增事件影响。因此,拷贝数变异(CNV)是使寄生虫基因组适应新环境变化的主要机制。
目前,CNV的检测基于定量PCR(qPCR),但一次可分析的基因数量相对较少,这极大地限制了其应用。新一代测序(NGS)等促进全基因组测序的技术进步,使得能够对基因组变异进行更深入的分析。由于恶性疟原虫CNV的特征在算法和策略上需要特殊考虑,而经典的CNV检测程序并不适用,因此开发了一种专门用于检测恶性疟原虫整个外显子组中CNV的算法。该算法基于通过NGS数据的自定义读取深度策略,称为PlasmoCNVScan。
展示了对已知具有不同扩增水平且位于核基因组、质体或线粒体基因组中的三个基因的CNV鉴定分析。结果与通常用于鉴定位点特异性扩增/缺失的qPCR实验相关。
该工具将有助于研究恶性疟原虫基因组在应对生态变化(如药物压力、传播减少、寄生虫种群数量减少(向消除前流行地区过渡))时的适应性。