Vergara Ismael A, Tarailo-Graovac Maja, Frech Christian, Wang Jun, Qin Zhaozhao, Zhang Ting, She Rong, Chu Jeffrey S C, Wang Ke, Chen Nansheng
Department of Molecular Biology and Biochemistry, Simon Fraser University, 8888 University Drive, Burnaby, British Columbia V5A 1S6, Canada.
BMC Genomics. 2014 Apr 2;15:255. doi: 10.1186/1471-2164-15-255.
Increasing genetic and phenotypic differences found among natural isolates of C. elegans have encouraged researchers to explore the natural variation of this nematode species.
Here we report on the identification of genomic differences between the reference strain N2 and the Hawaiian strain CB4856, one of the most genetically distant strains from N2. To identify both small- and large-scale genomic variations (GVs), we have sequenced the CB4856 genome using both Roche 454 (~400 bps single reads) and Illumina GA DNA sequencing methods (101 bps paired-end reads). Compared to previously described variants (available in WormBase), our effort uncovered twice as many single nucleotide variants (SNVs) and increased the number of small InDels almost 20-fold. Moreover, we identified and validated large insertions, most of which range from 150 bps to 1.2 kb in length in the CB4856 strain. Identified GVs had a widespread impact on protein-coding sequences, including 585 single-copy genes that have associated severe phenotypes of reduced viability in RNAi and genetics studies. Sixty of these genes are homologs of human genes associated with diseases. Furthermore, our work confirms previously identified GVs associated with differences in behavioural and biological traits between the N2 and CB4856 strains.
The identified GVs provide a rich resource for future studies that aim to explain the genetic basis for other trait differences between the N2 and CB4856 strains.
秀丽隐杆线虫自然分离株中发现的遗传和表型差异不断增加,这促使研究人员探索这种线虫物种的自然变异。
在此,我们报告了参考菌株N2与夏威夷菌株CB4856(与N2遗传距离最远的菌株之一)之间基因组差异的鉴定。为了鉴定小规模和大规模的基因组变异(GVs),我们使用罗氏454(约400 bp单读长)和Illumina GA DNA测序方法(101 bp双端读长)对CB4856基因组进行了测序。与先前描述的变异(可在WormBase中获得)相比,我们的工作发现的单核苷酸变异(SNVs)数量增加了一倍,小插入缺失(InDels)数量增加了近20倍。此外,我们鉴定并验证了大的插入片段,其中大多数在CB4856菌株中的长度范围为150 bp至1.2 kb。鉴定出的GVs对蛋白质编码序列产生了广泛影响,包括585个单拷贝基因,这些基因在RNA干扰和遗传学研究中具有与生存力降低相关的严重表型。其中60个基因是与人类疾病相关基因的同源物。此外,我们的工作证实了先前鉴定出的与N2和CB4856菌株之间行为和生物学特征差异相关的GVs。
鉴定出的GVs为未来旨在解释N2和CB4856菌株之间其他性状差异的遗传基础的研究提供了丰富的资源。