Department of Biology, San Diego State University, 5500 Campanile Drive, San Diego, CA, 92182, USA.
Department of Biology, University of California, 900 University Avenue, Riverside, CA, 92521, USA.
Mol Ecol Resour. 2017 May;17(3):418-430. doi: 10.1111/1755-0998.12575. Epub 2016 Aug 18.
Next-generation sequencing technologies now allow researchers of non-model systems to perform genome-based studies without the requirement of a (often unavailable) closely related genomic reference. We evaluated the role of restriction endonuclease (RE) selection in double-digest restriction-site-associated DNA sequencing (ddRADseq) by generating reduced representation genome-wide data using four different RE combinations. Our expectation was that RE selections targeting longer, more complex restriction sites would recover fewer loci than RE with shorter, less complex sites. We sequenced a diverse sample of non-model arachnids, including five congeneric pairs of harvestmen (Opiliones) and four pairs of spiders (Araneae). Sample pairs consisted of either conspecifics or closely related congeneric taxa, and in total 26 sample pair analyses were tested. Sequence demultiplexing, read clustering and variant calling were performed in the pyRAD program. The 6-base pair cutter EcoRI combined with methylated site-specific 4-base pair cutter MspI produced, on average, the greatest numbers of intra-individual loci and shared loci per sample pair. As expected, the number of shared loci recovered for a sample pair covaried with the degree of genetic divergence, estimated with cytochrome oxidase I sequences, although this relationship was non-linear. Our comparative results will prove useful in guiding protocol selection for ddRADseq experiments on many arachnid taxa where reference genomes, even from closely related species, are unavailable.
下一代测序技术现在使非模式系统的研究人员能够进行基于基因组的研究,而无需(通常不可用的)密切相关的基因组参考。我们通过使用四种不同的 RE 组合生成减少代表性的全基因组数据来评估限制内切酶 (RE) 选择在双消化限制位点相关 DNA 测序 (ddRADseq) 中的作用。我们预计,针对更长、更复杂的限制位点的 RE 选择会比具有较短、较简单位点的 RE 恢复更少的基因座。我们对包括五种同属 harvestmen(Opiliones)和四种蜘蛛(Araneae)在内的多种非模式蛛形纲动物进行了测序。样本对由同种或密切相关的同属分类群组成,共测试了 26 个样本对分析。序列去复用、读取聚类和变异调用在 pyRAD 程序中进行。6 个碱基对的 EcoRI 与甲基化位点特异性的 4 个碱基对的 MspI 联合使用,平均而言,每个样本对产生的个体基因座和共享基因座数量最多。正如预期的那样,从 COI 序列估计的遗传分化程度与每个样本对回收的共享基因座数量相关,尽管这种关系是非线性的。我们的比较结果将有助于指导 ddRADseq 实验方案的选择,对于许多蛛形纲动物类群,即使是来自密切相关的物种,也无法获得参考基因组。