D'Ercole Jacopo, Prosser Sean W J, Hebert Paul D N
Centre for Biodiversity Genomics, Guelph, ON, Canada.
Department of Integrative Biology, University of Guelph, Guelph, ON, Canada.
PeerJ. 2021 Jan 14;9:e10420. doi: 10.7717/peerj.10420. eCollection 2021.
Natural history collections are a valuable resource for molecular taxonomic studies and for examining patterns of evolutionary diversification, particularly in the case of rare or extinct species. However, the recovery of sequence information is often complicated by DNA degradation. This article describes use of the Sequel platform (Pacific Biosciences) to recover the 658 bp barcode region of the mitochondrial cytochrome oxidase I (COI) gene from 380 butterflies with an average age of 50 years. Nested multiplex PCR was employed for library preparation to facilitate sequence recovery from extracts with low concentrations of highly degraded DNA. By employing circular consensus sequencing (CCS) of short amplicons (circa 150 bp), full-length barcodes could be assembled without a reference sequence, an important advance from earlier protocols which required reference sequences to guide contig assembly. The Sequel protocol recovered COI sequences (499 bp on average) from 318 of 380 specimens (84%), much higher than for Sanger sequencing (26%). Because each read derives from a single molecule, it was also possible to quantify the incidence of substitutions arising from DNA damage. In agreement with past work on sequence changes induced by DNA degradation, the transition C/G → T/A was the most prevalent category of change, but its rate of occurrence (4.58E-4) was so low that it did not impede the recovery of reliable sequences. Because the current protocol recovers COI sequence from most museum specimens, and because sequence fidelity is unaffected by nucleotide misincorporations, large-scale sequence characterization of museum specimens is feasible.
自然历史藏品是分子分类学研究以及考察进化多样化模式的宝贵资源,对于珍稀或灭绝物种而言尤其如此。然而,DNA降解常常会使序列信息的获取变得复杂。本文描述了使用Sequel平台(太平洋生物科学公司)从380只平均年龄为50年的蝴蝶中获取线粒体细胞色素氧化酶I(COI)基因658 bp条形码区域的方法。采用巢式多重PCR进行文库制备,以便从低浓度高度降解的DNA提取物中促进序列获取。通过对短扩增子(约150 bp)采用环形一致序列测序(CCS),无需参考序列即可组装全长条形码,这是相对于早期需要参考序列来指导重叠群组装的方案的一项重要进展。Sequel方案从380个标本中的318个(84%)中获得了COI序列(平均499 bp),远高于桑格测序法(26%)。由于每个读数都来自单个分子,因此还能够对由DNA损伤引起的替换发生率进行量化。与过去关于DNA降解诱导的序列变化的研究一致,转换C/G→T/A是最常见的变化类型,但其发生率(4.58E - 4)非常低,并未妨碍可靠序列的获取。由于当前方案能从大多数博物馆标本中获得COI序列,且序列保真度不受核苷酸错掺入的影响,因此对博物馆标本进行大规模序列表征是可行的。