Peck Michelle A, Brandhagen Michael D, Marshall Charla, Diegoli Toni M, Irwin Jodi A, Sturk-Andreaggi Kimberly
Armed Forces DNA Identification Laboratory, 115 Purple Heart Drive, Dover AFB, DE 19902, United States; American Registry of Pathology Sciences, 9210 Corporate Boulevard, Suite 120, Rockville, MD 20850, United States.
Federal Bureau of Investigation Laboratory, 2501 Investigation Parkway, Quantico, VA 22135, United States.
Forensic Sci Int Genet. 2016 Sep;24:103-111. doi: 10.1016/j.fsigen.2016.06.003. Epub 2016 Jun 6.
Sanger-type sequencing (STS) of mitochondrial DNA (mtDNA), specifically the control region (CR), is routinely employed in forensics in human identification and missing persons scenarios. Yet next-generation sequencing (NGS) has the potential to overcome some of the major limitations of STS processing, permitting reasonable paths forward for full mitochondrial genome (mtGenome) sequencing, while also offering higher-throughput and higher sensitivity capabilities. To establish the accuracy and reproducibility of NGS for the development of mtDNA data, 90 DNA extracts that were previously used to generate forensic quality full mtGenomes using STS were sequenced using Nextera XT library preparation and the Illumina MiSeq. Using the same amplicon product, replicate library sets were generated and sequenced at different laboratories, and analysis was performed in replicate using the CLC Genomics Workbench. Both sequencing sets resulted in 99.998% of positions with greater than 10X coverage when 96 samples (including controls) were multiplexed. Overall, 99.9996% concordance was observed between the NGS data and the STS data for the full mtGenome. The only "discordant" calls involved low level point heteroplasmies, with the differences resulting from stochastic variation and/or the increased sensitivity of NGS. Higher sensitivity also allowed for the detection of a mixed sample previously not detected with STS. Additionally, variant calls were reproducible between sequencing sets and between software analysis versions with the variant frequency only differing by 0.23% and 0.01%, respectively. Further validation studies and specialized software functionality tailored to forensic practice should facilitate the incorporation of NGS processing into standard casework applications. The data herein comprise the largest, and likely most thoroughly examined, complete mtGenome STS-NGS concordance dataset available.
线粒体DNA(mtDNA)的桑格型测序(STS),特别是控制区(CR),在法医学的人类身份鉴定和失踪人员调查中经常使用。然而,下一代测序(NGS)有可能克服STS处理的一些主要限制,为完整线粒体基因组(mtGenome)测序提供合理的途径,同时还具有更高的通量和更高的灵敏度。为了确定用于mtDNA数据开发的NGS的准确性和可重复性,使用Nextera XT文库制备和Illumina MiSeq对90份先前用于通过STS生成法医级完整mtGenomes的DNA提取物进行了测序。使用相同的扩增子产物,在不同实验室生成并测序重复文库集,并使用CLC基因组学工作台进行重复分析。当对96个样本(包括对照)进行多重分析时,两个测序集的99.998%的位置覆盖率大于10倍。总体而言,完整mtGenome的NGS数据和STS数据之间的一致性为99.9996%。唯一的“不一致”调用涉及低水平的点异质性,差异是由随机变异和/或NGS的更高灵敏度导致的。更高的灵敏度还允许检测到先前用STS未检测到的混合样本。此外,变异调用在测序集之间和软件分析版本之间是可重复的,变异频率仅分别相差0.23%和0.01%。进一步的验证研究和针对法医实践定制的专门软件功能应有助于将NGS处理纳入标准案件工作应用中。本文中的数据构成了可用的最大且可能是最全面检查的完整mtGenome STS-NGS一致性数据集。