Department of Forensic Genetics and Forensic Toxicology, National Board of Forensic Medicine, Linköping, Sweden; Department of Clinical and Experimental Medicine, Faculty of Medicine and Health Sciences, Linköping University, Linköping, Sweden.
Peter Sjölund AB, Härnösand, Sweden.
Forensic Sci Int Genet. 2020 May;46:102233. doi: 10.1016/j.fsigen.2020.102233. Epub 2020 Jan 17.
Recently a number of high profile crime cases (e.g. the "Golden State Killer") have successfully been solved or given new leads with the use of genome wide DNA data in combination with pairwise matching from individuals present in genealogy DNA databases. Such databases will primarily involve distant relatives which in turn require a large amount of genetic information, in the range of several hundred thousand to millions of SNPs, to be genotyped. While it nowadays is fairly straightforward to obtain such as data from high quality and high quantity DNA, it is still a challenge for degraded DNA of low quantity such in the case of forensic samples. Here we present a successful effort in obtaining genome-wide genotype data from human remains. The goal was to get investigative leads in order to identify the remains of an unknown male ("the Ekeby man") that was found murdered in the south of Sweden in 2003. Whole-genome sequencing was performed on DNA originating from a bone sample. Three replicates of libraries were prepared using ThruPLEX DNA-seq Kit (Takara) which were sequenced on a HiSeq X instrument (Illumina). A mean coverage of 30X was obtained when the sequencing reads were mapped to a human reference genome. Following further bioinformatic processing, allele calling, quality checks and filtering to match the genealogy DNA database SNPs, genotypes for approximately one million SNPs were established. The resulting SNP genotypes were then used to search for relatives in the genealogy DNA database GEDmatch (www.gedmatch.com). A candidate list of relatives was obtained which was further processed using traditional genealogy methods in order to get leads about the identity of the unknown. In summary, this report shows how whole-genome sequencing successfully can be applied on forensic samples to create the SNP genotypes required for searches in genealogy DNA databases for the purpose of generating leads to identify missing or unknown persons, including perpetrators and victims.
最近,一些备受瞩目的犯罪案件(例如“金州杀手”案)成功地利用全基因组 DNA 数据与家谱 DNA 数据库中个体的成对匹配相结合得到了解决或新的线索。此类数据库主要涉及远亲,这反过来又需要大量的遗传信息,范围从几十万到几百万个 SNP 进行基因分型。虽然现在从高质量和大量的 DNA 中获取此类数据相当简单,但对于法医样本中低质量和低数量的 DNA 来说仍然是一个挑战。在这里,我们成功地从人类遗骸中获取了全基因组基因型数据。目的是获取调查线索,以确定在 2003 年于瑞典南部被谋杀的一名未知男性(“埃克比男子”)的遗体身份。对源自骨样本的 DNA 进行了全基因组测序。使用 ThruPLEX DNA-seq 试剂盒(Takara)制备了三个文库重复样本,然后在 HiSeq X 仪器(Illumina)上进行测序。当测序reads 被映射到人类参考基因组时,获得了 30X 的平均覆盖度。进一步进行生物信息学处理、等位基因调用、质量检查和过滤以匹配家谱 DNA 数据库 SNPs 后,建立了大约一百万个 SNPs 的基因型。然后将产生的 SNP 基因型用于在家谱 DNA 数据库 GEDmatch(www.gedmatch.com)中搜索亲属。获得了一个候选亲属列表,然后使用传统的家谱学方法进一步处理,以获取有关未知身份的线索。总之,本报告展示了全基因组测序如何成功地应用于法医样本,以创建用于家谱 DNA 数据库搜索的 SNP 基因型,从而为识别失踪或未知人员,包括犯罪者和受害者,生成线索。