Department of Biology, Stanford University, CA 94305.
Department of Biology, Stanford University, CA 94305
G3 (Bethesda). 2020 Aug 5;10(8):2893-2902. doi: 10.1534/g3.120.401473.
In forensic familial search methods, a query DNA profile is tested against a database to determine if the query profile represents a close relative of a database entrant. One challenge for familial search is that the calculations may require specification of allele frequencies for the unknown population from which the query profile has originated. The choice of allele frequencies affects the rate at which non-relatives are erroneously classified as relatives, and allele-frequency misspecification can substantially inflate false positive rates compared to use of allele frequencies drawn from the same population as the query profile. Here, we use ancestry inference on the query profile to circumvent the high false positive rates that result from highly misspecified allele frequencies. In particular, we perform ancestry inference on the query profile and make use of allele frequencies based on its inferred genetic ancestry. In a test for sibling matches on profiles that represent unrelated individuals, we demonstrate that false positive rates for familial search with use of ancestry inference to specify the allele frequencies are similar to those seen when allele frequencies align with the population of origin of a profile. Because ancestry inference is possible to perform on query profiles, the extreme allele-frequency misspecifications that produce the highest false positive rates can be avoided. We discuss the implications of the results in the context of concerns about the forensic use of familial searching.
在法医亲缘关系搜索方法中,查询 DNA 图谱与数据库进行测试,以确定查询图谱是否代表数据库录入者的近亲。亲缘关系搜索的一个挑战是,计算可能需要指定查询图谱来源的未知人群的等位基因频率。等位基因频率的选择会影响将非亲属错误分类为亲属的速度,与使用与查询图谱来自同一人群的等位基因频率相比,等位基因频率的错误指定会大大增加假阳性率。在这里,我们使用查询图谱的祖先推断来避免由于高度错误指定的等位基因频率而导致的高假阳性率。具体来说,我们对查询图谱进行祖先推断,并利用基于其推断的遗传祖先的等位基因频率。在针对代表无关个体的图谱进行兄弟姐妹匹配测试中,我们证明使用基于祖先推断来指定等位基因频率的亲缘关系搜索的假阳性率与与图谱来源人群的等位基因频率一致时看到的假阳性率相似。由于可以对查询图谱进行祖先推断,因此可以避免产生最高假阳性率的极端等位基因频率错误指定。我们在讨论结果时考虑了对法医使用亲缘关系搜索的担忧。