Suppr超能文献

一种用于从动态选择的单核苷酸多态性推断近亲关系的似然比框架。

A likelihood ratio framework for inferring close kinship from dynamically selected SNPs.

作者信息

Ge Jianye, Budowle Bruce, Cariaso Michael, Mittelman Kristen, Mittelman David

机构信息

Othram Inc., The Woodlands, TX, United States.

Department of Forensic Medicine, University of Helsinki, Helsinki, Finland.

出版信息

Front Genet. 2025 Jul 23;16:1635734. doi: 10.3389/fgene.2025.1635734. eCollection 2025.

Abstract

Forensic genetic genealogy (FGG) is a force-multiplier for human identification, leveraging dense single nucleotide polymorphism (SNP) data to infer relationships through identity by descent (IBD) segment analysis. Although powerful for investigative lead generation, broad adoption of SNP-based identification methods by the forensic community, especially medical examiners and crime laboratories, necessitates likelihood ratio (LR)-based relationship testing, to align with traditional kinship testing standards. To address this gap, a novel method was developed that incorporates LR calculations into FGG and SNP testing workflows. This approach is unique in that it dynamically selects unlinked, highly informative SNPs based on configurable thresholds for minor allele frequency (MAF) and minimum genetic distance for a robust and reliable analysis. Employing a curated panel of 222,366 SNPs from gnomAD v4 and data from the 1,000 genomes project, high accuracy in resolving relationships up to second-degree relatives can be achieved. For example, a subset of 126 SNPs (MAF > 0.4, minimum genetic distance of 30 cM) yielded 96.8% accuracy and a weighted F1 score of 0.975 across 2,244 tested pairs. This LR-based methodology enables forensic laboratories to select informative SNPs and integrate modern genomic data with existing accredited relationship testing frameworks, providing critical statistical support for close-relationship comparisons and enhances the rigor of FGG- and SNP-based human identification applications.

摘要

法医基因族谱学(FGG)是一种用于人类身份识别的力量倍增器,它利用密集的单核苷酸多态性(SNP)数据,通过同源片段(IBD)分析来推断亲属关系。尽管对于生成调查线索很强大,但法医界,尤其是法医和犯罪实验室广泛采用基于SNP的识别方法,需要基于似然比(LR)的亲属关系测试,以符合传统的亲属关系测试标准。为了填补这一空白,开发了一种新方法,将LR计算纳入FGG和SNP测试工作流程。这种方法的独特之处在于,它根据次要等位基因频率(MAF)的可配置阈值和最小遗传距离动态选择不连锁、信息丰富的SNP,以进行稳健可靠的分析。使用来自gnomAD v4的222,366个SNP的精选面板和1000基因组计划的数据,可以在解析二级亲属以内的关系方面实现高精度。例如,126个SNP的子集(MAF>0.4,最小遗传距离为30 cM)在2244对测试对中产生了96.8%的准确率和0.975的加权F1分数。这种基于LR的方法使法医实验室能够选择信息丰富的SNP,并将现代基因组数据与现有的认可亲属关系测试框架相结合,为近亲关系比较提供关键的统计支持,并提高基于FGG和SNP的人类身份识别应用的严谨性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/140a/12325062/8a122bb0428c/fgene-16-1635734-g001.jpg

相似文献

1
A likelihood ratio framework for inferring close kinship from dynamically selected SNPs.
Front Genet. 2025 Jul 23;16:1635734. doi: 10.3389/fgene.2025.1635734. eCollection 2025.
2
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
5
Home treatment for mental health problems: a systematic review.
Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.
6
Diagnostic test accuracy and cost-effectiveness of tests for codeletion of chromosomal arms 1p and 19q in people with glioma.
Cochrane Database Syst Rev. 2022 Mar 2;3(3):CD013387. doi: 10.1002/14651858.CD013387.pub2.
8
The effect of sample site and collection procedure on identification of SARS-CoV-2 infection.
Cochrane Database Syst Rev. 2024 Dec 16;12(12):CD014780. doi: 10.1002/14651858.CD014780.
9
Development of a SNP Panel for Geographic Assignment and Population Monitoring of Jaguars ().
Ecol Evol. 2025 May 22;15(5):e71465. doi: 10.1002/ece3.71465. eCollection 2025 May.

本文引用的文献

1
The GIAB genomic stratifications resource for human reference genomes.
Nat Commun. 2024 Oct 19;15(1):9029. doi: 10.1038/s41467-024-53260-y.
2
Shotgun DNA sequencing for human identification: Dynamic SNP selection and likelihood ratio calculations accounting for errors.
Forensic Sci Int Genet. 2025 Jan;74:103146. doi: 10.1016/j.fsigen.2024.103146. Epub 2024 Sep 7.
3
Prioritizing privacy and presentation of supportable hypothesis testing in forensic genetic genealogy investigations.
Biotechniques. 2024;76(9):425-431. doi: 10.1080/07366205.2024.2386218. Epub 2024 Aug 9.
5
An approach to unified formulae for likelihood ratio calculation in pairwise kinship analysis.
Front Genet. 2024 Feb 7;15:1226228. doi: 10.3389/fgene.2024.1226228. eCollection 2024.
6
A genomic mutational constraint map using variation in 76,156 human genomes.
Nature. 2024 Jan;625(7993):92-100. doi: 10.1038/s41586-023-06045-0. Epub 2023 Dec 6.
7
Improved computations for relationship inference using low-coverage sequencing data.
BMC Bioinformatics. 2023 Mar 9;24(1):90. doi: 10.1186/s12859-023-05217-z.
8
Linkage and linkage disequilibrium among the markers in the forensic MPS panels.
J Forensic Sci. 2021 Sep;66(5):1637-1646. doi: 10.1111/1556-4029.14724. Epub 2021 Apr 22.
9
Rapid, Phase-free Detection of Long Identity-by-Descent Segments Enables Effective Relationship Classification.
Am J Hum Genet. 2020 Apr 2;106(4):453-466. doi: 10.1016/j.ajhg.2020.02.012. Epub 2020 Mar 19.
10
Crossover interference and sex-specific genetic maps shape identical by descent sharing in close relatives.
PLoS Genet. 2019 Dec 20;15(12):e1007979. doi: 10.1371/journal.pgen.1007979. eCollection 2019 Dec.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验