Suppr超能文献

利用远程家族搜索推断基因组数据的身份信息。

Identity inference of genomic data using long-range familial searches.

机构信息

MyHeritage, Or Yehuda 6037606, Israel.

Department of Computer Science, Fu Foundation School of Engineering, Columbia University, New York, NY, USA.

出版信息

Science. 2018 Nov 9;362(6415):690-694. doi: 10.1126/science.aau4832. Epub 2018 Oct 11.

Abstract

Consumer genomics databases have reached the scale of millions of individuals. Recently, law enforcement authorities have exploited some of these databases to identify suspects via distant familial relatives. Using genomic data of 1.28 million individuals tested with consumer genomics, we investigated the power of this technique. We project that about 60% of the searches for individuals of European descent will result in a third-cousin or closer match, which theoretically allows their identification using demographic identifiers. Moreover, the technique could implicate nearly any U.S. individual of European descent in the near future. We demonstrate that the technique can also identify research participants of a public sequencing project. On the basis of these results, we propose a potential mitigation strategy and policy implications for human subject research.

摘要

消费者基因组数据库已达到数百万人的规模。最近,执法部门利用其中一些数据库通过远距离亲属关系来识别嫌疑人。我们使用经过消费者基因组测试的 128 万人的基因组数据,研究了该技术的效力。我们预计,约 60%的欧洲裔个体搜索结果将产生一个远房表亲或更亲近的匹配,这在理论上允许使用人口统计学标识符来识别他们。此外,该技术可能在不久的将来牵连到几乎所有的欧洲裔美国个体。我们证明,该技术还可以识别公共测序项目的研究参与者。基于这些结果,我们为人类受试者研究提出了一种潜在的缓解策略和政策影响。

相似文献

4
Simulating the Large-Scale Erosion of Genomic Privacy Over Time.随时间模拟大规模基因组隐私侵蚀。
IEEE/ACM Trans Comput Biol Bioinform. 2018 Sep-Oct;15(5):1405-1412. doi: 10.1109/TCBB.2018.2859380. Epub 2018 Jul 24.
8
A principled approach to cross-sector genomic data access.跨部门基因组数据访问的原则方法。
Bioethics. 2021 Oct;35(8):779-786. doi: 10.1111/bioe.12919. Epub 2021 Jul 12.
9
Identifying genetic relatives without compromising privacy.在不侵犯隐私的前提下识别基因亲属。
Genome Res. 2014 Apr;24(4):664-72. doi: 10.1101/gr.153346.112. Epub 2014 Mar 10.

引用本文的文献

1
Legitimacy of investigative forensic genetic genealogy under Art. 8 ECHR.《欧洲人权公约》第八条下调查性法医基因族谱的合法性
Forensic Sci Int Synerg. 2025 Aug 18;11:100636. doi: 10.1016/j.fsisyn.2025.100636. eCollection 2025 Dec.
3
Power and Limitations of Inferring Genetic Ancestry.推断遗传血统的能力与局限性
Ann Hum Genet. 2025 Sep;89(5):264-273. doi: 10.1111/ahg.70007. Epub 2025 Jul 15.
8
Privacy of single-cell gene expression data.单细胞基因表达数据的隐私性。
Patterns (N Y). 2024 Nov 8;5(11):101096. doi: 10.1016/j.patter.2024.101096.

本文引用的文献

2
Genealogy databases and the future of criminal investigation.家谱数据库与刑事调查的未来。
Science. 2018 Jun 8;360(6393):1078-1079. doi: 10.1126/science.aau1083.
7
Routes for breaching and protecting genetic privacy.突破和保护遗传隐私的途径。
Nat Rev Genet. 2014 Jun;15(6):409-21. doi: 10.1038/nrg3723. Epub 2014 May 8.
9
Identifying personal genomes by surname inference.姓氏推断识别个人基因组。
Science. 2013 Jan 18;339(6117):321-4. doi: 10.1126/science.1229566.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验