Suppr超能文献

利用远程家族搜索推断基因组数据的身份信息。

Identity inference of genomic data using long-range familial searches.

机构信息

MyHeritage, Or Yehuda 6037606, Israel.

Department of Computer Science, Fu Foundation School of Engineering, Columbia University, New York, NY, USA.

出版信息

Science. 2018 Nov 9;362(6415):690-694. doi: 10.1126/science.aau4832. Epub 2018 Oct 11.

Abstract

Consumer genomics databases have reached the scale of millions of individuals. Recently, law enforcement authorities have exploited some of these databases to identify suspects via distant familial relatives. Using genomic data of 1.28 million individuals tested with consumer genomics, we investigated the power of this technique. We project that about 60% of the searches for individuals of European descent will result in a third-cousin or closer match, which theoretically allows their identification using demographic identifiers. Moreover, the technique could implicate nearly any U.S. individual of European descent in the near future. We demonstrate that the technique can also identify research participants of a public sequencing project. On the basis of these results, we propose a potential mitigation strategy and policy implications for human subject research.

摘要

消费者基因组数据库已达到数百万人的规模。最近,执法部门利用其中一些数据库通过远距离亲属关系来识别嫌疑人。我们使用经过消费者基因组测试的 128 万人的基因组数据,研究了该技术的效力。我们预计,约 60%的欧洲裔个体搜索结果将产生一个远房表亲或更亲近的匹配,这在理论上允许使用人口统计学标识符来识别他们。此外,该技术可能在不久的将来牵连到几乎所有的欧洲裔美国个体。我们证明,该技术还可以识别公共测序项目的研究参与者。基于这些结果,我们为人类受试者研究提出了一种潜在的缓解策略和政策影响。

相似文献

1
Identity inference of genomic data using long-range familial searches.
Science. 2018 Nov 9;362(6415):690-694. doi: 10.1126/science.aau4832. Epub 2018 Oct 11.
3
An Inference Attack on Genomic Data Using Kinship, Complex Correlations, and Phenotype Information.
IEEE/ACM Trans Comput Biol Bioinform. 2018 Jul-Aug;15(4):1333-1343. doi: 10.1109/TCBB.2017.2709740.
4
Simulating the Large-Scale Erosion of Genomic Privacy Over Time.
IEEE/ACM Trans Comput Biol Bioinform. 2018 Sep-Oct;15(5):1405-1412. doi: 10.1109/TCBB.2018.2859380. Epub 2018 Jul 24.
5
Using genetic genealogy databases in missing persons cases and to develop suspect leads in violent crimes.
Forensic Sci Int. 2019 Aug;301:107-117. doi: 10.1016/j.forsciint.2019.05.016. Epub 2019 May 14.
6
Privacy preserving protocol for detecting genetic relatives using rare variants.
Bioinformatics. 2014 Jun 15;30(12):i204-11. doi: 10.1093/bioinformatics/btu294.
7
Attacks on genetic privacy via uploads to genealogical databases.
Elife. 2020 Jan 7;9:e51810. doi: 10.7554/eLife.51810.
8
A principled approach to cross-sector genomic data access.
Bioethics. 2021 Oct;35(8):779-786. doi: 10.1111/bioe.12919. Epub 2021 Jul 12.
9
Identifying genetic relatives without compromising privacy.
Genome Res. 2014 Apr;24(4):664-72. doi: 10.1101/gr.153346.112. Epub 2014 Mar 10.
10

引用本文的文献

1
Legitimacy of investigative forensic genetic genealogy under Art. 8 ECHR.
Forensic Sci Int Synerg. 2025 Aug 18;11:100636. doi: 10.1016/j.fsisyn.2025.100636. eCollection 2025 Dec.
3
Power and Limitations of Inferring Genetic Ancestry.
Ann Hum Genet. 2025 Sep;89(5):264-273. doi: 10.1111/ahg.70007. Epub 2025 Jul 15.
4
An Upper Bound on the Power of DNA to Distinguish Pedigree Relationships.
Genes (Basel). 2025 Apr 26;16(5):492. doi: 10.3390/genes16050492.
6
Insights from social media into public perspectives on investigative genetic genealogy.
Front Genet. 2025 Jan 6;15:1482831. doi: 10.3389/fgene.2024.1482831. eCollection 2024.
7
Investigative genetic genealogy practices warranting policy attention: Results of a modified policy Delphi.
PLoS Genet. 2025 Jan 16;21(1):e1011520. doi: 10.1371/journal.pgen.1011520. eCollection 2025 Jan.
8
Privacy of single-cell gene expression data.
Patterns (N Y). 2024 Nov 8;5(11):101096. doi: 10.1016/j.patter.2024.101096.
10

本文引用的文献

1
Consumer genomics will change your life, whether you get tested or not.
Genome Biol. 2018 Aug 20;19(1):120. doi: 10.1186/s13059-018-1506-1.
2
Genealogy databases and the future of criminal investigation.
Science. 2018 Jun 8;360(6393):1078-1079. doi: 10.1126/science.aau1083.
3
Quantitative analysis of population-scale family trees with millions of relatives.
Science. 2018 Apr 13;360(6385):171-175. doi: 10.1126/science.aam9309. Epub 2018 Mar 1.
4
"Bridge to the Literature"? Third-Party Genetic Interpretation Tools and the Views of Tool Developers.
J Genet Couns. 2018 Aug;27(4):770-781. doi: 10.1007/s10897-018-0217-9. Epub 2018 Feb 7.
5
DNA.Land is a framework to collect genomes and phenomes in the era of abundant genetic information.
Nat Genet. 2018 Feb;50(2):160-165. doi: 10.1038/s41588-017-0021-8.
7
Routes for breaching and protecting genetic privacy.
Nat Rev Genet. 2014 Jun;15(6):409-21. doi: 10.1038/nrg3723. Epub 2014 May 8.
8
Forensic familial searching: scientific and social implications.
Nat Rev Genet. 2013 Jul;14(7):445. doi: 10.1038/nrg3519.
9
Identifying personal genomes by surname inference.
Science. 2013 Jan 18;339(6117):321-4. doi: 10.1126/science.1229566.
10
Cryptic distant relatives are common in both isolated and cosmopolitan genetic samples.
PLoS One. 2012;7(4):e34267. doi: 10.1371/journal.pone.0034267. Epub 2012 Apr 3.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验