Suppr超能文献

姓氏推断识别个人基因组。

Identifying personal genomes by surname inference.

机构信息

Whitehead Institute for Biomedical Research, 9 Cambridge Center, Cambridge, MA 02142, USA.

出版信息

Science. 2013 Jan 18;339(6117):321-4. doi: 10.1126/science.1229566.

Abstract

Sharing sequencing data sets without identifiers has become a common practice in genomics. Here, we report that surnames can be recovered from personal genomes by profiling short tandem repeats on the Y chromosome (Y-STRs) and querying recreational genetic genealogy databases. We show that a combination of a surname with other types of metadata, such as age and state, can be used to triangulate the identity of the target. A key feature of this technique is that it entirely relies on free, publicly accessible Internet resources. We quantitatively analyze the probability of identification for U.S. males. We further demonstrate the feasibility of this technique by tracing back with high probability the identities of multiple participants in public sequencing projects.

摘要

在基因组学中,分享没有标识符的测序数据集已经成为一种常见做法。在这里,我们报告说,姓氏可以通过分析 Y 染色体上的短串联重复序列(Y-STRs)和查询娱乐性遗传家谱数据库来从个人基因组中恢复。我们表明,姓氏与其他类型的元数据(如年龄和州)相结合,可以用于三角测量目标的身份。该技术的一个关键特征是它完全依赖于免费的、可公开访问的互联网资源。我们对美国男性的识别概率进行了定量分析。我们通过高概率追溯公共测序项目中的多个参与者的身份,进一步证明了该技术的可行性。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验