Identification of individuals by trait prediction using whole-genome sequencing data.

Affiliations

Human Longevity, Inc., Mountain View, CA 94303.

Publication information

Proc Natl Acad Sci U S A. 2017 Sep 19;114(38):10166-10171. doi: 10.1073/pnas.1711125114. Epub 2017 Sep 5.

Abstract

Prediction of human physical traits and demographic information from genomic data challenges privacy and data deidentification in personalized medicine. To explore the current capabilities of phenotype-based genomic identification, we applied whole-genome sequencing, detailed phenotyping, and statistical modeling to predict biometric traits in a cohort of 1,061 participants of diverse ancestry. Individually, for a large fraction of the traits, their predictive accuracy beyond ancestry and demographic information is limited. However, we have developed a maximum entropy algorithm that integrates multiple predictions to determine which genomic samples and phenotype measurements originate from the same person. Using this algorithm, we have reidentified an average of >8 of 10 held-out individuals in an ethnically mixed cohort and an average of 5 of either 10 African Americans or 10 Europeans. This work challenges current conceptions of personal privacy and may have far-reaching ethical and legal implications.
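
The abstract does not spell out the maximum entropy algorithm, so the sketch below is not the authors' method. It is a simplified stand-in for the same task: given trait values predicted from one genome, pick the most likely phenotype record from a small pool of candidates. It assumes independent Gaussian prediction errors per trait; the trait names (height, age), the error scales in `sigma`, and all numbers are hypothetical toy values chosen for illustration.

```python
def match_score(predicted, observed, sigma):
    """Gaussian log-likelihood (up to a constant) that `observed` traits
    belong to the person whose genome produced `predicted`.
    Assumption: errors are independent across traits."""
    return -0.5 * sum(((p - o) / s) ** 2
                      for p, o, s in zip(predicted, observed, sigma))


def select_from_pool(predicted, pool, sigma):
    """Return the index of the phenotype record in `pool` that best
    agrees with the genome-based predictions."""
    scores = [match_score(predicted, obs, sigma) for obs in pool]
    return max(range(len(pool)), key=scores.__getitem__)


# Hypothetical toy data: (height in cm, age in years) for three people.
predictions = [(178.0, 34.0), (162.0, 51.0), (170.0, 42.0)]  # from genomes
phenotypes = [(161.0, 49.0), (171.5, 40.0), (180.0, 36.0)]   # measured
sigma = (6.0, 8.0)  # assumed prediction error (standard deviation) per trait

matches = [select_from_pool(p, phenotypes, sigma) for p in predictions]
print(matches)  # [2, 0, 1]
```

Each trait on its own separates candidates only weakly, but summing the per-trait scores lets several weak predictions combine into one sharper decision, which is the intuition behind integrating multiple predictions. With pools of 10, as in the abstract, the same function would simply be called with a 10-element `pool`.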
