使用遗传信息的社会调查中的数据质量控制。

Data quality control in social surveys using genetic information.

作者信息

Li Yi, Guo Guang

机构信息

a Department of Sociology , University of North Carolina at Chapel Hill , Chapel Hill , North Carolina , USA.

出版信息

Biodemography Soc Biol. 2014;60(2):212-28. doi: 10.1080/19485565.2014.953029.

DOI:10.1080/19485565.2014.953029

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6642059/

Abstract

This article introduces a novel way of taking advantage of genetic data in social surveys for the purposes of data quality control. Genetic information could detect and repair data issues such as missing data, reporting errors, differences in measures of the same variable, and flawed data. Using data from two surveys, the College Roommate Study (ROOM) and the National Longitudinal Study of Adolescent Health (Add Health), we show that proportion identical by descent score (a measure of genetic relationships) can identify "misreported" and unreported sibling type and detect misrepresented participants, bio-ancestry score (a measure of ancestral population memberships) can repair and recover missing race and discrepancies among different measures of self-reported race, and sex chromosomal information may help cross-check self-reported sex. This article represents an initial effort to utilize genetic data for the purposes of data quality control. As genetic data become increasingly available, researchers may explore more approaches to improving data quality.

摘要

本文介绍了一种在社会调查中利用遗传数据进行数据质量控制的新方法。遗传信息可以检测和修复数据问题，如缺失数据、报告错误、同一变量测量值的差异以及有缺陷的数据。利用来自两项调查的数据，即大学室友研究（ROOM）和青少年健康全国纵向研究（Add Health），我们表明，同源分数比例（一种遗传关系的度量）可以识别“误报”和未报告的兄弟姐妹类型，并检测出被错误表述的参与者，生物祖先分数（一种祖先群体成员身份的度量）可以修复和恢复缺失的种族以及不同自我报告种族测量之间的差异，而性染色体信息可能有助于交叉核对自我报告的性别。本文是利用遗传数据进行数据质量控制的初步尝试。随着遗传数据越来越容易获取，研究人员可能会探索更多提高数据质量的方法。

相似文献

1

Data quality control in social surveys using genetic information.

Biodemography Soc Biol. 2014;60(2):212-28. doi: 10.1080/19485565.2014.953029.

2

Genetic bio-ancestry and social construction of racial classification in social surveys in the contemporary United States.

Demography. 2014 Feb;51(1):141-72. doi: 10.1007/s13524-013-0242-0.

3

Color, race, and genomic ancestry in Brazil: dialogues between anthropology and genetics.

Curr Anthropol. 2009 Dec;50(6):787-819. doi: 10.1086/644532.

4

Genetically determined ancestry is more informative than self-reported race in HIV-infected and -exposed children.

Medicine (Baltimore). 2016 Sep;95(36):e4733. doi: 10.1097/MD.0000000000004733.

5

The molecular reinscription of race: a comment on "Genetic bio-ancestry and social construction of racial classification in social surveys in the contemporary United States".

Demography. 2014 Dec;51(6):2333-6. doi: 10.1007/s13524-014-0342-5.

6

Health and behavior risks of adolescents with mixed-race identity.

Am J Public Health. 2003 Nov;93(11):1865-70. doi: 10.2105/ajph.93.11.1865.

7

Accuracy of self-reported versus measured weight over adolescence and young adulthood: findings from the national longitudinal study of adolescent health, 1996-2008.

Am J Epidemiol. 2014 Jul 15;180(2):153-9. doi: 10.1093/aje/kwu133. Epub 2014 Jun 18.

8

The National Longitudinal Study of Adolescent to Adult Health (Add Health) sibling pairs genome-wide data.

Behav Genet. 2015 Jan;45(1):12-23. doi: 10.1007/s10519-014-9692-4. Epub 2014 Nov 7.

9

Latent Classes of Polysubstance Use Among Adolescents in the United States: Intersections of Sexual Identity with Sex, Age, and Race/Ethnicity.

LGBT Health. 2019 Apr;6(3):116-125. doi: 10.1089/lgbt.2018.0149. Epub 2019 Mar 1.

10

Factors associated with self-reported STDs: data from a national survey.

Sex Transm Dis. 1994 Nov-Dec;21(6):303-8. doi: 10.1097/00007435-199411000-00002.

本文引用的文献

1

Genetic and educational assortative mating among US adults.

Proc Natl Acad Sci U S A. 2014 Jun 3;111(22):7996-8000. doi: 10.1073/pnas.1321426111. Epub 2014 May 19.

2

Role of mother's genes and environment in postpartum depression.

Proc Natl Acad Sci U S A. 2011 May 17;108(20):8189-93. doi: 10.1073/pnas.1014129108. Epub 2011 May 16.

3

Sex-chromosome evolution: recent progress and the influence of male and female heterogamety.

Nat Rev Genet. 2011 Mar;12(3):157-66. doi: 10.1038/nrg2948. Epub 2011 Feb 8.

4

Reconciling the analysis of IBD and IBS in complex trait studies.

Nat Rev Genet. 2010 Nov;11(11):800-5. doi: 10.1038/nrg2865. Epub 2010 Sep 28.

5

Environmental contingencies and genetic propensities: social capital, educational continuation, and dopamine receptor gene DRD2.

AJS. 2008;114 Suppl:S260-86. doi: 10.1086/592204.

6

A panel of ancestry informative markers for estimating individual biogeographical ancestry and admixture from four continents: utility and applications.

Hum Mutat. 2008 May;29(5):648-58. doi: 10.1002/humu.20695.

7

The genetic structure of Pacific Islanders.

PLoS Genet. 2008 Jan;4(1):e19. doi: 10.1371/journal.pgen.0040019.

8

Genetic variation and population structure in native Americans.

PLoS Genet. 2007 Nov;3(11):e185. doi: 10.1371/journal.pgen.0030185.

9

PLINK: a tool set for whole-genome association and population-based linkage analyses.

Am J Hum Genet. 2007 Sep;81(3):559-75. doi: 10.1086/519795. Epub 2007 Jul 25.

10

Racial self-categorization in adolescence: multiracial development and social pathways.

Child Dev. 2006 Sep-Oct;77(5):1298-308. doi: 10.1111/j.1467-8624.2006.00935.x.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

文档翻译

学术文献翻译模型，支持多种主流文档格式。