Parker Glendon J, Leppert Tami, Anex Deon S, Hilmer Jonathan K, Matsunami Nori, Baird Lisa, Stevens Jeffery, Parsawar Krishna, Durbin-Johnson Blythe P, Rocke David M, Nelson Chad, Fairbanks Daniel J, Wilson Andrew S, Rice Robert H, Woodward Scott R, Bothner Brian, Hart Bradley R, Leppert Mark
Department of Biology, Utah Valley University, Orem, Utah, United States of America.
Protein-Based Identification Technologies L.L.C., Orem, Utah, United States of America.
PLoS One. 2016 Sep 7;11(9):e0160653. doi: 10.1371/journal.pone.0160653. eCollection 2016.
Human identification from biological material is largely dependent on the ability to characterize genetic polymorphisms in DNA. Unfortunately, DNA can degrade in the environment, sometimes below the level at which it can be amplified by PCR. Protein however is chemically more robust than DNA and can persist for longer periods. Protein also contains genetic variation in the form of single amino acid polymorphisms. These can be used to infer the status of non-synonymous single nucleotide polymorphism alleles. To demonstrate this, we used mass spectrometry-based shotgun proteomics to characterize hair shaft proteins in 66 European-American subjects. A total of 596 single nucleotide polymorphism alleles were correctly imputed in 32 loci from 22 genes of subjects' DNA and directly validated using Sanger sequencing. Estimates of the probability of resulting individual non-synonymous single nucleotide polymorphism allelic profiles in the European population, using the product rule, resulted in a maximum power of discrimination of 1 in 12,500. Imputed non-synonymous single nucleotide polymorphism profiles from European-American subjects were considerably less frequent in the African population (maximum likelihood ratio = 11,000). The converse was true for hair shafts collected from an additional 10 subjects with African ancestry, where some profiles were more frequent in the African population. Genetically variant peptides were also identified in hair shaft datasets from six archaeological skeletal remains (up to 260 years old). This study demonstrates that quantifiable measures of identity discrimination and biogeographic background can be obtained from detecting genetically variant peptides in hair shaft protein, including hair from bioarchaeological contexts.
从生物材料中进行人类身份识别很大程度上依赖于对DNA中基因多态性进行表征的能力。不幸的是,DNA在环境中会降解,有时会降至无法通过PCR扩增的水平。然而,蛋白质在化学性质上比DNA更稳定,并且可以持续更长时间。蛋白质还以单氨基酸多态性的形式包含遗传变异。这些可用于推断非同义单核苷酸多态性等位基因的状态。为了证明这一点,我们使用基于质谱的鸟枪法蛋白质组学对66名欧美受试者的毛干蛋白质进行了表征。从受试者DNA的22个基因的32个位点中正确推算出总共596个单核苷酸多态性等位基因,并使用桑格测序法进行了直接验证。使用乘积法则估计欧洲人群中产生个体非同义单核苷酸多态性等位基因谱的概率,得出最大鉴别力为1/12,500。欧美受试者推算出的非同义单核苷酸多态性谱在非洲人群中出现的频率要低得多(最大似然比 = 11,000)。对于另外10名具有非洲血统的受试者采集的毛干,情况则相反,其中一些谱在非洲人群中更为常见。在来自六具考古骨骼遗骸(距今可达260年)的毛干数据集中也鉴定出了基因变异肽段。这项研究表明,通过检测毛干蛋白质(包括来自生物考古背景的毛发)中的基因变异肽段,可以获得身份鉴别和生物地理背景的可量化指标。