Institute of Environmental Science and Research (ESR Ltd.), Mt Albert Science Centre, Private Bag 92-021, Auckland Mail Centre, Auckland 1142, New Zealand.
Forensic Sci Int Genet. 2013 Jul;7(4):444-52. doi: 10.1016/j.fsigen.2013.03.005. Epub 2013 Apr 15.
The ability to predict externally visible characteristics (EVCs) from DNA has appeal for use in forensic science, particularly where a forensic database match is not made and an eye witness account is unavailable. This technology has yet to be implemented in casework in New Zealand. The broad cultural diversity and likely population stratification within New Zealand dictates that any EVC predictions made using anonymous DNA must perform accurately in the absence of knowledge of the donor's ancestral background. Here we construct classification tree models with SNPs of known association with eye colour phenotypes in three categories, blue vs. non-blue, brown vs. non-brown and intermediate vs. non-intermediate. A set of nineteen SNPs from ten different known or suspected pigmentation genes were selected from the literature. A training dataset of 101 unrelated individuals from the New Zealand population and representing different ancestral backgrounds were used. We constructed four alternate models capable of predicting eye colour from the DNA genotypes of SNPs located within the HERC2, OCA2, TYR and SLC24A4 genes using probability calculation and classification trees. The final model selected for eye colour prediction exhibited high levels of accuracy for both blue (89%) and brown eye colour (94%). Models were further assessed with a test set of 25 'blind' samples where phenotype was unknown, with blue and brown eye colour predicted correctly where model thresholds were met. Classification trees offer an aesthetically simple and comprehendible model to predict blue and brown eye colour.
从 DNA 预测外部可见特征 (EVC) 的能力在法医学中具有吸引力,特别是在没有进行法医数据库匹配且没有目击者证词的情况下。这项技术尚未在新西兰的案例工作中实施。新西兰广泛的文化多样性和可能的人口分层,要求使用匿名 DNA 进行的任何 EVC 预测在不了解供体祖先背景的情况下必须准确进行。在这里,我们使用来自新西兰人群的 101 个无关个体的训练数据集,构建了与三种眼表型(蓝色与非蓝色、棕色与非棕色以及中间色与非中间色)相关的 SNP 的分类树模型。从文献中选择了来自十个不同已知或疑似色素沉着基因的一组十九个 SNP。我们构建了四个替代模型,能够使用位于 HERC2、OCA2、TYR 和 SLC24A4 基因内的 SNP 的 DNA 基因型预测眼睛颜色,使用概率计算和分类树。用于眼睛颜色预测的最终模型对蓝色(89%)和棕色眼睛颜色(94%)的准确性都很高。进一步使用未知表型的 25 个“盲”样本测试集评估了模型,在满足模型阈值的情况下,正确预测了蓝色和棕色眼睛颜色。分类树提供了一种美观简单且易于理解的模型,可用于预测蓝色和棕色眼睛颜色。