Budowle Bruce, Polanskey Deborah, Allard Marc W, Chakraborty Ranajit
FBI Laboratory, 2501 Investigation Parkway, Quantico, VA 22135, USA.
J Forensic Sci. 2004 Nov;49(6):1256-61.
The SWGDAM mtDNA database is a publicly available reference source that is used for estimating the rarity of an evidence mtDNA profile. Because of the current processes for generating population data, it is unlikely that population databases are error free. The majority of the errors are due to human error and are transcriptional in nature. Phylogenetic analysis of data sets can identify some potential errors, and coupled with a review of the sequence data or alignment sheets can be a very useful tool. Seven sequences with errors have been identified by phylogenetic analysis. In addition, two samples were inadvertently modified when placed in the SWGDAM database. The corrected sequences are provided so that users can modify appropriately the current iteration of the SWGDAM database. From a practical perspective, upper bound estimates of the percentage of matching profiles obtained from a database search containing an incorrect sequence and those of a database containing the corrected sequence are not substantially different. Community wide access and review has enabled identification of errors in the SWGDAM data set and will continue to do so. The result of public accessibility is that the quality of the SWGDAM forensic dataset is always improving.
SWGDAM线粒体DNA数据库是一个公开可用的参考资源,用于评估证据线粒体DNA图谱的稀有性。由于目前生成群体数据的过程,群体数据库不太可能没有错误。大多数错误是人为错误,本质上是转录错误。数据集的系统发育分析可以识别一些潜在错误,再加上对序列数据或比对表的审查,可能是一个非常有用的工具。通过系统发育分析已识别出7条有错误的序列。此外,有两个样本在放入SWGDAM数据库时被意外修改。现提供校正后的序列,以便用户能够适当地修改SWGDAM数据库的当前版本。从实际角度来看,从包含错误序列的数据库搜索中获得的匹配图谱百分比的上限估计值与包含校正序列的数据库的估计值没有实质性差异。全社区的访问和审查使得能够识别SWGDAM数据集中的错误,并且将继续如此。公开可访问性的结果是,SWGDAM法医数据集的质量一直在提高。