Young Robert G, Yu Jiaojia, Cote Marie-José, Hanner Robert H
University of Guelph, Guelph, Canada University of Guelph Guelph Canada.
Canadian Food Inspection Agency, Ottawa, Canada Canadian Food Inspection Agency Ottawa Canada.
Biodivers Data J. 2020 Apr 23;8:e50630. doi: 10.3897/BDJ.8.e50630. eCollection 2020.
Molecular identification methods, such as DNA barcoding, rely on centralized databases populated with morphologically identified individuals and their referential nucleotide sequence records. As molecular identification approaches have expanded in use to fields such as food fraud, environmental surveys, and border surveillance, there is a need for diverse international data sets. Although central data repositories, like the Barcode of Life Datasystems (BOLD), provided workarounds for formatting data for upload, these workarounds can be taxing on researchers with few resources and limited funding. To address these concerns, we present the Molecular Data Organization for Publication (MDOP) R package to assist researchers in uploading data to public databases. To illustrate the use of these scripts, we use the BOLD system as an example. The main intent of this writing is to assist in the movement of data, from academic, governmental, and other institutional computer systems, to public locations. The movement of these data can then better contribute to the global DNA barcoding initiative and other global molecular data efforts.
分子鉴定方法,如DNA条形码技术,依赖于集中式数据库,这些数据库中存有形态学鉴定的个体及其参考核苷酸序列记录。随着分子鉴定方法在食品欺诈、环境调查和边境监测等领域的应用不断扩展,需要多样化的国际数据集。尽管像生命条形码数据系统(BOLD)这样的中央数据存储库提供了数据上传格式的变通方法,但这些变通方法对于资源稀缺且资金有限的研究人员来说可能负担较重。为了解决这些问题,我们推出了用于出版物的分子数据组织(MDOP)R包,以协助研究人员将数据上传到公共数据库。为了说明这些脚本的使用方法,我们以BOLD系统为例。本文的主要目的是协助数据从学术、政府和其他机构的计算机系统转移到公共存储位置。这些数据的转移随后可以更好地推动全球DNA条形码计划和其他全球分子数据工作。