Yang Cheng-Hong, Wu Kuo-Chuan, Chuang Li-Yeh, Chang Hsueh-Wei
Department of Electronic Engineering, National Kaohsiung University of Science and Technology, Kaohsiung, Taiwan.
Biomedical Engineering, Kaohsiung Medical University, Kaohsiung, Taiwan.
Front Genet. 2019 Apr 3;10:259. doi: 10.3389/fgene.2019.00259. eCollection 2019.
The mitochondrial gene cytochrome c oxidase I (COI) is commonly used for DNA barcoding in animals. However, most COI barcode nucleotides are conserved, and sequences longer than about 650 base pairs increase the computational burden of species identification. To address this problem, we propose a decision theory-based COI SNP tagging (DCST) approach that discriminates species using single nucleotide polymorphisms (SNPs), the variable nucleotides in the sequences of a group of species. Using 126 teleost mackerel fish species (order: Scombriformes) as an example, we identified 281 SNPs by aligning and trimming their COI sequences. After decision rule making, the scoring system of the DCST approach selected 49 SNPs for the 126 fish species. These COI-SNP barcodes were finally transformed into one-dimensional barcode images. The proposed DCST approach reduces computational complexity and identifies the fewest, most effective SNPs needed to discriminate species for species tagging.
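The general idea of reducing aligned COI sequences to a small set of discriminating SNPs can be illustrated with a minimal sketch. The abstract does not specify the DCST decision rules or scoring system, so the code below substitutes a plain greedy heuristic: it repeatedly picks the variable site that separates the most still-unresolved species pairs. The function names (`variable_sites`, `greedy_tag_snps`) and the toy sequences are hypothetical and are not taken from the paper.

```python
from itertools import combinations


def variable_sites(seqs):
    """Return column indices where the aligned sequences are not all identical."""
    length = len(next(iter(seqs.values())))
    return [i for i in range(length) if len({s[i] for s in seqs.values()}) > 1]


def greedy_tag_snps(seqs):
    """Greedily choose variable sites until every species pair differs at one.

    This is a stand-in heuristic, NOT the DCST scoring system from the paper.
    """
    unresolved = set(combinations(sorted(seqs), 2))
    candidates = variable_sites(seqs)
    chosen = []

    while unresolved and candidates:
        # Score a site by the number of unresolved pairs it separates.
        def score(i):
            return sum(seqs[a][i] != seqs[b][i] for a, b in unresolved)

        best = max(candidates, key=score)
        if score(best) == 0:
            break  # the remaining pairs have identical sequences
        chosen.append(best)
        # Keep only the pairs that the chosen site fails to separate.
        unresolved = {(a, b) for a, b in unresolved if seqs[a][best] == seqs[b][best]}

    return sorted(chosen)


# Toy alignment (invented data): four "species", six aligned positions.
toy = {
    "sp1": "ACGTAC",
    "sp2": "ACGTTC",
    "sp3": "ATGTAC",
    "sp4": "ATGTTC",
}

sites = greedy_tag_snps(toy)
# Each species' SNP barcode is its nucleotides at the chosen sites only.
barcodes = {name: "".join(seq[i] for i in sites) for name, seq in toy.items()}
print(sites, barcodes)
```

In this toy alignment only two of the six positions vary, and both are needed to tell the four sequences apart, so each species ends up with a two-letter barcode. A greedy choice like this is not guaranteed to find the smallest possible set of sites; it is used here only because it is short and easy to follow.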