P. G. Department of Zoology, Berhampur University, Berhampur, Odisha, India.
Malaria Programme, Wellcome Sanger Institute, Cambridge, United Kingdom.
Acta Trop. 2019 Nov;199:105152. doi: 10.1016/j.actatropica.2019.105152. Epub 2019 Aug 22.
DNA barcodes are widely used in evolutionary systematics for species identification based on molecular taxonomy. Almost every DNA barcode carries the detailed sequence of the barcoding gene, including sequence that is uninformative for a particular species. A technique is therefore needed to remove or reduce these uninformative sequences and to create species-specific barcodes for differentiation. Genotyping of single nucleotide polymorphisms (SNPs), the positions at which genetic sequences actually vary, can be used to develop a rapid, reliable, high-throughput assay for distinguishing known species. SNPs are important hereditary markers for uncovering evolutionary history and normal genetic polymorphism. With this in mind, we propose a decision tree-based barcoding (DTB) algorithm that generates SNP barcodes from the DNA barcoding sequences of several evolutionarily related species in order to accurately identify a single species. To evaluate the approach, we analyzed mitochondrial COI gene sequences from 64 species of Anopheles mosquitoes. After alignment and truncation, 32 SNPs were identified in the COI gene sequences and used to compute the decision rules for constructing the decision tree. The DTB algorithm generated 126 nodes and 32 loci for discriminating the 64 Anopheles mosquito species. We conclude that the DTB method is useful and effective for generating sequence tags for Anopheles mosquito species identification.
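The general idea of building a decision tree over SNP loci can be illustrated with a minimal Python sketch. It is not the authors' DTB implementation: the abstract does not state the decision rule, so the sketch assumes a simple heuristic (split on the locus whose largest group is smallest), allows up to four branches per node (one per base), and uses a made-up four-species alignment. All names (`toy_alignment`, `find_snp_columns`, `build_tree`, `classify`, `loci_used`) are hypothetical.

```python
from collections import Counter

# Hypothetical toy alignment: equal-length, already aligned and truncated.
toy_alignment = {
    "species_A": "ACGTACGT",
    "species_B": "ACGTACGA",
    "species_C": "ATGTACGT",
    "species_D": "ATGTCCGA",
}

def find_snp_columns(aligned):
    """Return indices of alignment columns where more than one base occurs."""
    length = len(next(iter(aligned.values())))
    return [i for i in range(length)
            if len({seq[i] for seq in aligned.values()}) > 1]

def build_tree(species, aligned, loci):
    """Recursively split the species list until each leaf holds one species.

    Assumed rule (not from the paper): pick the locus whose largest
    group is smallest, i.e. the most even partition.
    """
    if len(species) == 1:
        return species[0]  # leaf: a single species name
    best_locus, best_score = None, None
    for locus in loci:
        counts = Counter(aligned[s][locus] for s in species)
        if len(counts) < 2:
            continue  # this locus does not separate the current subset
        score = max(counts.values())
        if best_score is None or score < best_score:
            best_locus, best_score = locus, score
    if best_locus is None:
        raise ValueError(f"no SNP separates: {species}")
    groups = {}
    for s in species:
        groups.setdefault(aligned[s][best_locus], []).append(s)
    return {"locus": best_locus,
            "branches": {base: build_tree(members, aligned, loci)
                         for base, members in groups.items()}}

def classify(tree, seq):
    """Walk the tree using the bases of an aligned query sequence."""
    while isinstance(tree, dict):
        tree = tree["branches"][seq[tree["locus"]]]
    return tree

def loci_used(tree):
    """Collect the SNP loci that appear as decision nodes in the tree."""
    if not isinstance(tree, dict):
        return set()
    used = {tree["locus"]}
    for child in tree["branches"].values():
        used |= loci_used(child)
    return used

snp_columns = find_snp_columns(toy_alignment)
tree = build_tree(sorted(toy_alignment), toy_alignment, snp_columns)
barcode_loci = sorted(loci_used(tree))
```

In this sketch, the loci that end up as decision nodes form the reduced SNP barcode: a query sequence needs to be read only at those positions to reach a leaf. A query carrying a base not seen at a decision locus raises `KeyError`, which a real tool would need to handle.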