Kalafi E Y, Anuar M K, Sakharkar M K, Dhillon S K
Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia.
Drug Discovery and Development Research Group, College of Pharmacy and Nutrition, University of Saskatchewan, Saskatoon, Canada.
Folia Biol (Praha). 2018;64(4):137-143. doi: 10.14712/fb2018064040137.
The process of manual species identification is a daunting task, so much so that the number of taxonomists is seen to be declining. In order to assist taxonomists, many methods and algorithms have been proposed to develop semi-automated and fully automated systems for species identification. While semi-automated tools would require manual intervention by a domain expert, fully automated tools are assumed to be not as reliable as manual or semiautomated identification tools. Hence, in this study we investigate the accuracy of fully automated and semi-automated models for species identification. We have built fully automated and semi-automated species classification models using the monogenean species image dataset. With respect to monogeneans' morphology, they are differentiated based on the morphological characteristics of haptoral bars, anchors, marginal hooks and reproductive organs (male and female copulatory organs). Landmarks (in the semi-automated model) and shape morphometric features (in the fully automated model) were extracted from four monogenean species images, which were then classified using k-nearest neighbour and artificial neural network. In semi-automated models, a classification accuracy of 96.67 % was obtained using the k-nearest neighbour and 97.5 % using the artificial neural network, whereas in fully automated models, a classification accuracy of 90 % was obtained using the k-nearest neighbour and 98.8 % using the artificial neural network. As for the crossvalidation, semi-automated models performed at 91.2 %, whereas fully automated models performed slightly higher at 93.75 %.
人工进行物种鉴定的过程是一项艰巨的任务,以至于分类学家的数量正在减少。为了帮助分类学家,人们提出了许多方法和算法来开发用于物种鉴定的半自动和全自动系统。虽然半自动工具需要领域专家进行人工干预,但全自动工具被认为不如人工或半自动鉴定工具可靠。因此,在本研究中,我们调查了用于物种鉴定的全自动和半自动模型的准确性。我们使用单殖吸虫物种图像数据集构建了全自动和半自动物种分类模型。关于单殖吸虫的形态,它们是根据吸器杆、锚、边缘小钩和生殖器官(雄性和雌性交配器官)的形态特征来区分的。从四种单殖吸虫物种图像中提取地标(在半自动模型中)和形状形态计量特征(在全自动模型中),然后使用k近邻和人工神经网络进行分类。在半自动模型中,使用k近邻获得的分类准确率为96.67%,使用人工神经网络为97.5%,而在全自动模型中,使用k近邻获得的分类准确率为90%,使用人工神经网络为98.8%。至于交叉验证,半自动模型的表现为91.2%,而全自动模型的表现略高,为93.75%。