Mimura Kazuhide, Nakamura Kentaro
Ocean Resources Research Center for Next Generation, Chiba Institute of Technology, 2-17-1 Tsudanuma, Narashino, Chiba 275-0016, Japan.
Frontier Research Center for Energy and Resources, School of Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8656, Japan.
Data Brief. 2023 Jan 31;47:108940. doi: 10.1016/j.dib.2023.108940. eCollection 2023 Apr.
In this paper, we describe the three datasets that were used to train, validate, and test deep learning models to detect microfossil fish teeth. The first dataset was created for training and validating a Mask R-CNN model to detect fish teeth in the images taken using the microscope. The training set contained 866 images and one annotation file; the validation set contained 92 images and one annotation file. The second dataset was created for training and validating EfficientNet-V2 models; it included 17,400 images of teeth and 15,036 images that contained only noise (particles other than teeth). The third dataset was created to evaluate the performance of a system that combines a Mask R-CNN model and an EfficientNet-V2 model; it contained 5177 images with annotation files for the locations of 431 teeth within the images.
在本文中,我们描述了用于训练、验证和测试深度学习模型以检测微化石鱼牙的三个数据集。第一个数据集用于训练和验证Mask R-CNN模型,以在使用显微镜拍摄的图像中检测鱼牙。训练集包含866张图像和一个注释文件;验证集包含92张图像和一个注释文件。第二个数据集用于训练和验证EfficientNet-V2模型;它包括17400张牙齿图像和15036张仅包含噪声(非牙齿颗粒)的图像。第三个数据集用于评估结合Mask R-CNN模型和EfficientNet-V2模型的系统的性能;它包含5177张图像以及图像中431颗牙齿位置的注释文件。