Suppr超能文献

利用翅膀拍动声学特性提取特征的候选消除和最近质心进行准确高效的蚊子属分类算法。

Accurate and efficient mosquito genus classification algorithm using candidate-elimination and nearest centroid on extracted features of wingbeat acoustic properties.

机构信息

Department of Information Systems and Computer Science, Ateneo de Manila University, Quezon City, Philippines; International School Manila, Taguig City, Philippines.

Department of Information Systems and Computer Science, Ateneo de Manila University, Quezon City, Philippines.

出版信息

Comput Biol Med. 2021 Dec;139:104973. doi: 10.1016/j.compbiomed.2021.104973. Epub 2021 Oct 25.

Abstract

The automatic identification of mosquito genus, if used together with effective strategies of suppression and control may help reduce the spread of mosquito-borne diseases. In this study, we explored and developed a simple and yet very effective algorithm for processing audio files to determine the presence (or absence) of a mosquito and then identify the correct genus for those involving a mosquito. A dataset of sound recordings from the Humbug Project of Zooniverse, collected by researchers from Oxford University, and actual recordings of mosquitoes in the Philippines were used in this study. Our developed technique involves extracting filter bank values from corresponding spectrograms of the audio files, and we built a classification model based only on three simple statistics from said collected values -- maximum, first quartile and third quartile. Specifically, the maximum values were used in defining thresholds for the candidate-elimination phase of the algorithm, and then the first and third quartile values were used in the succeeding nearest centroid computation phase. The proposed algorithm yielded an impressive 97.2% average classification accuracy from a 5-fold stratified cross validation. This is competitive with the 75.55-97.65% accuracy results reported in literature for different mosquito classification tasks run on different datasets. Moreover, the achieved accuracy is significantly higher than the 86.6% that we gathered from applying a CNN architecture from literature to our same dataset. Aside from being more accurate, the proposed algorithm is also significantly more efficient than the CNN model, requiring much less time (in both training and predicting phases) and memory space. The results offer a promising technique that may also simplify the process of solving other sound-based classification problems.

摘要

如果能将自动识别蚊子属的功能与有效的抑制和控制策略结合使用,可能有助于减少蚊媒疾病的传播。在本研究中,我们探索并开发了一种简单而有效的算法,用于处理音频文件,以确定是否存在蚊子,并识别涉及蚊子的正确属。本研究使用了来自 Zooniverse 的 Humbug 项目的录音数据集和牛津大学研究人员收集的实际菲律宾蚊子录音。我们开发的技术涉及从音频文件的相应声谱图中提取滤波器组值,并仅基于从收集的值中提取的三个简单统计信息构建分类模型——最大值、第一四分位数和第三四分位数。具体来说,最大值用于定义算法候选消除阶段的阈值,然后使用第一和第三四分位数值进行随后的最近质心计算阶段。该算法在 5 折分层交叉验证中产生了令人印象深刻的 97.2%的平均分类准确率。这与在不同数据集上运行的不同蚊子分类任务的文献报道的 75.55-97.65%的准确率结果相当。此外,所达到的准确率明显高于我们从文献中应用的 CNN 架构应用于我们相同数据集时收集到的 86.6%的准确率。除了更准确之外,该算法的效率也明显高于 CNN 模型,所需的时间(训练和预测阶段)和内存空间都要少得多。该结果提供了一种有前途的技术,也可能简化其他基于声音的分类问题的解决过程。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验