如何处理基于惯性测量数据的行为模式监督机器学习中的混合行为片段。

How to treat mixed behavior segments in supervised machine learning of behavioural modes from inertial measurement data.

作者信息

Resheff Yehezkel S, Bensch Hanna M, Zöttl Markus, Harel Roi, Matsumoto-Oda Akiko, Crofoot Margaret C, Gomez Sara, Börger Luca, Rotics Shay

机构信息

Hebrew University Business School, The Hebrew University of Jerusalem, Jerusalem, Israel.

Department of Biology and Environmental Science, Centre for Ecology and Evolution in Microbial Model Systems (EEMIS), Linnaeus University, 391 82, Kalmar, Sweden.

出版信息

Mov Ecol. 2024 Jun 10;12(1):44. doi: 10.1186/s40462-024-00485-7.

DOI:10.1186/s40462-024-00485-7

PMID:38858733

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11165886/

Abstract

The application of supervised machine learning methods to identify behavioural modes from inertial measurements of bio-loggers has become a standard tool in behavioural ecology. Several design choices can affect the accuracy of identifying the behavioural modes. One such choice is the inclusion or exclusion of segments consisting of more than a single behaviour (mixed segments) in the machine learning model training data. Currently, the common practice is to ignore such segments during model training. In this paper we tested the hypothesis that including mixed segments in model training will improve accuracy, as the model would perform better in identifying them in the test data. We test this hypothesis using a series of data simulations on four datasets of accelerometer data coupled with behaviour observations, obtained from four study species (Damaraland mole-rats, meerkats, olive baboons, polar bears). Results show that when a substantial proportion of the test data are mixed behaviour segments (above ~ 10%), including mixed segments in machine learning model training improves the accuracy of classification. These results were consistent across the four study species, and robust to changes in segment length, sample size, and degree of mixture within the mixed segments. However, we also find that in some cases (particularly in baboons) models trained with mixed segments show reduced accuracy in classifying test data containing only single behaviour (pure) segments, compared to models trained without mixed segments. Based on these results, we recommend that when the classification model is expected to deal with a substantial proportion of mixed behaviour segments (> 10%), it is beneficial to include them in model training, otherwise, it is unnecessary but also not harmful. The exception is when there is a basis to assume that the training data contains a higher rate of mixed segments than the actual (unobserved) data to be classified-such a situation may occur particularly when training data are collected in captivity and used to classify data from the wild. In this case, excess inclusion of mixed segments in training data should probably be avoided.

摘要

将监督式机器学习方法应用于从生物记录器的惯性测量中识别行为模式，已成为行为生态学中的一种标准工具。有几个设计选择会影响行为模式识别的准确性。其中一个选择是在机器学习模型训练数据中包含或排除由多种行为组成的片段（混合片段）。目前，常见的做法是在模型训练期间忽略这些片段。在本文中，我们测试了这样一个假设，即在模型训练中包含混合片段会提高准确性，因为模型在测试数据中识别它们时会表现得更好。我们使用一系列数据模拟对四个加速度计数据集进行了测试，这些数据集与行为观察结果相结合，分别来自四个研究物种（达马拉兰鼹鼠、狐獴、东非狒狒、北极熊）。结果表明，当测试数据中有相当比例是混合行为片段（超过约10%）时，在机器学习模型训练中包含混合片段可提高分类准确性。这些结果在四个研究物种中都是一致的，并且对于混合片段的长度、样本大小和混合程度的变化具有稳健性。然而，我们也发现，在某些情况下（特别是在狒狒中），与不包含混合片段训练的模型相比，包含混合片段训练的模型在对仅包含单一行为（纯）片段的测试数据进行分类时准确性会降低。基于这些结果，我们建议，当预期分类模型要处理相当比例的混合行为片段（>10%）时，将它们包含在模型训练中是有益的，否则，这没有必要但也无害。例外情况是，当有理由假设训练数据中混合片段的比例高于实际（未观察到的）待分类数据时——这种情况可能尤其会在圈养环境中收集训练数据并用于对野外数据进行分类时发生。在这种情况下，可能应避免在训练数据中过度包含混合片段。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f52/11165886/ac919012f741/40462_2024_485_Fig1_HTML.jpg

相似文献

How to treat mixed behavior segments in supervised machine learning of behavioural modes from inertial measurement data.如何处理基于惯性测量数据的行为模式监督机器学习中的混合行为片段。

Mov Ecol. 2024 Jun 10;12(1):44. doi: 10.1186/s40462-024-00485-7.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Identification of behaviour in freely moving dogs (Canis familiaris) using inertial sensors.使用惯性传感器识别自由活动的狗（犬）的行为。

PLoS One. 2013 Oct 18;8(10):e77814. doi: 10.1371/journal.pone.0077814. eCollection 2013.

Classification of broiler behaviours using triaxial accelerometer and machine learning.使用三轴加速度计和机器学习对肉鸡行为进行分类。

Animal. 2021 Jul;15(7):100269. doi: 10.1016/j.animal.2021.100269. Epub 2021 Jun 5.

Machine learning goes wild: Using data from captive individuals to infer wildlife behaviours.机器学习失控：利用圈养个体的数据推断野生动物行为。

PLoS One. 2020 May 5;15(5):e0227317. doi: 10.1371/journal.pone.0227317. eCollection 2020.

Seeing It All: Evaluating Supervised Machine Learning Methods for the Classification of Diverse Otariid Behaviours.全面了解：评估用于不同海狗行为分类的监督式机器学习方法

PLoS One. 2016 Dec 21;11(12):e0166898. doi: 10.1371/journal.pone.0166898. eCollection 2016.

Challenges of machine learning model validation using correlated behaviour data: Evaluation of cross-validation strategies and accuracy measures.使用相关行为数据验证机器学习模型的挑战：交叉验证策略和准确性度量的评估。

PLoS One. 2020 Jul 20;15(7):e0236092. doi: 10.1371/journal.pone.0236092. eCollection 2020.

Ensemble machine learning model trained on a new synthesized dataset generalizes well for stress prediction using wearable devices.在新合成数据集上训练的集成机器学习模型，对于使用可穿戴设备进行压力预测具有良好的泛化能力。

J Biomed Inform. 2023 Dec;148:104556. doi: 10.1016/j.jbi.2023.104556. Epub 2023 Dec 2.

The role of individual variability on the predictive performance of machine learning applied to large bio-logging datasets.个体变异性对机器学习在大型生物记录数据集预测性能的影响。

Sci Rep. 2022 Nov 17;12(1):19737. doi: 10.1038/s41598-022-22258-1.

Behavioural compass: animal behaviour recognition using magnetometers.行为指南针：利用磁力计进行动物行为识别

Mov Ecol. 2019 Aug 27;7:28. doi: 10.1186/s40462-019-0172-6. eCollection 2019.

引用本文的文献

Practical guidelines for validation of supervised machine learning models in accelerometer-based animal behaviour classification.基于加速度计的动物行为分类中监督式机器学习模型验证的实用指南。

J Anim Ecol. 2025 Jul;94(7):1322-1334. doi: 10.1111/1365-2656.70054. Epub 2025 May 19.

A benchmark for computational analysis of animal behavior, using animal-borne tags.一种使用动物携带标签进行动物行为计算分析的基准。

Mov Ecol. 2024 Dec 18;12(1):78. doi: 10.1186/s40462-024-00511-8.

本文引用的文献

Estimating individual exposure to predation risk in group-living baboons, Papio anubis.估算群体生活的狒狒（Papio anubis）个体所面临的捕食风险。

PLoS One. 2023 Nov 8;18(11):e0287357. doi: 10.1371/journal.pone.0287357. eCollection 2023.

Big-data approaches lead to an increased understanding of the ecology of animal movement.大数据方法提高了对动物运动生态学的理解。

Science. 2022 Feb 18;375(6582):eabg1780. doi: 10.1126/science.abg1780.

R package for animal behavior classification from accelerometer data-rabc.用于基于加速度计数据进行动物行为分类的R包——rabc。

Ecol Evol. 2021 Aug 20;11(18):12364-12377. doi: 10.1002/ece3.7937. eCollection 2021 Sep.

Limitations of using surrogates for behaviour classification of accelerometer data: refining methods using random forest models in Caprids.使用替代指标进行加速度计数据行为分类的局限性：利用Caprids中的随机森林模型改进方法

Mov Ecol. 2021 Jun 7;9(1):28. doi: 10.1186/s40462-021-00265-7.

Using tri-axial accelerometer loggers to identify spawning behaviours of large pelagic fish.使用三轴加速度计记录仪识别大型远洋鱼类的产卵行为。

Mov Ecol. 2021 May 24;9(1):26. doi: 10.1186/s40462-021-00248-8.

Meerkat helpers buffer the detrimental effects of adverse environmental conditions on fecundity, growth and survival.高跷羚助手缓冲了不利环境条件对繁殖力、生长和存活的不利影响。

J Anim Ecol. 2021 Mar;90(3):641-652. doi: 10.1111/1365-2656.13396. Epub 2020 Dec 14.

Divergent field metabolic rates highlight the challenges of increasing temperatures and energy limitation in aquatic ectotherms.变域场代谢率突出了水生变温动物在温度升高和能量限制方面面临的挑战。

Oecologia. 2020 Jun;193(2):311-323. doi: 10.1007/s00442-020-04669-x. Epub 2020 May 20.

The seasonal energetic landscape of an apex marine carnivore, the polar bear.极地熊这种海洋顶级捕食者季节性的能量景观。

Ecology. 2020 Mar;101(3):e02959. doi: 10.1002/ecy.2959.

Using accelerometry to compare costs of extended migration in an arctic herbivore.利用加速度计比较北极食草动物长距离迁徙的成本。

Curr Zool. 2017 Dec;63(6):667-674. doi: 10.1093/cz/zox056. Epub 2017 Oct 3.

Differences in cooperative behavior among Damaraland mole rats are consequences of an age-related polyethism.达马拉兰鼹鼠合作行为的差异是与年龄相关的多态行为的结果。

Proc Natl Acad Sci U S A. 2016 Sep 13;113(37):10382-7. doi: 10.1073/pnas.1607885113. Epub 2016 Sep 1.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

如何处理基于惯性测量数据的行为模式监督机器学习中的混合行为片段。

How to treat mixed behavior segments in supervised machine learning of behavioural modes from inertial measurement data.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献