• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

电影:利用运动增强生态学中的人工智能目标检测

The Motion Picture: Leveraging Movement to Enhance AI Object Detection in Ecology.

作者信息

Maslen Ben, Popovic Gordana, Wang Dadong, Jansen Andrew, Warton David

机构信息

School of Mathematics and Statistics University of New South Wales Sydney New South Wales Australia.

Evolution and Ecology Research Centre University of New South Wales Sydney New South Wales Australia.

出版信息

Ecol Evol. 2025 Aug 19;15(8):e71996. doi: 10.1002/ece3.71996. eCollection 2025 Aug.

DOI:10.1002/ece3.71996
PMID:40837535
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12364559/
Abstract

The rise of AI has seen an explosion in the use of deep learning methods that automate the analysis of image and video data, saving ecologists vast amounts of time and resources. Ecological imagery poses unique challenges; however, with cryptic species struggling to be detected among poor visibility and diverse environments. We propose leveraging movement information to attempt to improve the predictions produced by a high-performing object detection algorithm. Frame differencing, background subtraction, optical flow and multi-object tracking are trialed on four diverse datasets containing over 35,000 annotated images sourced from terrestrial, marine and freshwater habitats. We find that leveraging movement information is useful for smaller sized studies and rarer species, however is not needed for well annotated studies (> 400 annotations per class). Out of the methods that utilise movement, we find that a simple 'differencing' of neighbouring frames generally performed the best, whilst attempting to track taxa to boost prediction scores performed poorly. Other studies in this area tend to focus only on 1-2 datasets and a single method that utilises movement information, making it difficult for ecologists to generalise results. Our study provides key lessons for ecologists to determine whether it is useful to incorporate methods that leverage movement information when attempting to automatically predict taxa. We offer straightforward code for practical implementation via our GitHub repository, BenMaslen/MCD, along with an evaluation benchmark dataset called 'Tassie BRUV' that can be accessed from the Dryad public repository https://doi.org/10.5061/dryad.sbcc2frf7.

摘要

随着人工智能的兴起,深度学习方法在图像和视频数据分析自动化中的应用呈爆炸式增长,为生态学家节省了大量时间和资源。然而,生态图像带来了独特的挑战,在能见度差和环境多样的情况下,隐秘物种难以被发现。我们建议利用运动信息来尝试改进高性能目标检测算法产生的预测。我们在四个不同的数据集上对帧差法、背景减法、光流法和多目标跟踪法进行了试验,这些数据集包含来自陆地、海洋和淡水栖息地的35000多张带注释的图像。我们发现,利用运动信息对规模较小的研究和较罕见的物种有用,但对于注释完善的研究(每类>400个注释)则不需要。在利用运动的方法中,我们发现相邻帧的简单“差分”通常表现最佳,而试图跟踪分类单元以提高预测分数的效果不佳。该领域的其他研究往往只关注1-2个数据集和一种利用运动信息的单一方法,这使得生态学家难以将结果推广。我们的研究为生态学家提供了关键经验,以确定在尝试自动预测分类单元时纳入利用运动信息的方法是否有用。我们通过GitHub仓库BenMaslen/MCD提供了用于实际实现的简单代码,以及一个名为“塔斯马尼亚BRUV”的评估基准数据集,该数据集可从Dryad公共仓库https://doi.org/10.5061/dryad.sbcc2frf7访问。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a16/12364559/4911e906423c/ECE3-15-e71996-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a16/12364559/b15e1e1d15f3/ECE3-15-e71996-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a16/12364559/b2fcaf0c20cb/ECE3-15-e71996-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a16/12364559/419b8aff1e78/ECE3-15-e71996-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a16/12364559/c630d11b5ddb/ECE3-15-e71996-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a16/12364559/4911e906423c/ECE3-15-e71996-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a16/12364559/b15e1e1d15f3/ECE3-15-e71996-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a16/12364559/b2fcaf0c20cb/ECE3-15-e71996-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a16/12364559/419b8aff1e78/ECE3-15-e71996-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a16/12364559/c630d11b5ddb/ECE3-15-e71996-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a16/12364559/4911e906423c/ECE3-15-e71996-g002.jpg

相似文献

1
The Motion Picture: Leveraging Movement to Enhance AI Object Detection in Ecology.电影:利用运动增强生态学中的人工智能目标检测
Ecol Evol. 2025 Aug 19;15(8):e71996. doi: 10.1002/ece3.71996. eCollection 2025 Aug.
2
Aspects of Genetic Diversity, Host Specificity and Public Health Significance of Single-Celled Intestinal Parasites Commonly Observed in Humans and Mostly Referred to as 'Non-Pathogenic'.人类常见且大多被称为“非致病性”的单细胞肠道寄生虫的遗传多样性、宿主特异性及公共卫生意义
APMIS. 2025 Sep;133(9):e70036. doi: 10.1111/apm.70036.
3
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
4
Kinematic Adaptive Frame Recognition (KAFR): A Novel Framework for Video Segmentation via Frame Similarity and Surgical Tool Tracking.运动自适应帧识别(KAFR):一种通过帧相似度和手术工具跟踪进行视频分割的新型框架。
IEEE Access. 2025;13:101681-101697. doi: 10.1109/access.2025.3573264. Epub 2025 May 23.
5
Integrated neural network framework for multi-object detection and recognition using UAV imagery.用于使用无人机图像进行多目标检测与识别的集成神经网络框架。
Front Neurorobot. 2025 Jul 30;19:1643011. doi: 10.3389/fnbot.2025.1643011. eCollection 2025.
6
Improving reliability of movement assessment in Parkinson's disease using computer vision-based automated severity estimation.利用基于计算机视觉的自动严重程度估计提高帕金森病运动评估的可靠性。
J Parkinsons Dis. 2025 Mar;15(2):349-360. doi: 10.1177/1877718X241312605. Epub 2025 Feb 13.
7
Artificial intelligence for diagnosing exudative age-related macular degeneration.人工智能在渗出性年龄相关性黄斑变性诊断中的应用。
Cochrane Database Syst Rev. 2024 Oct 17;10(10):CD015522. doi: 10.1002/14651858.CD015522.pub2.
8
Evaluating Place Cell Detection Methods in Rats and Humans - Implications for Cross-Species Spatial Coding.评估大鼠和人类中的位置细胞检测方法——对跨物种空间编码的启示
bioRxiv. 2025 Sep 3:2025.08.29.672705. doi: 10.1101/2025.08.29.672705.
9
DeePosit, an AI-based tool for detecting mouse urine and fecal depositions from thermal video clips of behavioral experiments.DeePosit是一种基于人工智能的工具,用于从行为实验的热视频片段中检测小鼠尿液和粪便沉积。
Elife. 2025 Aug 28;13:RP100739. doi: 10.7554/eLife.100739.
10
Post-pandemic planning for maternity care for local, regional, and national maternity systems across the four nations: a mixed-methods study.针对四个地区的地方、区域和国家孕产妇保健系统的疫情后规划:一项混合方法研究。
Health Soc Care Deliv Res. 2025 Sep;13(35):1-25. doi: 10.3310/HHTE6611.

本文引用的文献

1
A review of 28 free animal-tracking software applications: current features and limitations.对 28 种免费动物追踪软件应用程序的回顾:当前的特点和局限性。
Lab Anim (NY). 2021 Sep;50(9):246-254. doi: 10.1038/s41684-021-00811-1. Epub 2021 Jul 29.
2
Automatic detection of fish and tracking of movement for ecology.用于生态学的鱼类自动检测与运动跟踪。
Ecol Evol. 2021 May 18;11(12):8254-8263. doi: 10.1002/ece3.7656. eCollection 2021 Jun.
3
A realistic fish-habitat dataset to evaluate algorithms for underwater visual analysis.用于评估水下视觉分析算法的真实鱼类栖息地数据集。
Sci Rep. 2020 Sep 4;10(1):14671. doi: 10.1038/s41598-020-71639-x.
4
Biomedical image augmentation using Augmentor.使用 Augmentor 进行生物医学图像增强。
Bioinformatics. 2019 Nov 1;35(21):4522-4524. doi: 10.1093/bioinformatics/btz259.
5
Two-Dimensional Whitening Reconstruction for Enhancing Robustness of Principal Component Analysis.用于增强主成分分析鲁棒性的二维白化重建
IEEE Trans Pattern Anal Mach Intell. 2016 Oct;38(10):2130-6. doi: 10.1109/TPAMI.2015.2501810. Epub 2015 Nov 18.
6
ViBe: a universal background subtraction algorithm for video sequences.ViBe:一种适用于视频序列的通用背景减除算法。
IEEE Trans Image Process. 2011 Jun;20(6):1709-24. doi: 10.1109/TIP.2010.2101613. Epub 2010 Dec 23.