• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

分析传感器数据集的特点,以支持数据理解和准备。

Analyzing Particularities of Sensor Datasets for Supporting Data Understanding and Preparation.

机构信息

Research and Innovation, ATOS Spain SA, 48013 Bilbao, Spain.

DeustoTech, University of Deusto, 48007 Bilbao, Spain.

出版信息

Sensors (Basel). 2021 Sep 10;21(18):6063. doi: 10.3390/s21186063.

DOI:10.3390/s21186063
PMID:34577271
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8472945/
Abstract

Data scientists spend much time with data cleaning tasks, and this is especially important when dealing with data gathered from sensors, as finding failures is not unusual (there is an abundance of research on anomaly detection in sensor data). This work analyzes several aspects of the data generated by different sensor types to understand particularities in the data, linking them with existing data mining methodologies. Using data from different sources, this work analyzes how the type of sensor used and its measurement units have an important impact in basic statistics such as variance and mean, because of the statistical distributions of the datasets. The work also analyzes the behavior of outliers, how to detect them, and how they affect the equivalence of sensors, as equivalence is used in many solutions for identifying anomalies. Based on the previous results, the article presents guidance on how to deal with data coming from sensors, in order to understand the characteristics of sensor datasets, and proposes a parallelized implementation. Finally, the article shows that the proposed decision-making processes work well with a new type of sensor and that parallelizing with several cores enables calculations to be executed up to four times faster.

摘要

数据科学家花费大量时间进行数据清理任务,当处理从传感器收集的数据时,这一点尤其重要,因为发现故障并不罕见(关于传感器数据中的异常检测有大量研究)。这项工作分析了不同传感器类型生成的数据的几个方面,以了解数据的特殊性,并将其与现有的数据挖掘方法联系起来。使用来自不同来源的数据,这项工作分析了所使用的传感器类型及其测量单位如何对基本统计数据(如方差和平均值)产生重要影响,这是由于数据集的统计分布造成的。该工作还分析了异常值的行为、如何检测它们以及它们如何影响传感器的等效性,因为在许多用于识别异常的解决方案中都使用了等效性。基于之前的结果,本文提出了如何处理来自传感器的数据的指导意见,以便了解传感器数据集的特点,并提出了一种并行实现。最后,本文表明,所提出的决策过程在新型传感器上效果良好,并且使用多个核心并行化可以使计算速度提高至四倍。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0627/8472945/84f599db28c1/sensors-21-06063-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0627/8472945/eb3720a64754/sensors-21-06063-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0627/8472945/6f60a1d1a2ec/sensors-21-06063-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0627/8472945/f16be8ebfa17/sensors-21-06063-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0627/8472945/7b34e5af8b92/sensors-21-06063-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0627/8472945/12eecbe26102/sensors-21-06063-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0627/8472945/9bf9785ed026/sensors-21-06063-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0627/8472945/076d6bfcdba3/sensors-21-06063-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0627/8472945/5c6eb9548a22/sensors-21-06063-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0627/8472945/7f2bee9b0edf/sensors-21-06063-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0627/8472945/84f599db28c1/sensors-21-06063-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0627/8472945/eb3720a64754/sensors-21-06063-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0627/8472945/6f60a1d1a2ec/sensors-21-06063-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0627/8472945/f16be8ebfa17/sensors-21-06063-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0627/8472945/7b34e5af8b92/sensors-21-06063-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0627/8472945/12eecbe26102/sensors-21-06063-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0627/8472945/9bf9785ed026/sensors-21-06063-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0627/8472945/076d6bfcdba3/sensors-21-06063-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0627/8472945/5c6eb9548a22/sensors-21-06063-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0627/8472945/7f2bee9b0edf/sensors-21-06063-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0627/8472945/84f599db28c1/sensors-21-06063-g010.jpg

相似文献

1
Analyzing Particularities of Sensor Datasets for Supporting Data Understanding and Preparation.分析传感器数据集的特点,以支持数据理解和准备。
Sensors (Basel). 2021 Sep 10;21(18):6063. doi: 10.3390/s21186063.
2
An Interoperable Architecture for the Internet of COVID-19 Things (IoCT) Using Open Geospatial Standards-Case Study: Workplace Reopening.使用开放地理空间标准的新冠疫情物联网 (IoCT) 的可互操作架构-案例研究:工作场所重新开放。
Sensors (Basel). 2020 Dec 24;21(1):50. doi: 10.3390/s21010050.
3
A Novel Sensor Data Pre-Processing Methodology for the Internet of Things Using Anomaly Detection and Transfer-By-Subspace-Similarity Transformation.一种基于异常检测和子空间相似变换的物联网传感器数据预处理新方法。
Sensors (Basel). 2019 Oct 18;19(20):4536. doi: 10.3390/s19204536.
4
Sensor Failure Detection in Ambient Assisted Living Using Association Rule Mining.使用关联规则挖掘技术进行环境辅助生活中的传感器故障检测。
Sensors (Basel). 2020 Nov 26;20(23):6760. doi: 10.3390/s20236760.
5
Qualitative Data Clustering to Detect Outliers.用于检测异常值的定性数据聚类
Entropy (Basel). 2021 Jul 7;23(7):869. doi: 10.3390/e23070869.
6
Effective Sensor Selection and Data Anomaly Detection for Condition Monitoring of Aircraft Engines.用于飞机发动机状态监测的有效传感器选择与数据异常检测
Sensors (Basel). 2016 Apr 29;16(5):623. doi: 10.3390/s16050623.
7
Annotated Dataset for Anomaly Detection in a Data Center with IoT Sensors.用于具有物联网传感器的数据中心异常检测的带注释数据集。
Sensors (Basel). 2020 Jul 4;20(13):3745. doi: 10.3390/s20133745.
8
Anomaly detection based on sensor data in petroleum industry applications.石油工业应用中基于传感器数据的异常检测。
Sensors (Basel). 2015 Jan 27;15(2):2774-97. doi: 10.3390/s150202774.
9
AI-Based Sensor Information Fusion for Supporting Deep Supervised Learning.基于人工智能的传感器信息融合,用于支持深度监督学习。
Sensors (Basel). 2019 Mar 18;19(6):1345. doi: 10.3390/s19061345.
10
Smart Multi-Sensor Platform for Analytics and Social Decision Support in Agriculture.智能多传感器平台,用于农业分析和社会决策支持。
Sensors (Basel). 2020 Jul 24;20(15):4127. doi: 10.3390/s20154127.

本文引用的文献

1
Sensors Data Analysis in Supervisory Control and Data Acquisition (SCADA) Systems to Foresee Failures with an Undetermined Origin.用于预见来源不明故障的监控与数据采集(SCADA)系统中的传感器数据分析
Sensors (Basel). 2021 Apr 14;21(8):2762. doi: 10.3390/s21082762.
2
Smart Anomaly Detection and Prediction for Assembly Process Maintenance in Compliance with Industry 4.0.智能异常检测和预测,符合工业 4.0 标准的装配过程维护。
Sensors (Basel). 2021 Mar 29;21(7):2376. doi: 10.3390/s21072376.
3
A Survey of Anomaly Detection in Industrial Wireless Sensor Networks with Critical Water System Infrastructure as a Case Study.
以关键水系统基础设施为案例研究的工业无线传感器网络异常检测综述。
Sensors (Basel). 2018 Aug 1;18(8):2491. doi: 10.3390/s18082491.
4
Common pitfalls in statistical analysis: The use of correlation techniques.统计分析中的常见陷阱:相关技术的运用
Perspect Clin Res. 2016 Oct-Dec;7(4):187-190. doi: 10.4103/2229-3485.192046.
5
Anomaly detection based on sensor data in petroleum industry applications.石油工业应用中基于传感器数据的异常检测。
Sensors (Basel). 2015 Jan 27;15(2):2774-97. doi: 10.3390/s150202774.
6
The treatment of ties in ranking problems.排序问题中平局情况的处理。
Biometrika. 1945 Nov;33:239-51. doi: 10.1093/biomet/33.3.239.
7
Use of coefficient of variation in assessing variability of quantitative assays.变异系数在评估定量测定变异性中的应用。
Clin Diagn Lab Immunol. 2002 Nov;9(6):1235-9. doi: 10.1128/cdli.9.6.1235-1239.2002.