• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于众包的可扩展变分高斯过程:激光干涉引力波天文台中的故障检测

Scalable Variational Gaussian Processes for Crowdsourcing: Glitch Detection in LIGO.

作者信息

Morales-Alvarez Pablo, Ruiz Pablo, Coughlin Scott, Molina Rafael, Katsaggelos Aggelos K

出版信息

IEEE Trans Pattern Anal Mach Intell. 2022 Mar;44(3):1534-1551. doi: 10.1109/TPAMI.2020.3025390. Epub 2022 Feb 3.

DOI:10.1109/TPAMI.2020.3025390
PMID:32956038
Abstract

In the last years, crowdsourcing is transforming the way classification training sets are obtained. Instead of relying on a single expert annotator, crowdsourcing shares the labelling effort among a large number of collaborators. For instance, this is being applied in the laureate laser interferometer gravitational waves observatory (LIGO), in order to detect glitches which might hinder the identification of true gravitational-waves. The crowdsourcing scenario poses new challenging difficulties, as it has to deal with different opinions from a heterogeneous group of annotators with unknown degrees of expertise. Probabilistic methods, such as Gaussian processes (GP), have proven successful in modeling this setting. However, GPs do not scale up well to large data sets, which hampers their broad adoption in real-world problems (in particular LIGO). This has led to the very recent introduction of deep learning based crowdsourcing methods, which have become the state-of-the-art for this type of problems. However, the accurate uncertainty quantification provided by GPs has been partially sacrificed. This is an important aspect for astrophysicists in LIGO, since a glitch detection system should provide very accurate probability distributions of its predictions. In this work, we first leverage a standard sparse GP approximation (SVGP) to develop a GP-based crowdsourcing method that factorizes into mini-batches. This makes it able to cope with previously-prohibitive data sets. This first approach, which we refer to as scalable variational Gaussian processes for crowdsourcing (SVGPCR), brings back GP-based methods to a state-of-the-art level, and excels at uncertainty quantification. SVGPCR is shown to outperform deep learning based methods and previous probabilistic ones when applied to the LIGO data. Its behavior and main properties are carefully analyzed in a controlled experiment based on the MNIST data set. Moreover, recent GP inference techniques are also adapted to crowdsourcing and evaluated experimentally.

摘要

在过去几年中,众包正在改变获取分类训练集的方式。众包不是依赖单个专家注释者,而是将标注工作分摊给大量协作者。例如,这一方式正在被应用于诺贝尔奖得主激光干涉引力波天文台(LIGO),以检测可能妨碍识别真正引力波的干扰信号。众包场景带来了新的挑战性难题,因为它必须处理来自专业程度未知的异质注释者群体的不同意见。概率方法,如高斯过程(GP),已被证明在对这种情况进行建模时是成功的。然而,高斯过程在处理大数据集时扩展性不佳,这阻碍了它们在实际问题(特别是LIGO)中的广泛应用。这导致最近引入了基于深度学习的众包方法,这些方法已成为这类问题的最新技术水平。然而,高斯过程所提供的准确不确定性量化在一定程度上被牺牲了。对于LIGO的天体物理学家来说,这是一个重要方面,因为故障检测系统应该提供其预测的非常准确的概率分布。在这项工作中,我们首先利用标准稀疏高斯过程近似(SVGP)来开发一种基于高斯过程的众包方法,该方法可分解为小批次。这使得它能够处理以前难以处理的数据集。我们将这种第一种方法称为用于众包的可扩展变分高斯过程(SVGPCR),它将基于高斯过程的方法带回了最新技术水平,并且在不确定性量化方面表现出色。当应用于LIGO数据时,SVGPCR被证明优于基于深度学习的方法和以前的概率方法。在基于MNIST数据集的控制实验中,对其行为和主要特性进行了仔细分析。此外,最近的高斯过程推理技术也被应用于众包并进行了实验评估。

相似文献

1
Scalable Variational Gaussian Processes for Crowdsourcing: Glitch Detection in LIGO.用于众包的可扩展变分高斯过程:激光干涉引力波天文台中的故障检测
IEEE Trans Pattern Anal Mach Intell. 2022 Mar;44(3):1534-1551. doi: 10.1109/TPAMI.2020.3025390. Epub 2022 Feb 3.
2
Gravity Spy: integrating advanced LIGO detector characterization, machine learning, and citizen science.引力波监测计划:整合先进的激光干涉引力波天文台(LIGO)探测器特性、机器学习与公民科学。
Class Quantum Gravity. 2017;34(No 6). doi: 10.1088/1361-6382/aa5cea. Epub 2017 Feb 28.
3
Learning from crowds in digital pathology using scalable variational Gaussian processes.基于可扩展变分高斯过程的数字病理学众包学习。
Sci Rep. 2021 Jun 2;11(1):11612. doi: 10.1038/s41598-021-90821-3.
4
Crowdsourcing with the drift diffusion model of decision making.利用决策的漂移扩散模型进行众包。
Sci Rep. 2024 May 17;14(1):11311. doi: 10.1038/s41598-024-61687-y.
5
Probabilistic Attention Based on Gaussian Processes for Deep Multiple Instance Learning.
IEEE Trans Neural Netw Learn Syst. 2024 Aug;35(8):10909-10922. doi: 10.1109/TNNLS.2023.3245329. Epub 2024 Aug 5.
6
Correlated product of experts for sparse Gaussian process regression.用于稀疏高斯过程回归的专家相关乘积
Mach Learn. 2023;112(5):1411-1432. doi: 10.1007/s10994-022-06297-3. Epub 2023 Jan 25.
7
Learning by aggregating experts and filtering novices: a solution to crowdsourcing problems in bioinformatics.通过聚合专家和过滤新手来学习:解决生物信息学众包问题的一种方法。
BMC Bioinformatics. 2013;14 Suppl 12(Suppl 12):S5. doi: 10.1186/1471-2105-14-S12-S5. Epub 2013 Sep 24.
8
LIGO: The Laser Interferometer Gravitational-Wave Observatory.激光干涉引力波天文台(LIGO)
Science. 1992 Apr 17;256(5055):325-33. doi: 10.1126/science.256.5055.325.
9
A stochastic variational framework for Recurrent Gaussian Processes models.循环高斯过程模型的随机变分框架。
Neural Netw. 2019 Apr;112:54-72. doi: 10.1016/j.neunet.2019.01.005. Epub 2019 Feb 1.
10
Deep Learning for Gravitational-Wave Data Analysis: A Resampling White-Box Approach.用于引力波数据分析的深度学习:一种重采样白盒方法。
Sensors (Basel). 2021 May 3;21(9):3174. doi: 10.3390/s21093174.

引用本文的文献

1
A fusocelular skin dataset with whole slide images for deep learning models.一个用于深度学习模型的带有全切片图像的梭形细胞皮肤数据集。
Sci Data. 2025 May 14;12(1):788. doi: 10.1038/s41597-025-05108-3.
2
Chained Deep Learning Using Generalized Cross-Entropy for Multiple Annotators Classification.链式深度学习使用广义交叉熵进行多标注分类。
Sensors (Basel). 2023 Mar 28;23(7):3518. doi: 10.3390/s23073518.
3
Learning from crowds in digital pathology using scalable variational Gaussian processes.基于可扩展变分高斯过程的数字病理学众包学习。
Sci Rep. 2021 Jun 2;11(1):11612. doi: 10.1038/s41598-021-90821-3.