Madhawa Kaushalya, Murata Tsuyoshi
Department of Computer Science, Tokyo Institute of Technology, Tokyo 152-8552, Japan.
Entropy (Basel). 2020 Oct 16;22(10):1164. doi: 10.3390/e22101164.
Current breakthroughs in the field of machine learning are fueled by the deployment of deep neural network models. Deep neural network models are notorious for their dependence on large amounts of labeled data for training. Active learning is used as a solution to train classification models with fewer labeled instances by selecting only the most informative instances for labeling. This is especially important when labeled data are scarce or the labeling process is expensive. In this paper, we study the application of active learning on attributed graphs. In this setting, the data instances are represented as nodes of an attributed graph. Graph neural networks achieve the current state-of-the-art classification performance on attributed graphs. The performance of graph neural networks relies on careful tuning of their hyperparameters, usually performed using a validation set, an additional set of labeled instances. In label-scarce problems, it is more realistic to use all labeled instances for training the model rather than setting some aside for validation. In this setting, we perform a fair comparison of existing active learning algorithms proposed for graph neural networks as well as for other data types such as images and text. With empirical results, we demonstrate that state-of-the-art active learning algorithms designed for other data types do not perform well on graph-structured data. We study the problem within the framework of the exploration-vs.-exploitation trade-off and propose a new count-based exploration term. With empirical evidence on multiple benchmark graphs, we highlight the importance of complementing uncertainty-based active learning models with an exploration term.
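The abstract does not spell out the form of the acquisition function or the count-based exploration term. Purely as an illustrative sketch, and not the authors' actual method, the Python snippet below combines a standard uncertainty term (predictive entropy of a GNN's softmax outputs) with a hypothetical count-based bonus that favors graph regions in which few nodes have been labeled so far; the names acquisition_scores, cluster_ids, and beta are assumptions introduced here for illustration only:

    import numpy as np

    def predictive_entropy(probs, eps=1e-12):
        # Uncertainty term: entropy of each node's predicted class distribution.
        return -np.sum(probs * np.log(probs + eps), axis=1)

    def acquisition_scores(probs, cluster_ids, labeled_mask, beta=1.0):
        # Exploitation: prefer nodes the model is most uncertain about.
        uncertainty = predictive_entropy(probs)
        # Exploration (hypothetical count-based bonus): regions of the graph
        # with few labeled nodes so far receive a 1/sqrt(count + 1) bonus.
        n_regions = int(cluster_ids.max()) + 1
        label_counts = np.bincount(cluster_ids[labeled_mask], minlength=n_regions)
        exploration = 1.0 / np.sqrt(label_counts[cluster_ids] + 1.0)
        scores = uncertainty + beta * exploration
        scores[labeled_mask] = -np.inf  # never re-query already-labeled nodes
        return scores

    # Toy usage with random "GNN" outputs and a random graph partition.
    rng = np.random.default_rng(0)
    probs = rng.dirichlet(np.ones(3), size=10)    # softmax outputs for 10 nodes, 3 classes
    cluster_ids = rng.integers(0, 3, size=10)     # community id of each node
    labeled_mask = np.zeros(10, dtype=bool)
    labeled_mask[[0, 1]] = True                   # two seed labels
    next_node = int(np.argmax(acquisition_scores(probs, cluster_ids, labeled_mask)))

The additive uncertainty-plus-bonus form mirrors UCB-style scores from the exploration-vs.-exploitation literature; the paper's actual exploration term may be defined differently.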