从在细胞图上训练的图神经网络中提取知识，用于非神经学生模型。

Acharya Vasundhara, Yener Bülent, Beamer Gillian

Rensselaer Polytechnic Institute, Troy, USA.

Professor, Rensselaer Polytechnic Institute, Troy, USA.

Sci Rep. 2025 Aug 10;15(1):29274. doi: 10.1038/s41598-025-13697-7.

The development and refinement of artificial intelligence (AI) and machine learning algorithms have been an area of intense research in radiology and pathology, particularly for automated or computer-aided diagnosis. Whole Slide Imaging (WSI) has emerged as a promising tool for developing and utilizing such algorithms in diagnostic and experimental pathology. However, patch-wise analysis of WSIs often falls short of capturing the intricate cell-level interactions within local microenvironment. A robust alternative to address this limitation involves leveraging cell graph representations, thereby enabling a more detailed analysis of local cell interactions. These cell graphs encapsulate the local spatial arrangement of cells in histopathology images, a factor proven to have significant prognostic value. Graph Neural Networks (GNNs) can effectively utilize these spatial feature representations and other features, demonstrating promising performance across classification tasks of varying complexities. It is also feasible to distill the knowledge acquired by deep neural networks to smaller student models through knowledge distillation (KD), achieving goals such as model compression and performance enhancement. Traditional approaches for constructing cell graphs generally rely on edge thresholds defined by sparsity/density or the assumption that nearby cells interact. However, such methods may fail to capture biologically meaningful interactions. Additionally, existing works in knowledge distillation primarily focus on distilling knowledge between neural networks. We designed cell graphs with biologically informed edge thresholds or criteria to address these limitations, moving beyond density/sparsity-based definitions. Furthermore, we demonstrated that student models do not need to be neural networks. Even non-neural models can learn from a neural network teacher. We evaluated our approach across varying dataset complexities, including the presence or absence of distribution shifts, varying degrees of imbalance, and different levels of graph complexity for training GNNs. We also investigated whether softened probabilities obtained from calibrated logits offered better guidance than raw logits. Our experiments revealed that the teacher's guidance was effective when distribution shifts existed in the data. The teacher model demonstrated decent performance due to its higher complexity and ability to use cell graph structures and features. Its logits provided rich information and regularization to students, mitigating the risk of overfitting the training distribution. We also examined the differences in feature importance between student models trained with the teacher's logits and their counterparts trained on hard labels. In particular, the student model demonstrated a stronger emphasis on morphological features in the Tuberculosis (TB) dataset than the models trained with hard labels. This emphasis aligns closely with the features that pathologists typically prioritize for diagnostic purposes. Future work could explore designing alternative teacher models, evaluating the proposed approach on larger datasets, and investigating causal knowledge distillation as a potential extension.

人工智能（AI）和机器学习算法的发展与完善一直是放射学和病理学领域的研究热点，特别是在自动或计算机辅助诊断方面。全切片成像（WSI）已成为在诊断和实验病理学中开发和应用此类算法的一种有前景的工具。然而，对WSI进行逐块分析往往无法捕捉局部微环境中复杂的细胞水平相互作用。解决这一局限性的一个有力替代方法是利用细胞图表示，从而能够对局部细胞相互作用进行更详细的分析。这些细胞图封装了组织病理学图像中细胞的局部空间排列，这一因素已被证明具有重要的预后价值。图神经网络（GNN）可以有效地利用这些空间特征表示和其他特征，在各种复杂程度的分类任务中都表现出了良好的性能。通过知识蒸馏（KD）将深度神经网络获得的知识提炼到较小的学生模型中也是可行的，从而实现模型压缩和性能提升等目标。传统的构建细胞图的方法通常依赖于由稀疏性/密度定义的边缘阈值或附近细胞相互作用的假设。然而，这些方法可能无法捕捉到生物学上有意义的相互作用。此外，现有的知识蒸馏工作主要集中在神经网络之间的知识提炼。我们设计了具有生物学意义的边缘阈值或标准的细胞图来解决这些局限性，超越了基于密度/稀疏性的定义。此外，我们证明学生模型不一定需要是神经网络。即使是非神经模型也可以从神经网络教师那里学习。我们在不同数据集复杂度下评估了我们的方法，包括是否存在分布偏移、不同程度的不平衡以及训练GNN时不同级别的图复杂度。我们还研究了从校准后的对数its得到的软化概率是否比原始对数its提供更好的指导。我们的实验表明，当数据中存在分布偏移时，教师的指导是有效的。教师模型由于其更高的复杂度以及使用细胞图结构和特征的能力而表现出良好的性能。它的对数its为学生提供了丰富的信息和正则化，降低了过度拟合训练分布的风险。我们还研究了用教师的对数its训练的学生模型与用硬标签训练的对应模型在特征重要性上的差异。特别是，在结核病（TB）数据集中，学生模型比用硬标签训练的模型更加强调形态学特征。这种强调与病理学家通常用于诊断目的的优先特征密切相关。未来的工作可以探索设计替代的教师模型，在更大的数据集上评估所提出的方法，并研究因果知识蒸馏作为一种潜在的扩展。

相似文献

Distilling knowledge from graph neural networks trained on cell graphs to non-neural student models.

Sci Rep. 2025 Aug 10;15(1):29274. doi: 10.1038/s41598-025-13697-7.

Prescription of Controlled Substances: Benefits and Risks

Short-Term Memory Impairment

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

Sexual Harassment and Prevention Training

Edges are all you need: Potential of medical time series analysis on complete blood count data with graph neural networks.

PLoS One. 2025 Jul 8;20(7):e0327636. doi: 10.1371/journal.pone.0327636. eCollection 2025.

Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.

Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.

[Volume and health outcomes: evidence from systematic reviews and from evaluation of Italian hospital data].

Epidemiol Prev. 2013 Mar-Jun;37(2-3 Suppl 2):1-100.

The Lived Experience of Autistic Adults in Employment: A Systematic Search and Synthesis.

Autism Adulthood. 2024 Dec 2;6(4):495-509. doi: 10.1089/aut.2022.0114. eCollection 2024 Dec.

The use of Open Dialogue in Trauma Informed Care services for mental health consumers and their family networks: A scoping review.

J Psychiatr Ment Health Nurs. 2024 Aug;31(4):681-698. doi: 10.1111/jpm.13023. Epub 2024 Jan 17.

本文引用的文献

C2P-GCN: Cell-to-Patch Graph Convolutional Network for Colorectal Cancer Grading.

Annu Int Conf IEEE Eng Med Biol Soc. 2024 Jul;2024:1-4. doi: 10.1109/EMBC53108.2024.10782435.

Spatial transcriptomic sequencing reveals immune microenvironment features of granulomas in lung and omentum.

Theranostics. 2024 Sep 23;14(16):6185-6201. doi: 10.7150/thno.99038. eCollection 2024.

Decoupled graph knowledge distillation: A general logits-based method for learning MLPs on graphs.

Neural Netw. 2024 Nov;179:106567. doi: 10.1016/j.neunet.2024.106567. Epub 2024 Jul 23.

Multi-Class Cell Detection Using Spatial Context Representation.

Proc IEEE Int Conf Comput Vis. 2021 Oct;2021:3985-3994. doi: 10.1109/iccv48922.2021.00397. Epub 2022 Feb 28.

Prediction of Tuberculosis From Lung Tissue Images of Diversity Outbred Mice Using Jump Knowledge Based Cell Graph Neural Network.

IEEE Access. 2024;12:17164-17194. doi: 10.1109/access.2024.3359989. Epub 2024 Jan 30.

Prognostic value and distribution pattern of tumor infiltrating lymphocytes and their subsets in distant metastases of advanced breast cancer.

Clin Breast Cancer. 2024 Apr;24(3):e167-e176. doi: 10.1016/j.clbc.2023.12.011. Epub 2023 Dec 30.

Graph Random Forest: A Graph Embedded Algorithm for Identifying Highly Connected Important Features.

Biomolecules. 2023 Jul 20;13(7):1153. doi: 10.3390/biom13071153.

Structure-Aware DropEdge Toward Deep Graph Convolutional Networks.

IEEE Trans Neural Netw Learn Syst. 2024 Nov;35(11):15565-15577. doi: 10.1109/TNNLS.2023.3288484. Epub 2024 Oct 29.

Use and misuse of random forest variable importance metrics in medicine: demonstrations through incident stroke prediction.

BMC Med Res Methodol. 2023 Jun 19;23(1):144. doi: 10.1186/s12874-023-01965-x.

Evaluating explainability for graph neural networks.

Sci Data. 2023 Mar 18;10(1):144. doi: 10.1038/s41597-023-01974-x.

Suppr 超能文献

核心技术专利：CN118964589B侵权必究

相似文献

Distilling knowledge from graph neural networks trained on cell graphs to non-neural student models.

Sci Rep. 2025 Aug 10;15(1):29274. doi: 10.1038/s41598-025-13697-7.

Prescription of Controlled Substances: Benefits and Risks

Short-Term Memory Impairment

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

Sexual Harassment and Prevention Training

Edges are all you need: Potential of medical time series analysis on complete blood count data with graph neural networks.

PLoS One. 2025 Jul 8;20(7):e0327636. doi: 10.1371/journal.pone.0327636. eCollection 2025.

Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.

Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.

[Volume and health outcomes: evidence from systematic reviews and from evaluation of Italian hospital data].

Epidemiol Prev. 2013 Mar-Jun;37(2-3 Suppl 2):1-100.

The Lived Experience of Autistic Adults in Employment: A Systematic Search and Synthesis.

Autism Adulthood. 2024 Dec 2;6(4):495-509. doi: 10.1089/aut.2022.0114. eCollection 2024 Dec.

The use of Open Dialogue in Trauma Informed Care services for mental health consumers and their family networks: A scoping review.

J Psychiatr Ment Health Nurs. 2024 Aug;31(4):681-698. doi: 10.1111/jpm.13023. Epub 2024 Jan 17.

本文引用的文献

C2P-GCN: Cell-to-Patch Graph Convolutional Network for Colorectal Cancer Grading.

Annu Int Conf IEEE Eng Med Biol Soc. 2024 Jul;2024:1-4. doi: 10.1109/EMBC53108.2024.10782435.

Spatial transcriptomic sequencing reveals immune microenvironment features of granulomas in lung and omentum.

Theranostics. 2024 Sep 23;14(16):6185-6201. doi: 10.7150/thno.99038. eCollection 2024.

Decoupled graph knowledge distillation: A general logits-based method for learning MLPs on graphs.

Neural Netw. 2024 Nov;179:106567. doi: 10.1016/j.neunet.2024.106567. Epub 2024 Jul 23.

Multi-Class Cell Detection Using Spatial Context Representation.

Proc IEEE Int Conf Comput Vis. 2021 Oct;2021:3985-3994. doi: 10.1109/iccv48922.2021.00397. Epub 2022 Feb 28.

Prediction of Tuberculosis From Lung Tissue Images of Diversity Outbred Mice Using Jump Knowledge Based Cell Graph Neural Network.

IEEE Access. 2024;12:17164-17194. doi: 10.1109/access.2024.3359989. Epub 2024 Jan 30.

Prognostic value and distribution pattern of tumor infiltrating lymphocytes and their subsets in distant metastases of advanced breast cancer.

Clin Breast Cancer. 2024 Apr;24(3):e167-e176. doi: 10.1016/j.clbc.2023.12.011. Epub 2023 Dec 30.

Graph Random Forest: A Graph Embedded Algorithm for Identifying Highly Connected Important Features.

Biomolecules. 2023 Jul 20;13(7):1153. doi: 10.3390/biom13071153.

Structure-Aware DropEdge Toward Deep Graph Convolutional Networks.

IEEE Trans Neural Netw Learn Syst. 2024 Nov;35(11):15565-15577. doi: 10.1109/TNNLS.2023.3288484. Epub 2024 Oct 29.

Use and misuse of random forest variable importance metrics in medicine: demonstrations through incident stroke prediction.

BMC Med Res Methodol. 2023 Jun 19;23(1):144. doi: 10.1186/s12874-023-01965-x.

Evaluating explainability for graph neural networks.

Sci Data. 2023 Mar 18;10(1):144. doi: 10.1038/s41597-023-01974-x.

Distilling knowledge from graph neural networks trained on cell graphs to non-neural student models.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献