Rahul Sheshanarayana, Fengqi You
College of Engineering, Cornell University, Ithaca, NY, 14853, USA.
Robert Frederick Smith School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY, 14853, USA.
Adv Sci (Weinh). 2025 Jun;12(22):e2503271. doi: 10.1002/advs.202503271. Epub 2025 Apr 9.
Knowledge distillation (KD) is a powerful model compression technique that transfers knowledge from complex teacher models to compact student models, reducing computational costs while preserving predictive accuracy. This study investigated KD's efficacy in molecular property prediction across domain-specific and cross-domain tasks, leveraging state-of-the-art graph neural networks (SchNet, DimeNet++, and TensorNet). In the domain-specific setting, KD improved regression performance across diverse quantum mechanical properties in the QM9 dataset, with DimeNet++ student models achieving up to a 90% improvement over non-KD baselines. Notably, in certain cases, student models achieved comparable or even superior improvements while being 2× smaller than their teachers, highlighting KD's ability to enhance efficiency without sacrificing predictive performance. Cross-domain evaluations further demonstrated KD's adaptability: embeddings from QM9-trained teacher models enhanced predictions for ESOL (logS) and FreeSolv (ΔG), with SchNet exhibiting the highest gains of ≈65% in logS predictions. Embedding analysis revealed substantial student-teacher alignment gains, with the relative shift in cosine similarity distribution peaks reaching up to 1.0 across student models. These findings highlight KD as a robust strategy for enhancing molecular representation learning, with implications for cheminformatics, materials science, and drug discovery.
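The abstract does not specify the distillation objective used in the study; the sketch below illustrates one common feature-based KD setup for molecular property regression, assuming a frozen teacher and a smaller student that each return an embedding and a scalar prediction. All module names, the batch layout, and the loss weighting `alpha` are illustrative, not the paper's implementation.

```python
# Minimal sketch of feature-based knowledge distillation for molecular
# property regression (hypothetical setup; not the paper's exact code).
import torch
import torch.nn.functional as F

def distillation_step(teacher, student, batch, optimizer, alpha=0.5):
    """One KD training step: a supervised regression loss on the true
    property plus an embedding-alignment term toward the teacher."""
    teacher.eval()
    with torch.no_grad():
        # The frozen teacher provides target embeddings (and predictions).
        t_emb, t_pred = teacher(batch)

    s_emb, s_pred = student(batch)

    # Supervised loss on the ground-truth property (e.g., a QM9 target).
    task_loss = F.l1_loss(s_pred, batch["y"])

    # Distillation loss: pull student embeddings toward teacher embeddings.
    # (A projection layer would be needed if their dimensions differ.)
    distill_loss = 1.0 - F.cosine_similarity(s_emb, t_emb, dim=-1).mean()

    loss = (1 - alpha) * task_loss + alpha * distill_loss
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```

In this formulation the same recipe covers both settings described above: for domain-specific KD, teacher and student are trained on the same QM9 target; for cross-domain KD, the QM9-trained teacher supplies embeddings while the student is supervised on ESOL or FreeSolv labels.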
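The reported "relative shift in cosine similarity distribution peaks" can be made concrete with a small analysis sketch: compute per-molecule student-teacher cosine similarities before and after distillation, histogram them, and compare the distribution peaks. This is an assumed reading of the metric; array names and binning are placeholders.

```python
# Hypothetical sketch of the embedding-alignment analysis described in the
# abstract (peak of the student-teacher cosine similarity distribution).
import numpy as np

def cosine_similarity_peak(student_emb, teacher_emb, bins=100):
    """Return the center of the most populated histogram bin of the
    per-sample cosine similarities between two (N, D) embedding arrays."""
    num = (student_emb * teacher_emb).sum(axis=1)
    denom = np.linalg.norm(student_emb, axis=1) * np.linalg.norm(teacher_emb, axis=1)
    cos = num / np.clip(denom, 1e-12, None)
    counts, edges = np.histogram(cos, bins=bins, range=(-1.0, 1.0))
    centers = 0.5 * (edges[:-1] + edges[1:])
    return centers[np.argmax(counts)]

# Relative peak shift attributable to KD (the abstract reports values up to 1.0):
# shift = cosine_similarity_peak(emb_after_kd, teacher_emb) \
#       - cosine_similarity_peak(emb_before_kd, teacher_emb)
```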