用于头影测量标志点检测和治疗预测的多模态深度学习

Multimodal deep learning for cephalometric landmark detection and treatment prediction.

作者信息

Gao Fei, Tang Yulong

机构信息

Department of Stomatology, General Hospital of PLA Northern Theater Command, Shenyang, 110002, Liaoning, China.

出版信息

Sci Rep. 2025 Jul 12;15(1):25205. doi: 10.1038/s41598-025-06229-w.

DOI:10.1038/s41598-025-06229-w

PMID:40651957

Abstract

In orthodontics and maxillofacial surgery, accurate cephalometric analysis and treatment outcome prediction are critical for clinical decision-making. Traditional approaches rely on manual landmark identification, which is time-consuming and subject to inter-observer variability, while existing automated methods typically utilize single imaging modalities with limited accuracy. This paper presents DeepFuse, a novel multi-modal deep learning framework that integrates information from lateral cephalograms, CBCT volumes, and digital dental models to simultaneously perform landmark detection and treatment outcome prediction. The framework employs modality-specific encoders, an attention-guided fusion mechanism, and dual-task decoders to leverage complementary information across imaging techniques. Extensive experiments on three clinical datasets demonstrate that DeepFuse achieves a mean radial error of 1.21 mm for landmark detection, representing a 13% improvement over state-of-the-art methods, with a clinical acceptability rate of 92.4% at the 2 mm threshold. For treatment outcome prediction, the framework attains an overall accuracy of 85.6%, significantly outperforming both conventional prediction models and experienced clinicians. The proposed approach enhances diagnostic precision and treatment planning while providing interpretable visualization of decision factors, demonstrating significant potential for clinical integration in orthodontic and maxillofacial practice.

摘要

在正畸学和颌面外科中，精确的头影测量分析和治疗结果预测对于临床决策至关重要。传统方法依赖于手动识别标志点，既耗时又存在观察者间的差异，而现有的自动化方法通常使用单一成像模式，准确性有限。本文提出了DeepFuse，这是一种新颖的多模态深度学习框架，它整合了侧位头影图、CBCT容积和数字牙科模型的信息，以同时进行标志点检测和治疗结果预测。该框架采用特定模态的编码器、注意力引导融合机制和双任务解码器，以利用跨成像技术的互补信息。在三个临床数据集上进行的大量实验表明，DeepFuse在标志点检测方面实现了1.21毫米的平均径向误差，比现有最先进方法提高了13%，在2毫米阈值下的临床可接受率为92.4%。对于治疗结果预测，该框架的总体准确率达到85.6%，显著优于传统预测模型和经验丰富的临床医生。所提出的方法提高了诊断精度和治疗计划，同时提供了决策因素的可解释可视化，显示出在正畸和颌面实践中临床整合的巨大潜力。

相似文献

Multimodal deep learning for cephalometric landmark detection and treatment prediction.

Sci Rep. 2025 Jul 12;15(1):25205. doi: 10.1038/s41598-025-06229-w.

Deep learning for cephalometric landmark detection: systematic review and meta-analysis.

Clin Oral Investig. 2021 Jul;25(7):4299-4309. doi: 10.1007/s00784-021-03990-w. Epub 2021 May 27.

An open-source deep learning framework for respiratory motion monitoring and volumetric imaging during radiation therapy.

Med Phys. 2025 Jul;52(7):e18015. doi: 10.1002/mp.18015.

Noise-aware system generative model (NASGM): positron emission tomography (PET) image simulation framework with observer validation studies.

Med Phys. 2025 Jul;52(7):e17962. doi: 10.1002/mp.17962.

Towards Better Cephalometric Landmark Detection With Diffusion Data Generation.

IEEE Trans Med Imaging. 2025 Jul;44(7):2784-2794. doi: 10.1109/TMI.2025.3557430.

A cross-temporal multimodal fusion system based on deep learning for orthodontic monitoring.

Comput Biol Med. 2024 Sep;180:109025. doi: 10.1016/j.compbiomed.2024.109025. Epub 2024 Aug 18.

Ultrasound-guided versus anatomic landmark-guided percutaneous femoral artery access.

Cochrane Database Syst Rev. 2025 Mar 28;3(3):CD014594. doi: 10.1002/14651858.CD014594.pub2.

Structural semantic-guided MR synthesis from PET images via a dual cross-attention mechanism.

Med Phys. 2025 Jul;52(7):e17957. doi: 10.1002/mp.17957.

Accuracy of automated 3D cephalometric landmarks by deep learning algorithms: systematic review and meta-analysis.

Radiol Med. 2023 May;128(5):544-555. doi: 10.1007/s11547-023-01629-2. Epub 2023 Apr 24.

Short-Term Memory Impairment

本文引用的文献

MHSA-Net: Multihead Self-Attention Network for Occluded Person Re-Identification.

IEEE Trans Neural Netw Learn Syst. 2023 Nov;34(11):8210-8224. doi: 10.1109/TNNLS.2022.3144163. Epub 2023 Oct 27.

Fusion of medical imaging and electronic health records using deep learning: a systematic review and implementation guidelines.

NPJ Digit Med. 2020 Oct 16;3:136. doi: 10.1038/s41746-020-00341-z. eCollection 2020.

Automated cephalometric landmark detection with confidence regions using Bayesian convolutional neural networks.

BMC Oral Health. 2020 Oct 7;20(1):270. doi: 10.1186/s12903-020-01256-7.

Artificial Intelligence in Dentistry: Chances and Challenges.

J Dent Res. 2020 Jul;99(7):769-774. doi: 10.1177/0022034520915714. Epub 2020 Apr 21.

Deep Learning for Cardiac Image Segmentation: A Review.

Front Cardiovasc Med. 2020 Mar 5;7:25. doi: 10.3389/fcvm.2020.00025. eCollection 2020.

Artificial intelligence in orthodontics : Evaluation of a fully automated cephalometric analysis using a customized convolutional neural network.

J Orofac Orthop. 2020 Jan;81(1):52-68. doi: 10.1007/s00056-019-00203-8. Epub 2019 Dec 18.

Context-guided fully convolutional networks for joint craniomaxillofacial bone segmentation and landmark digitization.

Med Image Anal. 2020 Feb;60:101621. doi: 10.1016/j.media.2019.101621. Epub 2019 Nov 23.

Automated identification of cephalometric landmarks:

Angle Orthod. 2020 Jan;90(1):69-76. doi: 10.2319/022019-129.1. Epub 2019 Jul 22.

Automated identification of cephalometric landmarks: .

Angle Orthod. 2019 Nov;89(6):903-909. doi: 10.2319/022019-127.1. Epub 2019 Jul 8.

Deep Geodesic Learning for Segmentation and Anatomical Landmarking.

IEEE Trans Med Imaging. 2019 Apr;38(4):919-931. doi: 10.1109/TMI.2018.2875814. Epub 2018 Oct 12.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于头影测量标志点检测和治疗预测的多模态深度学习

Multimodal deep learning for cephalometric landmark detection and treatment prediction.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献