Wang Jian, Tian HongTian, Yang Xin, Wu HuaiYu, Zhu XiLiang, Chen RuSi, Chang Ao, Chen YanLin, Dou HaoRan, Huang RuoBing, Cheng Jun, Zhou YongSong, Gao Rui, Yang KeEn, Li GuoQiu, Chen Jing, Ni Dong, Xu JinFeng, Gu Ning, Dong FaJin
Key Laboratory for Bio-Electromagnetic Environment and Advanced Medical Theranostics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, China.
College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China.
Radiol Artif Intell. 2025 Jul;7(4):e240625. doi: 10.1148/ryai.240625.
Purpose To develop and evaluate an artificial intelligence (AI) system for generating breast US reports. Materials and Methods This retrospective study included 104 364 cases from three hospitals (January 2020-December 2022). The AI system was trained on 82 896 cases, validated on 10 385 cases, and tested on an internal set (10 383 cases) and two external sets (300 and 400 cases). Under blind review, three senior radiologists (each with >10 years of experience) evaluated AI-generated reports and those written by one midlevel radiologist (with 7 years of experience), as well as reports from three junior radiologists (each with 2-3 years of experience) with and without AI assistance. The primary outcomes included the acceptance rates of Breast Imaging Reporting and Data System (BI-RADS) categories and lesion characteristics. Statistical analysis included one-sided and two-sided McNemar tests for noninferiority and significance testing. Results In external test set 1 (300 cases), the midlevel radiologist and AI system achieved BI-RADS acceptance rates of 95.00% (285 of 300) versus 92.33% (277 of 300) ( < .001, noninferiority test with a prespecified margin of 10%). In external test set 2 (400 cases), three junior radiologists had BI-RADS acceptance rates of 87.00% (348 of 400) versus 90.75% (363 of 400) ( = .06), 86.50% (346 of 400) versus 92.00% (368 of 400) ( = .007), and 84.75% (339 of 400) versus 90.25% (361 of 400) ( = .02) without and with AI assistance, respectively. Conclusion The AI system performed comparably to a midlevel radiologist and aided junior radiologists in BI-RADS classification. Neural Networks, Computer-aided Diagnosis, CAD, Ultrasound © RSNA, 2025.