• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用大语言模型检测新病变:在真实世界多发性硬化症数据集中的应用

Detecting New Lesions Using a Large Language Model: Applications in Real-World Multiple Sclerosis Datasets.

作者信息

Poole Shane, Sisodia Nikki, Koshal Kanishka, Henderson Kyra, Wijangco Jaeleene, Paredes Danelvis, Chen Chelsea, Rowles William, Akula Amit, Wuerfel Jens, Sharma Vishakha, Rauschecker Andreas M, Henry Roland G, Bove Riley

机构信息

UCSF Weill Institute for Neurosciences, University of California San Francisco, San Francisco, CA, USA.

F. Hoffmann-La Roche, Basel, Switzerland.

出版信息

Ann Neurol. 2025 Aug;98(2):308-316. doi: 10.1002/ana.27251. Epub 2025 Apr 25.

DOI:10.1002/ana.27251
PMID:40277428
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12278027/
Abstract

OBJECTIVE

Neuroimaging is routinely utilized to identify new inflammatory activity in multiple sclerosis (MS). A large language model to classify narrative magnetic resonance imaging reports in the electronic health record (EHR) as discrete data could provide significant benefits for MS research. The objectives of the current study were to develop such a prompt and to illustrate its research applications through a common clinical scenario: monitoring response to B-cell depleting therapy (BCDT).

METHODS

An institutional ecosystem that securely connects healthcare data with ChatGPT4 was applied to clinical MS magnetic resonance imaging reports in a single institutional EHR (2000-2022). A prompt (msLesionprompt) was developed and iteratively refined to classify the presence or absence of new T2-weighted lesions (newT2w) and contrast-enhancing lesions (CEL). The multistep validation included evaluating efficiency (time and cost), comparison with manually annotated reports using standard confusion matrix, and application to identifying predictors of newT2w/CEL after BCDT start.

RESULTS

Accuracy of msLesionprompt was high for detection of newT2w (97%) and CEL (96.8%). All 14,888 available reports were categorized in 4.13 hours ($28); 79% showed no newT2w or CEL. Data extracted showed expected suppression of new activity by BCDT (>97% monitoring magnetic resonance images after an initial "rebaseline" scan). Neighborhood poverty (Area Deprivation Index) was identified as a predictor of inflammatory activity (newT2w: OR 1.69, 95% CI 1.10-2.59, p = 0.017; CEL: OR 1.54, 95% CI 1.01-2.34, p = 0.046).

INTERPRETATION

Extracting discrete information from narrative imaging reports using an large language model is feasible and efficient. This approach could augment many real-world analyses of MS disease evolution and treatment response. ANN NEUROL 2025;98:308-316.

摘要

目的

神经影像学常用于识别多发性硬化症(MS)中的新炎症活动。一个将电子健康记录(EHR)中的叙述性磁共振成像报告分类为离散数据的大语言模型可为MS研究带来显著益处。本研究的目的是开发这样一个提示,并通过一个常见的临床场景说明其研究应用:监测对B细胞耗竭疗法(BCDT)的反应。

方法

一个将医疗数据与ChatGPT4安全连接的机构生态系统应用于单一机构EHR(2000 - 2022年)中的临床MS磁共振成像报告。开发并迭代完善了一个提示(msLesionprompt),以对新的T2加权病变(newT2w)和强化病变(CEL)的有无进行分类。多步骤验证包括评估效率(时间和成本)、使用标准混淆矩阵与人工标注报告进行比较,以及应用于识别BCDT开始后newT2w/CEL的预测因素。

结果

msLesionprompt检测newT2w(97%)和CEL(96.8%)的准确率很高。所有14888份可用报告在4.13小时内(花费28美元)完成分类;79%的报告未显示新的newT2w或CEL。提取的数据显示BCDT对新活动有预期的抑制作用(初始“重新基线”扫描后>97%的监测磁共振图像)。社区贫困(地区剥夺指数)被确定为炎症活动的一个预测因素(newT2w:比值比1.69,95%置信区间1.10 - 2.59,p = 0.017;CEL:比值比1.54,95%置信区间1.01 - 2.34,p = 0.046)。

解读

使用大语言模型从叙述性影像报告中提取离散信息是可行且高效的。这种方法可以增强对MS疾病演变和治疗反应的许多真实世界分析。《神经病学纪事》2025年;98:308 - 316。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dd4/12278027/5031fd4fd016/ANA-98-308-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dd4/12278027/a8960b1daf7b/ANA-98-308-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dd4/12278027/b29cf584815e/ANA-98-308-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dd4/12278027/5031fd4fd016/ANA-98-308-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dd4/12278027/a8960b1daf7b/ANA-98-308-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dd4/12278027/b29cf584815e/ANA-98-308-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dd4/12278027/5031fd4fd016/ANA-98-308-g003.jpg

相似文献

1
Detecting New Lesions Using a Large Language Model: Applications in Real-World Multiple Sclerosis Datasets.使用大语言模型检测新病变:在真实世界多发性硬化症数据集中的应用
Ann Neurol. 2025 Aug;98(2):308-316. doi: 10.1002/ana.27251. Epub 2025 Apr 25.
2
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
3
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
4
Contrast-enhanced ultrasound using SonoVue® (sulphur hexafluoride microbubbles) compared with contrast-enhanced computed tomography and contrast-enhanced magnetic resonance imaging for the characterisation of focal liver lesions and detection of liver metastases: a systematic review and cost-effectiveness analysis.超声造影使用声诺维®(六氟化硫微泡)与对比增强计算机断层扫描和对比增强磁共振成像在局灶性肝脏病变的特征描述和肝转移检测中的比较:系统评价和成本效益分析。
Health Technol Assess. 2013 Apr;17(16):1-243. doi: 10.3310/hta17160.
5
Siponimod for multiple sclerosis.西尼莫德用于多发性硬化症。
Cochrane Database Syst Rev. 2021 Nov 16;11(11):CD013647. doi: 10.1002/14651858.CD013647.pub2.
6
Eliciting adverse effects data from participants in clinical trials.从临床试验参与者中获取不良反应数据。
Cochrane Database Syst Rev. 2018 Jan 16;1(1):MR000039. doi: 10.1002/14651858.MR000039.pub2.
7
Azathioprine for people with multiple sclerosis.硫唑嘌呤用于多发性硬化症患者。
Cochrane Database Syst Rev. 2024 Dec 9;12(12):CD015005. doi: 10.1002/14651858.CD015005.pub2.
8
Magnetic resonance perfusion for differentiating low-grade from high-grade gliomas at first presentation.首次就诊时磁共振灌注成像用于鉴别低级别与高级别胶质瘤
Cochrane Database Syst Rev. 2018 Jan 22;1(1):CD011551. doi: 10.1002/14651858.CD011551.pub2.
9
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
10
Artificial intelligence for diagnosing exudative age-related macular degeneration.人工智能在渗出性年龄相关性黄斑变性诊断中的应用。
Cochrane Database Syst Rev. 2024 Oct 17;10(10):CD015522. doi: 10.1002/14651858.CD015522.pub2.

本文引用的文献

1
CORAL: Expert-Curated Oncology Reports to Advance Language Model Inference.CORAL:经专家策划的肿瘤学报告,以推进语言模型推理。
NEJM AI. 2024 Apr;1(4). doi: 10.1056/aidbp2300110. Epub 2024 Mar 13.
2
Can ChatGPT4-vision identify radiologic progression of multiple sclerosis on brain MRI?ChatGPT4-vision能否识别脑磁共振成像上多发性硬化症的放射学进展?
Eur Radiol Exp. 2025 Jan 15;9(1):9. doi: 10.1186/s41747-024-00547-w.
3
Integrating large language models in care, research, and education in multiple sclerosis management.将大型语言模型整合到多发性硬化症管理的护理、研究和教育中。
Mult Scler. 2024 Oct;30(11-12):1392-1401. doi: 10.1177/13524585241277376. Epub 2024 Sep 23.
4
Performance of an Open-Source Large Language Model in Extracting Information from Free-Text Radiology Reports.开源大语言模型从自由文本放射学报告中提取信息的性能。
Radiol Artif Intell. 2024 Jul;6(4):e230364. doi: 10.1148/ryai.230364.
5
ChatGPT vs. neurologists: a cross-sectional study investigating preference, satisfaction ratings and perceived empathy in responses among people living with multiple sclerosis.ChatGPT 与神经科医生:一项横断面研究,调查多发性硬化症患者对偏好、满意度评分和感知同理心的反应。
J Neurol. 2024 Jul;271(7):4057-4066. doi: 10.1007/s00415-024-12328-x. Epub 2024 Apr 3.
6
Quantitative Evaluation of Large Language Models to Streamline Radiology Report Impressions: A Multimodal Retrospective Analysis.大语言模型在简化放射科报告印象方面的定量评估:一项多模态回顾性分析。
Radiology. 2024 Mar;310(3):e231593. doi: 10.1148/radiol.231593.
7
Can ChatGPT pass the MRCP (UK) written examinations? Analysis of performance and errors using a clinical decision-reasoning framework.ChatGPT 能否通过英国皇家内科医师学会会员资格考试(MRCP(UK))?使用临床决策推理框架分析表现和错误。
BMJ Open. 2024 Mar 15;14(3):e080558. doi: 10.1136/bmjopen-2023-080558.
8
From jargon to clarity: Improving the readability of foot and ankle radiology reports with an artificial intelligence large language model.从行话到清晰明了:利用人工智能大语言模型提高足踝放射学报告的可读性
Foot Ankle Surg. 2024 Jun;30(4):331-337. doi: 10.1016/j.fas.2024.01.008. Epub 2024 Feb 5.
9
Disparities by Race in Pregnancy Care and Clinical Outcomes in Women With Multiple Sclerosis: A Diverse Multicenter Cohort.种族差异与多发性硬化症女性妊娠护理和临床结局:一个多样化的多中心队列。
Neurology. 2024 Feb 27;102(4):e208100. doi: 10.1212/WNL.0000000000208100. Epub 2024 Jan 23.
10
A Comparison of a Large Language Model vs Manual Chart Review for the Extraction of Data Elements From the Electronic Health Record.大型语言模型与人工病历审查在从电子健康记录中提取数据元素方面的比较
Gastroenterology. 2024 Apr;166(4):707-709.e3. doi: 10.1053/j.gastro.2023.12.019. Epub 2023 Dec 25.