• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用自然语言处理技术在非结构化医疗记录中改善罕见病的早期诊断:以德雷维特综合征为例

Improving early diagnosis of rare diseases using Natural Language Processing in unstructured medical records: an illustration from Dravet syndrome.

作者信息

Lo Barco Tommaso, Kuchenbuch Mathieu, Garcelon Nicolas, Neuraz Antoine, Nabbout Rima

机构信息

Department of Pediatric Neurology, Necker-Enfants Malades Hospital, APHP, Centre de Référence Épilepsies Rares, Member of ERN EPICARE, Université de Paris, Paris, France.

Child Neuropsychiatry, Department of Surgical Sciences, Dentistry, Gynecology and Pediatrics, University of Verona, Verona, Italy.

出版信息

Orphanet J Rare Dis. 2021 Jul 13;16(1):309. doi: 10.1186/s13023-021-01936-9.

DOI:10.1186/s13023-021-01936-9
PMID:34256808
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8278630/
Abstract

BACKGROUND

The growing use of Electronic Health Records (EHRs) is promoting the application of data mining in health-care. A promising use of big data in this field is to develop models to support early diagnosis and to establish natural history. Dravet Syndrome (DS) is a rare developmental and epileptic encephalopathy that commonly initiates in the first year of life with febrile seizures (FS). Age at diagnosis is often delayed after 2 years, as it is difficult to differentiate DS at onset from FS. We aimed to explore if some clinical terms (concepts) are significantly more used in the electronic narrative medical reports of individuals with DS before the age of 2 years compared to those of individuals with FS. These concepts would allow an earlier detection of patients with DS resulting in an earlier orientation toward expert centers that can provide early diagnosis and care.

METHODS

Data were collected from the Necker Enfants Malades Hospital using a document-based data warehouse, Dr Warehouse, which employs Natural Language Processing, a computer technology consisting in processing written information. Using Unified Medical Language System Meta-thesaurus, phenotype concepts can be recognized in medical reports. We selected individuals with DS (DS Cohort) and individuals with FS (FS Cohort) with confirmed diagnosis after the age of 4 years. A phenome-wide analysis was performed evaluating the statistical associations between the phenotypes of DS and FS, based on concepts found in the reports produced before 2 years and using a series of logistic regressions.

RESULTS

We found significative higher representation of concepts related to seizures' phenotypes distinguishing DS from FS in the first phases, namely the major recurrence of complex febrile convulsions (long-lasting and/or with focal signs) and other seizure-types. Some typical early onset non-seizure concepts also emerged, in relation to neurodevelopment and gait disorders.

CONCLUSIONS

Narrative medical reports of individuals younger than 2 years with FS contain specific concepts linked to DS diagnosis, which can be automatically detected by software exploiting NLP. This approach could represent an innovative and sustainable methodology to decrease time of diagnosis of DS and could be transposed to other rare diseases.

摘要

背景

电子健康记录(EHRs)的使用日益广泛,推动了数据挖掘在医疗保健领域的应用。大数据在该领域的一个有前景的用途是开发模型以支持早期诊断并建立自然病史。德雷维特综合征(DS)是一种罕见的发育性和癫痫性脑病,通常在生命的第一年以热性惊厥(FS)起病。诊断年龄往往在2岁后延迟,因为在发病时很难将DS与FS区分开来。我们旨在探讨与热性惊厥患者相比,2岁前患有DS的个体的电子叙述性医疗报告中是否有某些临床术语(概念)的使用频率显著更高。这些概念将有助于更早地发现DS患者,从而更早地将其转诊至能够提供早期诊断和治疗的专家中心。

方法

数据从内克尔儿童医院收集,使用基于文档的数据仓库Dr Warehouse,该仓库采用自然语言处理技术,这是一种处理书面信息的计算机技术。使用统一医学语言系统元词表,可以在医疗报告中识别表型概念。我们选择了4岁后确诊的DS患者(DS队列)和FS患者(FS队列)。基于2岁前生成的报告中发现的概念,并使用一系列逻辑回归进行全表型分析,评估DS和FS表型之间的统计关联。

结果

我们发现,在第一阶段,与区分DS和FS的癫痫表型相关的概念有显著更高的呈现,即复杂性热性惊厥的主要复发(持续时间长和/或有局灶性体征)和其他癫痫类型。还出现了一些典型的早期发作非癫痫概念,与神经发育和步态障碍有关。

结论

2岁以下FS患者的叙述性医疗报告包含与DS诊断相关的特定概念,软件利用自然语言处理可以自动检测到这些概念。这种方法可能代表一种创新且可持续的方法,以缩短DS的诊断时间,并且可以应用于其他罕见疾病。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0809/8278630/6b4e3d02a934/13023_2021_1936_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0809/8278630/6b4e3d02a934/13023_2021_1936_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0809/8278630/6b4e3d02a934/13023_2021_1936_Fig1_HTML.jpg

相似文献

1
Improving early diagnosis of rare diseases using Natural Language Processing in unstructured medical records: an illustration from Dravet syndrome.利用自然语言处理技术在非结构化医疗记录中改善罕见病的早期诊断:以德雷维特综合征为例
Orphanet J Rare Dis. 2021 Jul 13;16(1):309. doi: 10.1186/s13023-021-01936-9.
2
Natural history of rare diseases using natural language processing of narrative unstructured electronic health records: The example of Dravet syndrome.利用叙述性非结构化电子健康记录的自然语言处理技术研究罕见病的自然史:以Dravet综合征为例。
Epilepsia. 2024 Feb;65(2):350-361. doi: 10.1111/epi.17855. Epub 2023 Dec 30.
3
Outcomes and comorbidities of SCN1A-related seizure disorders.与SCN1A相关的癫痫疾病的结局与共病情况。
Epilepsy Behav. 2019 Jan;90:252-259. doi: 10.1016/j.yebeh.2018.09.041. Epub 2018 Dec 5.
4
[Complex febrile Seizures or Dravet syndrome?: Description of 3 case reports].[复杂热性惊厥还是德雷维特综合征?:3例病例报告描述]
Rev Chil Pediatr. 2014 Oct;85(5):588-93. doi: 10.4067/S0370-41062014000500010.
5
Next generation phenotyping using narrative reports in a rare disease clinical data warehouse.利用罕见病临床数据仓库中的叙述报告进行下一代表型分析。
Orphanet J Rare Dis. 2018 May 31;13(1):85. doi: 10.1186/s13023-018-0830-6.
6
Early clinical features and diagnosis of Dravet syndrome in 138 Chinese patients with SCN1A mutations.138例携带SCN1A突变的中国患者的Dravet综合征的早期临床特征与诊断
Brain Dev. 2014 Sep;36(8):676-81. doi: 10.1016/j.braindev.2013.10.004. Epub 2013 Oct 26.
7
When is a child with status epilepticus likely to have Dravet syndrome?当患有癫痫持续状态的儿童可能患有 Dravet 综合征时,应该如何处理?
Epilepsy Res. 2014 May;108(4):740-7. doi: 10.1016/j.eplepsyres.2014.02.019. Epub 2014 Mar 12.
8
Palliative epilepsy surgery in Dravet syndrome-case series and review of the literature.德雷维特综合征的姑息性癫痫手术——病例系列及文献综述
Childs Nerv Syst. 2016 Sep;32(9):1703-8. doi: 10.1007/s00381-016-3201-4. Epub 2016 Jul 27.
9
Development and content validation of a preliminary core set of patient- and caregiver-relevant outcomes for inclusion in a potential composite endpoint for Dravet Syndrome.用于纳入Dravet综合征潜在复合终点的患者及照料者相关初步核心结局集的开发与内容验证。
Epilepsy Behav. 2018 Jan;78:232-242. doi: 10.1016/j.yebeh.2017.08.029. Epub 2017 Nov 4.
10
Behavior problems and health-related quality of life in Dravet syndrome.德雷维特综合征中的行为问题与健康相关生活质量
Epilepsy Behav. 2019 Jan;90:217-227. doi: 10.1016/j.yebeh.2018.11.029. Epub 2018 Dec 19.

引用本文的文献

1
Assessing the impact of Dravet syndrome on caregivers' quality of life and perceived burden in Poland.评估在波兰,德雷维特综合征对照料者生活质量和感知负担的影响。
BMC Psychol. 2025 Jul 9;13(1):763. doi: 10.1186/s40359-025-03102-3.
2
Caregivers' experiences and challenges of the diagnostic odyssey in Dravet syndrome.护理人员在德雷维特综合征诊断过程中的经历与挑战。
Orphanet J Rare Dis. 2025 May 16;20(1):234. doi: 10.1186/s13023-025-03772-7.
3
Concept Recognition and Characterization of Patients Undergoing Resection of Vestibular Schwannoma Using Natural Language Processing.

本文引用的文献

1
Deep phenotyping unstructured data mining in an extensive pediatric database to unravel a common KCNA2 variant in neurodevelopmental syndromes.深度表型分析广泛儿科数据库中的非结构化数据挖掘,以揭示神经发育综合征中常见的 KCNA2 变异。
Genet Med. 2021 May;23(5):968-971. doi: 10.1038/s41436-020-01039-z. Epub 2021 Jan 26.
2
Axes of a revolution: challenges and promises of big data in healthcare.革命的轴心:大数据在医疗保健中的挑战与机遇。
Nat Med. 2020 Jan;26(1):29-38. doi: 10.1038/s41591-019-0727-5. Epub 2020 Jan 13.
3
The use or generation of biomedical data and existing medicines to discover and establish new treatments for patients with rare diseases - recommendations of the IRDiRC Data Mining and Repurposing Task Force.
使用自然语言处理对前庭神经鞘瘤切除患者进行概念识别与特征描述
J Neurol Surg B Skull Base. 2024 May 11;86(3):332-341. doi: 10.1055/s-0044-1786738. eCollection 2025 Jun.
4
Natural Language Understanding to Assess Oral Health-Related Quality of Life: A Cross-Sectional Study Incorporating a Mixed Methods Approach.用于评估口腔健康相关生活质量的自然语言理解:一项采用混合方法的横断面研究。
J Oral Rehabil. 2025 Sep;52(9):1377-1385. doi: 10.1111/joor.13986. Epub 2025 May 9.
5
Artificial Intelligence: A New Frontier in Rare Disease Early Diagnosis.人工智能:罕见病早期诊断的新前沿。
Cureus. 2025 Feb 22;17(2):e79487. doi: 10.7759/cureus.79487. eCollection 2025 Feb.
6
Perception of psychosocial burden in mothers of children with rare pediatric neurological diseases.患有罕见儿科神经疾病儿童的母亲对心理社会负担的感知。
Sci Rep. 2025 Feb 21;15(1):6295. doi: 10.1038/s41598-025-87251-w.
7
Development of cohort definitions and algorithms to identify patients with Lennox-Gastaut syndrome or Dravet syndrome from real-world administrative healthcare databases.制定队列定义和算法,以便从真实世界的行政医疗保健数据库中识别患有伦诺克斯-加斯托综合征或德热综合征的患者。
Heliyon. 2024 Dec 25;11(3):e41486. doi: 10.1016/j.heliyon.2024.e41486. eCollection 2025 Feb 15.
8
Internet-Based Abnormal Chromosomal Diagnosis During Pregnancy Using a Noninvasive Innovative Approach to Detecting Chromosomal Abnormalities in the Fetus: Scoping Review.基于互联网的孕期染色体异常诊断:使用无创创新方法检测胎儿染色体异常的范围综述
JMIR Bioinform Biotechnol. 2024 Oct 16;5:e58439. doi: 10.2196/58439.
9
Fostering Tomorrow: Uniting Artificial Intelligence and Social Pediatrics for Comprehensive Child Well-being.培育明日:将人工智能与社会儿科学相结合,促进儿童全面福祉。
Turk Arch Pediatr. 2024 Jul 1;59(4):345-352. doi: 10.5152/TurkArchPediatr.2024.24076.
10
Objectivizing issues in the diagnosis of complex rare diseases: lessons learned from testing existing diagnosis support systems on ciliopathies.客观化复杂罕见病诊断中的问题:从对纤毛病诊断支持系统的测试中得到的经验教训。
BMC Med Inform Decis Mak. 2024 May 24;24(1):134. doi: 10.1186/s12911-024-02538-8.
利用或生成生物医学数据和现有药物,为罕见病患者发现和建立新的治疗方法——IRDiRC 数据挖掘和再利用工作组的建议。
Orphanet J Rare Dis. 2019 Oct 15;14(1):225. doi: 10.1186/s13023-019-1193-3.
4
Perception of impact of Dravet syndrome on children and caregivers in multiple countries: looking beyond seizures.多国家儿童和照料者对德雷维综合征影响的认知:超越癫痫发作。
Dev Med Child Neurol. 2019 Oct;61(10):1229-1236. doi: 10.1111/dmcn.14186. Epub 2019 Mar 4.
5
Evaluation and accurate diagnoses of pediatric diseases using artificial intelligence.人工智能在儿科疾病评估和精准诊断中的应用。
Nat Med. 2019 Mar;25(3):433-438. doi: 10.1038/s41591-018-0335-9. Epub 2019 Feb 11.
6
Big data in IBD: a look into the future.炎症性肠病中的大数据:展望未来。
Nat Rev Gastroenterol Hepatol. 2019 May;16(5):312-321. doi: 10.1038/s41575-019-0102-5.
7
Motor development in children with Dravet syndrome.Dravet 综合征患儿的运动发育。
Dev Med Child Neurol. 2019 Aug;61(8):950-956. doi: 10.1111/dmcn.14147. Epub 2019 Jan 15.
8
A Global Overview of Precision Medicine in Type 2 Diabetes.2 型糖尿病精准医学的全球概述。
Diabetes. 2018 Oct;67(10):1911-1922. doi: 10.2337/dbi17-0045.
9
Next generation phenotyping using narrative reports in a rare disease clinical data warehouse.利用罕见病临床数据仓库中的叙述报告进行下一代表型分析。
Orphanet J Rare Dis. 2018 May 31;13(1):85. doi: 10.1186/s13023-018-0830-6.
10
A clinician friendly data warehouse oriented toward narrative reports: Dr. Warehouse.面向叙事报告的临床医生友好型数据仓库:Dr. Warehouse。
J Biomed Inform. 2018 Apr;80:52-63. doi: 10.1016/j.jbi.2018.02.019. Epub 2018 Mar 1.