• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

将术语整合到标准SQL中:一种用于常规数据研究的新方法。

Integrating terminologies into standard SQL: a new approach for research on routine data.

作者信息

Sander André, Wauer Roland

机构信息

ID GmbH & Co. KGaA, Platz vor dem Neuen Tor 2, 10115, Berlin, Germany.

Klinik für Neonatologie, Charité-Universitätsmedizin Berlin, 10098, Berlin, Germany.

出版信息

J Biomed Semantics. 2019 Apr 24;10(1):7. doi: 10.1186/s13326-019-0199-z.

DOI:10.1186/s13326-019-0199-z
PMID:31014403
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6480592/
Abstract

BACKGROUND

Most electronic medical records still contain large amounts of free-text data. Semantic evaluation of such data requires the data to be encoded with sufficient classifications or transformed into a knowledge-based database.

METHODS

We present an approach that allows databases accessible via SQL (Structured Query Language) to be searched directly through semantic queries without the need for further transformations. Therefore, we developed I) an extension to SQL named Ontology-SQL (O-SQL) that allows to use semantic expressions, II) a framework that uses a standard terminology server to annotate free-text containing database tables and III) a parser that rewrites O-SQL to SQL, so that such queries can be passed to the database server.

RESULTS

I) We compared several semantic queries published to date and were able to reproduce them in a reduced, highly condensed form. II) The quality of the annotation process was measured against manual annotation, and we found a sensitivity of 97.62% and a specificity of 100.00%. III) Different semantic queries were analyzed, and measured with F-scores between 0.91 and 0.98.

CONCLUSIONS

We showed that systematic analysis of free-text-containing medical records is possible with standard tools. The seamless connection of ontologies and standard technologies from the database field represents an important constituent of unstructured data analysis. The developed technology can be readily applied to relationally organized data and supports the increasingly important field of translational research.

摘要

背景

大多数电子病历仍包含大量自由文本数据。对此类数据进行语义评估需要用足够的分类对数据进行编码或将其转换为基于知识的数据库。

方法

我们提出了一种方法,可直接通过语义查询搜索可通过SQL(结构化查询语言)访问的数据库,而无需进一步转换。因此,我们开发了:I)SQL的一个扩展,名为本体SQL(O-SQL),它允许使用语义表达式;II)一个框架,该框架使用标准术语服务器对包含数据库表的自由文本进行注释;III)一个解析器,将O-SQL重写为SQL,以便此类查询可以传递到数据库服务器。

结果

I)我们比较了迄今发布的几个语义查询,并能够以简化的高度浓缩形式重现它们。II)根据人工注释测量注释过程的质量,我们发现灵敏度为97.62%,特异性为100.00%。III)分析了不同的语义查询,F分数在0.91至0.98之间。

结论

我们表明,使用标准工具对包含自由文本的病历进行系统分析是可行的。本体与数据库领域标准技术的无缝连接是非结构化数据分析的重要组成部分。所开发的技术可以很容易地应用于关系型组织的数据,并支持日益重要的转化研究领域。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/593c/6480592/1954fdec0df7/13326_2019_199_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/593c/6480592/fda525bb7425/13326_2019_199_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/593c/6480592/f63aee5563fa/13326_2019_199_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/593c/6480592/a9af73df4d19/13326_2019_199_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/593c/6480592/1954fdec0df7/13326_2019_199_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/593c/6480592/fda525bb7425/13326_2019_199_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/593c/6480592/f63aee5563fa/13326_2019_199_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/593c/6480592/a9af73df4d19/13326_2019_199_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/593c/6480592/1954fdec0df7/13326_2019_199_Fig4_HTML.jpg

相似文献

1
Integrating terminologies into standard SQL: a new approach for research on routine data.将术语整合到标准SQL中:一种用于常规数据研究的新方法。
J Biomed Semantics. 2019 Apr 24;10(1):7. doi: 10.1186/s13326-019-0199-z.
2
Enabling Ontology Based Semantic Queries in Biomedical Database Systems.在生物医学数据库系统中实现基于本体的语义查询。
Int J Semant Comput. 2014 Mar;8(1):67-83. doi: 10.1142/S1793351X14500032.
3
SIFR annotator: ontology-based semantic annotation of French biomedical text and clinical notes.SIFR 标注器:基于本体论的法语生物医学文本和临床笔记的语义标注。
BMC Bioinformatics. 2018 Nov 6;19(1):405. doi: 10.1186/s12859-018-2429-2.
4
SEMCARE: Multilingual Semantic Search in Semi-Structured Clinical Data.SEMCARE:半结构化临床数据中的多语言语义搜索
Stud Health Technol Inform. 2016;223:93-9.
5
Creating personalised clinical pathways by semantic interoperability with electronic health records.通过与电子健康记录的语义互操作性创建个性化临床路径。
Artif Intell Med. 2013 Jun;58(2):81-9. doi: 10.1016/j.artmed.2013.02.005. Epub 2013 Mar 5.
6
Constructing a Graph Database for Semantic Literature-Based Discovery.构建用于基于语义文献发现的图形数据库。
Stud Health Technol Inform. 2015;216:1094.
7
A Formalization of SQL with Nulls.带空值的SQL形式化
J Autom Reason. 2022;66(4):989-1030. doi: 10.1007/s10817-022-09632-4. Epub 2022 Jul 27.
8
An ontological approach for the exploitation of clinical data.一种用于临床数据利用的本体论方法。
Stud Health Technol Inform. 2013;192:142-6.
9
Querying archetype-based EHRs by search ontology-based XPath engineering.通过基于搜索本体的XPath工程查询基于原型的电子健康记录。
J Biomed Semantics. 2018 May 11;9(1):16. doi: 10.1186/s13326-018-0180-2.
10
Exploiting the semantic graph for the representation and retrieval of medical documents.利用语义图进行医学文献的表示和检索。
Comput Biol Med. 2018 Oct 1;101:39-50. doi: 10.1016/j.compbiomed.2018.08.009. Epub 2018 Aug 7.

引用本文的文献

1
ND-AMD: A Web-Based Database for Animal Models of Neurological Disease With Analysis Tools.ND-AMD:一个基于网络的神经疾病动物模型数据库及分析工具。
CNS Neurosci Ther. 2025 May;31(5):e70411. doi: 10.1111/cns.70411.
2
Mechanism of Luteolin in the Treatment of Primary Sjogren's Syndrome: a Study Based on Systems Biology and Cell Experiments.木犀草素治疗原发性干燥综合征的机制:基于系统生物学和细胞实验的研究
ACS Omega. 2025 Apr 15;10(16):16339-16354. doi: 10.1021/acsomega.4c10653. eCollection 2025 Apr 29.
3
Medical Information Mining-Based Visual Artificial Intelligence Emergency Nursing Management System.

本文引用的文献

1
From single-case analysis of neonatal deaths toward a further reduction of the neonatal mortality rate.从新生儿死亡的单病例分析到进一步降低新生儿死亡率
J Perinat Med. 2018 Dec 19;47(1):125-133. doi: 10.1515/jpm-2018-0003.
2
Text Mining the History of Medicine.挖掘医学史
PLoS One. 2016 Jan 6;11(1):e0144717. doi: 10.1371/journal.pone.0144717. eCollection 2016.
3
Fine-grained information extraction from German transthoracic echocardiography reports.从德国经胸超声心动图报告中提取细粒度信息。
基于医学信息挖掘的可视化人工智能急诊护理管理系统。
J Healthc Eng. 2021 Nov 25;2021:4253606. doi: 10.1155/2021/4253606. eCollection 2021.
4
Crowd-Sourced Chemistry: Considerations for Building a Standardized Database to Improve Omic Analyses.众包化学:构建标准化数据库以改进组学分析的考量因素
ACS Omega. 2020 Jan 9;5(2):980-985. doi: 10.1021/acsomega.9b03708. eCollection 2020 Jan 21.
5
Knowledge-based best of breed approach for automated detection of clinical events based on German free text digital hospital discharge letters.基于知识的最佳实践方法,用于基于德语自由文本数字出院记录自动检测临床事件。
PLoS One. 2019 Nov 27;14(11):e0224916. doi: 10.1371/journal.pone.0224916. eCollection 2019.
BMC Med Inform Decis Mak. 2015 Nov 12;15:91. doi: 10.1186/s12911-015-0215-x.
4
Cohort Discovery Query Optimization via Computable Controlled Vocabulary Versioning.通过可计算控制词汇表版本控制进行队列发现查询优化
Stud Health Technol Inform. 2015;216:1084.
5
Semantic enrichment of longitudinal clinical study data using the CDISC standards and the semantic statistics vocabularies.使用CDISC标准和语义统计词汇对纵向临床研究数据进行语义丰富。
J Biomed Semantics. 2015 Apr 9;6:16. doi: 10.1186/s13326-015-0012-6. eCollection 2015.
6
Enabling Ontology Based Semantic Queries in Biomedical Database Systems.在生物医学数据库系统中实现基于本体的语义查询。
Int J Semant Comput. 2014 Mar;8(1):67-83. doi: 10.1142/S1793351X14500032.
7
Automated mapping of clinical terms into SNOMED-CT. An application to codify procedures in pathology.临床术语到SNOMED-CT的自动映射。一种用于病理程序编码的应用。
J Med Syst. 2014 Oct;38(10):134. doi: 10.1007/s10916-014-0134-x. Epub 2014 Sep 2.
8
An adaptable architecture for patient cohort identification from diverse data sources.一种适用于从多种数据源中识别患者队列的可适应架构。
J Am Med Inform Assoc. 2013 Dec;20(e2):e327-33. doi: 10.1136/amiajnl-2013-001858. Epub 2013 Sep 24.
9
An ontological approach for the exploitation of clinical data.一种用于临床数据利用的本体论方法。
Stud Health Technol Inform. 2013;192:142-6.
10
Automated identification of drug and food allergies entered using non-standard terminology.自动识别使用非标准术语输入的药物和食物过敏。
J Am Med Inform Assoc. 2013 Sep-Oct;20(5):962-8. doi: 10.1136/amiajnl-2013-001756. Epub 2013 Jun 7.