• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用SPARQL测试格:在生物医学本体论质量保证中的应用。

Using SPARQL to Test for Lattices: application to quality assurance in biomedical ontologies.

作者信息

Zhang Guo-Qiang, Bodenreider Olivier

机构信息

Case Western Reserve University, Cleveland, OH 44106, USA.

National Library of Medicine, Bethesda, MD 20892, USA.

出版信息

Semant Web ISWC. 2010;6497:273-288. doi: 10.1007/978-3-642-17749-1_18.

DOI:10.1007/978-3-642-17749-1_18
PMID:25699294
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4330995/
Abstract

We present a scalable, SPARQL-based computational pipeline for testing the lattice-theoretic properties of partial orders represented as RDF triples. The use case for this work is quality assurance in biomedical ontologies, one desirable property of which is conformance to lattice structures. At the core of our pipeline is the algorithm called , for detecting the mber of nimal upper bounds of any pair of elements in a given finite partial order. Our technical contribution is the coding of completely in SPARQL. To show its scalability, we applied to the entirety of SNOMED CT, the largest clinical ontology (over 300,000 conepts). Our experimental results have been groundbreaking: for the first time, all non-lattice pairs in SNOMED CT have been identified exhaustively from 34 million candidate pairs using over 2.5 billion queries issued to Virtuoso. The percentage of non-lattice pairs ranges from 0 to 1.66 among the 19 SNOMED CT hierarchies. These non-lattice pairs represent target areas for focused curation by domain experts. RDF, SPARQL and related tooling provide an e cient platform for implementing lattice algorithms on large data structures.

摘要

我们提出了一种基于SPARQL的可扩展计算管道,用于测试表示为RDF三元组的偏序关系的格理论属性。这项工作的用例是生物医学本体中的质量保证,其中一个理想属性是符合格结构。我们管道的核心是名为 的算法,用于检测给定有限偏序中任意一对元素的最小上界数量。我们的技术贡献是将 完全用SPARQL编码。为了展示其可扩展性,我们将 应用于整个SNOMED CT,这是最大的临床本体(超过30万个概念)。我们的实验结果具有开创性:首次通过向Virtuoso发出超过25亿个查询,从3400万个候选对中详尽地识别出了SNOMED CT中的所有非格对。在19个SNOMED CT层次结构中,非格对的百分比范围从0到1.66。这些非格对代表了领域专家进行重点策划的目标区域。RDF、SPARQL和相关工具为在大型数据结构上实现格算法提供了一个高效的平台。

相似文献

1
Using SPARQL to Test for Lattices: application to quality assurance in biomedical ontologies.使用SPARQL测试格:在生物医学本体论质量保证中的应用。
Semant Web ISWC. 2010;6497:273-288. doi: 10.1007/978-3-642-17749-1_18.
2
Large-scale, Exhaustive Lattice-based Structural Auditing of SNOMED CT.基于格的SNOMED CT大规模详尽结构审核
AMIA Annu Symp Proc. 2010 Nov 13;2010:922-6.
3
An efficient, large-scale, non-lattice-detection algorithm for exhaustive structural auditing of biomedical ontologies.一种用于生物医学本体全面结构审核的高效、大规模、非格检测算法。
J Biomed Inform. 2018 Apr;80:106-119. doi: 10.1016/j.jbi.2018.03.004. Epub 2018 Mar 13.
4
Auditing SNOMED CT hierarchical relations based on lexical features of concepts in non-lattice subgraphs.基于非格子网中概念的词汇特征来审核 SNOMED CT 层次关系。
J Biomed Inform. 2018 Feb;78:177-184. doi: 10.1016/j.jbi.2017.12.010. Epub 2017 Dec 20.
5
Leveraging non-lattice subgraphs for suggestion of new concepts for SNOMED CT.利用非格状子图为医学系统命名法(SNOMED CT)的新概念提供建议。
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2021 Dec;2021:1805-1812. doi: 10.1109/bibm52615.2021.9669407.
6
Mining non-lattice subgraphs for detecting missing hierarchical relations and concepts in SNOMED CT.挖掘非格状子图以检测SNOMED CT中缺失的层次关系和概念。
J Am Med Inform Assoc. 2017 Jul 1;24(4):788-798. doi: 10.1093/jamia/ocw175.
7
MaPLE: A MapReduce Pipeline for Lattice-based Evaluation and Its Application to SNOMED CT.MaPLE:一种用于基于格的评估的MapReduce管道及其在SNOMED CT中的应用。
Proc IEEE Int Conf Big Data. 2014 Oct;2014:754-759. doi: 10.1109/BigData.2014.7004301.
8
BioBenchmark Toyama 2012: an evaluation of the performance of triple stores on biological data.2012年富山生物基准测试:三元组存储在生物数据上的性能评估
J Biomed Semantics. 2014 Jul 10;5:32. doi: 10.1186/2041-1480-5-32. eCollection 2014.
9
BioPortal as a Dataset of Linked Biomedical Ontologies and Terminologies in RDF.BioPortal作为RDF格式的链接生物医学本体和术语数据集。
Semant Web. 2013;4(3):277-284.
10
FAIR-compliant clinical, radiomics and DICOM metadata of RIDER, interobserver, Lung1 and head-Neck1 TCIA collections.符合 FAIR 原则的 RIDER、观察者间一致性、Lung1 和 head-Neck1 TCIA 数据集的临床、影像组学和 DICOM 元数据。
Med Phys. 2020 Nov;47(11):5931-5940. doi: 10.1002/mp.14322. Epub 2020 Jun 27.

引用本文的文献

1
Extending import detection algorithms for concept import from two to three biomedical terminologies.将概念导入的导入检测算法从两个扩展到三个生物医学术语。
BMC Med Inform Decis Mak. 2020 Dec 15;20(Suppl 10):272. doi: 10.1186/s12911-020-01290-z.
2
Spark-MCA: Large-scale, Exhaustive Formal Concept Analysis for Evaluating the Semantic Completeness of SNOMED CT.Spark-MCA:用于评估SNOMED CT语义完整性的大规模详尽形式概念分析
AMIA Annu Symp Proc. 2018 Apr 16;2017:1931-1940. eCollection 2017.
3
Quality Assurance of NCI Thesaurus by Mining Structural-Lexical Patterns.通过挖掘结构-词汇模式对美国国立癌症研究所叙词表进行质量保证
AMIA Annu Symp Proc. 2018 Apr 16;2017:364-373. eCollection 2017.
4
Mining non-lattice subgraphs for detecting missing hierarchical relations and concepts in SNOMED CT.挖掘非格状子图以检测SNOMED CT中缺失的层次关系和概念。
J Am Med Inform Assoc. 2017 Jul 1;24(4):788-798. doi: 10.1093/jamia/ocw175.
5
FEDRR: fast, exhaustive detection of redundant hierarchical relations for quality improvement of large biomedical ontologies.FEDRR:用于大型生物医学本体质量改进的冗余层次关系快速详尽检测
BioData Min. 2016 Oct 10;9:31. doi: 10.1186/s13040-016-0110-8. eCollection 2016.
6
COBE: A Conjunctive Ontology Browser and Explorer for Visualizing SNOMED CT Fragments.COBE:一种用于可视化SNOMED CT片段的联合本体浏览器和资源管理器。
AMIA Annu Symp Proc. 2015 Nov 5;2015:2092-100. eCollection 2015.
7
COHeRE: Cross-Ontology Hierarchical Relation Examination for Ontology Quality Assurance.COHeRE:用于本体质量保证的跨本体层次关系检查
AMIA Annu Symp Proc. 2015 Nov 5;2015:456-65. eCollection 2015.
8
NEO: Systematic Non-Lattice Embedding of Ontologies for Comparing the Subsumption Relationship in SNOMED CT and in FMA Using MapReduce.NEO:用于使用MapReduce比较SNOMED CT和FMA中包含关系的本体的系统非格嵌入
AMIA Jt Summits Transl Sci Proc. 2015 Mar 25;2015:216-20. eCollection 2015.
9
MaPLE: A MapReduce Pipeline for Lattice-based Evaluation and Its Application to SNOMED CT.MaPLE:一种用于基于格的评估的MapReduce管道及其在SNOMED CT中的应用。
Proc IEEE Int Conf Big Data. 2014 Oct;2014:754-759. doi: 10.1109/BigData.2014.7004301.
10
An analysis of FMA using structural self-bisimilarity.利用结构自相似性分析 FMA。
J Biomed Inform. 2013 Jun;46(3):497-505. doi: 10.1016/j.jbi.2013.03.005. Epub 2013 Apr 2.

本文引用的文献

1
Large-scale, Exhaustive Lattice-based Structural Auditing of SNOMED CT.基于格的SNOMED CT大规模详尽结构审核
AMIA Annu Symp Proc. 2010 Nov 13;2010:922-6.
2
Using the abstraction network in complement to description logics for quality assurance in biomedical terminologies - a case study in SNOMED CT.利用抽象网络辅助描述逻辑进行生物医学术语的质量保证——以SNOMED CT为例
Stud Health Technol Inform. 2010;160(Pt 2):1070-4.
3
A review of auditing methods applied to the content of controlled biomedical terminologies.对应用于受控生物医学术语内容的审核方法的综述。
J Biomed Inform. 2009 Jun;42(3):413-25. doi: 10.1016/j.jbi.2009.03.003. Epub 2009 Mar 12.
4
Auditing the semantic completeness of SNOMED CT using formal concept analysis.使用形式概念分析审核SNOMED CT的语义完整性。
J Am Med Inform Assoc. 2009 Jan-Feb;16(1):89-102. doi: 10.1197/jamia.M2541. Epub 2008 Oct 24.
5
Structural methodologies for auditing SNOMED.用于审核SNOMED的结构化方法。
J Biomed Inform. 2007 Oct;40(5):561-81. doi: 10.1016/j.jbi.2006.12.003. Epub 2006 Dec 24.
6
SNOMED-CT: The advanced terminology and coding system for eHealth.SNOMED-CT:电子健康的先进术语和编码系统。
Stud Health Technol Inform. 2006;121:279-90.
7
Issues in the structuring and acquisition of an ontology for medical language understanding.医学语言理解本体构建与获取中的问题
Methods Inf Med. 1995 Mar;34(1-2):15-24.