• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

自动透明度评估用于开放知识提取系统。

Automatic transparency evaluation for open knowledge extraction systems.

机构信息

School of Computing, Dublin City University, Dublin, Ireland.

ADAPT Centre, School of Computer Science, University College Dublin, Dublin, Ireland.

出版信息

J Biomed Semantics. 2023 Aug 31;14(1):12. doi: 10.1186/s13326-023-00293-9.

DOI:10.1186/s13326-023-00293-9
PMID:37653549
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10468861/
Abstract

BACKGROUND

This paper proposes Cyrus, a new transparency evaluation framework, for Open Knowledge Extraction (OKE) systems. Cyrus is based on the state-of-the-art transparency models and linked data quality assessment dimensions. It brings together a comprehensive view of transparency dimensions for OKE systems. The Cyrus framework is used to evaluate the transparency of three linked datasets, which are built from the same corpus by three state-of-the-art OKE systems. The evaluation is automatically performed using a combination of three state-of-the-art FAIRness (Findability, Accessibility, Interoperability, Reusability) assessment tools and a linked data quality evaluation framework, called Luzzu. This evaluation includes six Cyrus data transparency dimensions for which existing assessment tools could be identified. OKE systems extract structured knowledge from unstructured or semi-structured text in the form of linked data. These systems are fundamental components of advanced knowledge services. However, due to the lack of a transparency framework for OKE, most OKE systems are not transparent. This means that their processes and outcomes are not understandable and interpretable. A comprehensive framework sheds light on different aspects of transparency, allows comparison between the transparency of different systems by supporting the development of transparency scores, gives insight into the transparency weaknesses of the system, and ways to improve them. Automatic transparency evaluation helps with scalability and facilitates transparency assessment. The transparency problem has been identified as critical by the European Union Trustworthy Artificial Intelligence (AI) guidelines. In this paper, Cyrus provides the first comprehensive view of transparency dimensions for OKE systems by merging the perspectives of the FAccT (Fairness, Accountability, and Transparency), FAIR, and linked data quality research communities.

RESULTS

In Cyrus, data transparency includes ten dimensions which are grouped in two categories. In this paper, six of these dimensions, i.e., provenance, interpretability, understandability, licensing, availability, interlinking have been evaluated automatically for three state-of-the-art OKE systems, using the state-of-the-art metrics and tools. Covid-on-the-Web is identified to have the highest mean transparency.

CONCLUSIONS

This is the first research to study the transparency of OKE systems that provides a comprehensive set of transparency dimensions spanning ethics, trustworthy AI, and data quality approaches to transparency. It also demonstrates how to perform automated transparency evaluation that combines existing FAIRness and linked data quality assessment tools for the first time. We show that state-of-the-art OKE systems vary in the transparency of the linked data generated and that these differences can be automatically quantified leading to potential applications in trustworthy AI, compliance, data protection, data governance, and future OKE system design and testing.

摘要

背景

本文提出了 Cyrus,这是一个用于开放知识抽取(OKE)系统的新透明度评估框架。Cyrus 基于最先进的透明度模型和关联数据质量评估维度。它汇集了 OKE 系统透明度维度的综合视图。Cyrus 框架用于评估三个关联数据集的透明度,这些数据集由三个最先进的 OKE 系统从同一个语料库构建而成。评估是使用三种最先进的 FAIRness(可发现性、可访问性、互操作性、可重用性)评估工具和一个称为 Luzzu 的关联数据质量评估框架的组合自动执行的。这项评估包括六个 Cyrus 数据透明度维度,对于这些维度,可以确定现有的评估工具。OKE 系统以关联数据的形式从非结构化或半结构化文本中提取结构化知识。这些系统是高级知识服务的基本组成部分。但是,由于缺乏 OKE 的透明度框架,大多数 OKE 系统都不透明。这意味着它们的过程和结果不可理解和可解释。一个全面的框架揭示了透明度的不同方面,通过支持透明度分数的开发,可以比较不同系统的透明度,深入了解系统的透明度弱点以及改进它们的方法。自动透明度评估有助于可扩展性并促进透明度评估。透明度问题已被欧洲联盟可信人工智能 (AI) 准则确定为关键问题。在本文中,Cyrus 通过合并 FAccT(公平、问责制和透明度)、FAIR 和关联数据质量研究社区的观点,为 OKE 系统提供了透明度维度的全面视图。

结果

在 Cyrus 中,数据透明度包括十个维度,分为两类。在本文中,使用最先进的指标和工具,自动评估了这三个最先进的 OKE 系统的六个维度,即来源、可解释性、可理解性、许可、可用性和互链接。Covid-on-the-Web 被确定为具有最高平均透明度。

结论

这是第一项研究 OKE 系统透明度的研究,它提供了一套全面的透明度维度,涵盖了道德、可信 AI 和数据质量方法的透明度。它还演示了如何首次结合现有的 FAIRness 和关联数据质量评估工具进行自动透明度评估。我们表明,最先进的 OKE 系统在生成的关联数据的透明度上存在差异,并且这些差异可以自动量化,从而为可信 AI、合规性、数据保护、数据治理以及未来的 OKE 系统设计和测试中的潜在应用提供了可能性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29ef/10468861/f64c4a1c9b6b/13326_2023_293_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29ef/10468861/6bdf3b57b82b/13326_2023_293_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29ef/10468861/f64c4a1c9b6b/13326_2023_293_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29ef/10468861/6bdf3b57b82b/13326_2023_293_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29ef/10468861/f64c4a1c9b6b/13326_2023_293_Fig2_HTML.jpg

相似文献

1
Automatic transparency evaluation for open knowledge extraction systems.自动透明度评估用于开放知识提取系统。
J Biomed Semantics. 2023 Aug 31;14(1):12. doi: 10.1186/s13326-023-00293-9.
2
Toward Fairness, Accountability, Transparency, and Ethics in AI for Social Media and Health Care: Scoping Review.迈向社交媒体和医疗保健领域人工智能的公平性、问责制、透明度和伦理:范围审查
JMIR Med Inform. 2024 Apr 3;12:e50048. doi: 10.2196/50048.
3
Explainable AI for Bioinformatics: Methods, Tools and Applications.可解释人工智能在生物信息学中的应用:方法、工具与应用。
Brief Bioinform. 2023 Sep 20;24(5). doi: 10.1093/bib/bbad236.
4
Can I trust my fake data - A comprehensive quality assessment framework for synthetic tabular data in healthcare.我能相信我的虚假数据吗——医疗保健中合成表格数据的综合质量评估框架。
Int J Med Inform. 2024 May;185:105413. doi: 10.1016/j.ijmedinf.2024.105413. Epub 2024 Mar 12.
5
FAIR-Checker: supporting digital resource findability and reuse with Knowledge Graphs and Semantic Web standards.FAIR-Checker:利用知识图谱和语义 Web 标准支持数字资源的可发现性和再利用。
J Biomed Semantics. 2023 Jul 1;14(1):7. doi: 10.1186/s13326-023-00289-5.
6
The role of explainability in creating trustworthy artificial intelligence for health care: A comprehensive survey of the terminology, design choices, and evaluation strategies.可解释性在医疗保健人工智能可信性构建中的作用:术语、设计选择和评估策略的全面调查。
J Biomed Inform. 2021 Jan;113:103655. doi: 10.1016/j.jbi.2020.103655. Epub 2020 Dec 10.
7
Are current clinical studies on artificial intelligence-based medical devices comprehensive enough to support a full health technology assessment? A systematic review.基于人工智能的医疗器械的当前临床研究是否足够全面,足以支持全面的健康技术评估?系统评价。
Artif Intell Med. 2023 Jun;140:102547. doi: 10.1016/j.artmed.2023.102547. Epub 2023 Apr 23.
8
A scalable and transparent data pipeline for AI-enabled health data ecosystems.用于支持人工智能的健康数据生态系统的可扩展且透明的数据管道。
Front Med (Lausanne). 2024 Jul 30;11:1393123. doi: 10.3389/fmed.2024.1393123. eCollection 2024.
9
Risk management frameworks for human health and environmental risks.人类健康与环境风险的风险管理框架。
J Toxicol Environ Health B Crit Rev. 2003 Nov-Dec;6(6):569-720. doi: 10.1080/10937400390208608.
10
Towards Transparency by Design for Artificial Intelligence.人工智能设计透明化。
Sci Eng Ethics. 2020 Dec;26(6):3333-3361. doi: 10.1007/s11948-020-00276-4. Epub 2020 Nov 16.

本文引用的文献

1
FAIR-Checker: supporting digital resource findability and reuse with Knowledge Graphs and Semantic Web standards.FAIR-Checker:利用知识图谱和语义 Web 标准支持数字资源的可发现性和再利用。
J Biomed Semantics. 2023 Jul 1;14(1):7. doi: 10.1186/s13326-023-00289-5.
2
Stop Explaining Black Box Machine Learning Models for High Stakes Decisions and Use Interpretable Models Instead.停止为高风险决策解释黑箱机器学习模型,转而使用可解释模型。
Nat Mach Intell. 2019 May;1(5):206-215. doi: 10.1038/s42256-019-0048-x. Epub 2019 May 13.
3
Transparency assessment of COVID-19 models.
新型冠状病毒肺炎模型的透明度评估。
Lancet Glob Health. 2020 Dec;8(12):e1459-e1460. doi: 10.1016/S2214-109X(20)30447-2. Epub 2020 Oct 27.
4
Transparency and reproducibility in artificial intelligence.人工智能中的透明度和可重复性。
Nature. 2020 Oct;586(7829):E14-E16. doi: 10.1038/s41586-020-2766-y. Epub 2020 Oct 14.
5
An empirical assessment of transparency and reproducibility-related research practices in the social sciences (2014-2017).社会科学中与透明度和可重复性相关的研究实践的实证评估(2014 - 2017年)
R Soc Open Sci. 2020 Feb 19;7(2):190806. doi: 10.1098/rsos.190806. eCollection 2020 Feb.
6
A governance model for the application of AI in health care.人工智能在医疗保健领域应用的治理模型。
J Am Med Inform Assoc. 2020 Mar 1;27(3):491-497. doi: 10.1093/jamia/ocz192.
7
Developing a FHIR-based EHR phenotyping framework: A case study for identification of patients with obesity and multiple comorbidities from discharge summaries.基于 FHIR 的电子健康记录表型框架的开发:以从出院小结中识别肥胖且伴有多种合并症的患者为例。
J Biomed Inform. 2019 Nov;99:103310. doi: 10.1016/j.jbi.2019.103310. Epub 2019 Oct 14.
8
PubTator central: automated concept annotation for biomedical full text articles.PubTator 中心:用于生物医学全文文章的自动概念标注。
Nucleic Acids Res. 2019 Jul 2;47(W1):W587-W593. doi: 10.1093/nar/gkz389.
9
Reproducible research practices, transparency, and open access data in the biomedical literature, 2015-2017.2015-2017 年生物医学文献中的可重复性研究实践、透明度和开放获取数据。
PLoS Biol. 2018 Nov 20;16(11):e2006930. doi: 10.1371/journal.pbio.2006930. eCollection 2018 Nov.
10
Opinion: The dangers of faulty, biased, or malicious algorithms requires independent oversight.观点:有缺陷、有偏见或恶意的算法存在危险,需要独立监督。
Proc Natl Acad Sci U S A. 2016 Nov 29;113(48):13538-13540. doi: 10.1073/pnas.1618211113. Epub 2016 Nov 23.