Daraqel Baraa, Owayda Amer, Khan Haris, Koletsi Despina, Mheissen Samer
Department of Orthodontics, Oral Health Research and Promotion Unit, Al-Quds University, Jerusalem, Palestine.
Private practice, Harmony Medical Group, Abu Dhabi, United Arab Emirates.
J Dent. 2025 Sep;160:105846. doi: 10.1016/j.jdent.2025.105846. Epub 2025 May 29.
Data extraction is a time-consuming and error-prone step of systematic reviews.
This study aimed to evaluate the agreement between artificial intelligence (AI)-based and human data extraction methods.
Studies published in seven orthodontic journals between 2019 and 2024 were retrieved and included. Fifteen data sets were extracted from each study, both manually and using the Microsoft Bing AI-based tool, by two independent reviewers. Files in Portable Document Format were uploaded to the AI-based tool, and specific data were requested through its chat feature. The association between the data extraction methods and study characteristics was examined, and agreement was evaluated using intraclass correlation and kappa statistics.
A total of 300 orthodontic studies were included. Human and AI-based data extraction differed slightly for publication years and study designs, though these differences were not statistically significant. Minor, non-significant inconsistencies were also found in extracting the number of trial arms and the mean age of participants per group. The AI-based tool was less effective at extracting variables related to the study design (P = 0.017) and the number of centers (P < 0.001). Agreement between human and AI-based extraction ranged from slight (0.16) for the type of study design and moderate (0.45) for study design classification to substantial or perfect (0.65-1.00) for most other variables.
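As a minimal sketch of the agreement statistic reported here, the following pure-Python helper computes Cohen's kappa for two raters and maps the value to the Landis and Koch verbal labels (slight, fair, moderate, substantial, almost perfect) that the thresholds in this abstract follow. The function names are illustrative, not from the study, and the intraclass correlation used for continuous variables is a separate calculation not shown.

```python
from collections import Counter

def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa: chance-corrected agreement between two raters
    who categorized the same items (e.g. human vs. AI extraction)."""
    assert len(rater_a) == len(rater_b) and rater_a
    n = len(rater_a)
    # Observed proportion of items on which the raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Expected agreement by chance, from each rater's marginal frequencies.
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    expected = sum(freq_a[c] * freq_b.get(c, 0) for c in freq_a) / n ** 2
    return (observed - expected) / (1 - expected)

def landis_koch_label(kappa):
    """Landis and Koch (1977) interpretation bands for kappa."""
    for cutoff, label in [(0.20, "slight"), (0.40, "fair"),
                          (0.60, "moderate"), (0.80, "substantial")]:
        if kappa <= cutoff:
            return label
    return "almost perfect"

# Toy example: two raters classifying four study designs.
k = cohens_kappa(["RCT", "RCT", "cohort", "cohort"],
                 ["RCT", "cohort", "cohort", "cohort"])
print(k, landis_koch_label(k))  # kappa = 0.5, i.e. "moderate"
```

Under this labeling, the abstract's reported values of 0.16, 0.45, and 0.65-1.00 correspond to slight, moderate, and substantial-to-almost-perfect agreement, respectively.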
AI-based data extraction, while effective for straightforward variables, is not fully reliable for complex data extraction. Human input remains essential for ensuring accuracy and completeness in systematic reviews.
AI-based tools can effectively extract straightforward data, potentially reducing the time and effort required for systematic reviews and helping clinicians and researchers process large volumes of data more efficiently. However, human supervision remains essential to maintain the integrity and reliability of clinical evidence.