
Data Quality and Data Quantity: Complements or Contradictions?

Affiliation

University Duisburg-Essen, Faculty of Medicine, IMIBE, Essen, Germany.

Publication information

Stud Health Technol Inform. 2023 Jun 29;305:24-27. doi: 10.3233/SHTI230414.

DOI:10.3233/SHTI230414
PMID:37386948
Abstract

Although data quality is well defined, its relationship to data quantity remains unclear. The big data approach in particular promises that volume offers advantages over small samples of good quality. The aim of this study was to review this issue. Based on experience with six registries within a German funding initiative, the definition of data quality provided by the International Organization for Standardization (ISO) was set against several aspects of data quantity. The results of a literature search combining both concepts were also considered. Data quantity was identified as an umbrella term for some inherent characteristics of data, such as case completeness and data completeness. At the same time, quantity could be regarded as a non-inherent characteristic of data beyond the ISO standard, focusing on the breadth and depth of metadata, i.e. data elements along with their value sets. The FAIR Guiding Principles take only the latter into account. Surprisingly, the literature agreed in demanding an increase in data quality with volume, turning the big data approach inside out. The use of data without context, as may be the case in data mining or machine learning, is covered neither by the concept of data quality nor by that of data quantity.

Similar articles

1
Data Quality and Data Quantity: Complements or Contradictions?
Stud Health Technol Inform. 2023 Jun 29;305:24-27. doi: 10.3233/SHTI230414.
2
FAIR and Quality Assured Data - The Use Case of Trueness.
Stud Health Technol Inform. 2022 Jan 14;289:25-28. doi: 10.3233/SHTI210850.
3
[Cross-Project Support of Registries in Development, Implementation and Operation].
Gesundheitswesen. 2021 Nov;83(S 01):S54-S59. doi: 10.1055/a-1537-9324. Epub 2021 Nov 3.
4
Metadata Definition in Registries: What Is a Data Element?
Stud Health Technol Inform. 2022 May 25;294:174-178. doi: 10.3233/SHTI220432.
5
Metadata of Registries: Results from an Initiative in Health Services Research.
Stud Health Technol Inform. 2021 May 27;281:18-22. doi: 10.3233/SHTI210112.
6
Pragmatic MDR: a metadata repository with bottom-up standardization of medical metadata through reuse.
BMC Med Inform Decis Mak. 2021 May 17;21(1):160. doi: 10.1186/s12911-021-01524-8.
7
Bridging Documentation and Metadata Standards: Experiences from a Funding Initiative for Registries.
Stud Health Technol Inform. 2019 Aug 21;264:1046-1050. doi: 10.3233/SHTI190384.
8
Recommended data elements for health registries: a survey from a German funding initiative.
BMC Med Inform Decis Mak. 2024 May 27;24(1):136. doi: 10.1186/s12911-024-02535-x.
9
A Survey of Biological Data in a Big Data Perspective.
Big Data. 2022 Aug;10(4):279-297. doi: 10.1089/big.2020.0383. Epub 2022 Apr 7.
10
Big Data and medicine: a big deal?
J Intern Med. 2018 May;283(5):418-429. doi: 10.1111/joim.12721. Epub 2018 Jan 8.

Cited by

1
Detection and classification of supraspinatus pathologies on shoulder magnetic resonance images using a code-free deep learning application.
Asia Pac J Sports Med Arthrosc Rehabil Technol. 2025 May 5;42:1-7. doi: 10.1016/j.asmart.2025.04.005. eCollection 2025 Oct.