• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多样性与包容性:开放数据隐藏的额外益处。

Diversity and inclusion: A hidden additional benefit of Open Data.

作者信息

Charpignon Marie-Laure, Celi Leo Anthony, Cobanaj Marisa, Eber Rene, Fiske Amelia, Gallifant Jack, Li Chenyu, Lingamallu Gurucharan, Petushkov Anton, Pierce Robin

机构信息

Institute for Data, Systems, and Society, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America.

Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America.

出版信息

PLOS Digit Health. 2024 Jul 23;3(7):e0000486. doi: 10.1371/journal.pdig.0000486. eCollection 2024 Jul.

DOI:10.1371/journal.pdig.0000486
PMID:39042705
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11265679/
Abstract

The recent imperative by the National Institutes of Health to share scientific data publicly underscores a significant shift in academic research. Effective as of January 2023, it emphasizes that transparency in data collection and dedicated efforts towards data sharing are prerequisites for translational research, from the lab to the bedside. Given the role of data access in mitigating potential bias in clinical models, we hypothesize that researchers who leverage open-access datasets rather than privately-owned ones are more diverse. In this brief report, we proposed to test this hypothesis in the transdisciplinary and expanding field of artificial intelligence (AI) for critical care. Specifically, we compared the diversity among authors of publications leveraging open datasets, such as the commonly used MIMIC and eICU databases, with that among authors of publications relying exclusively on private datasets, unavailable to other research investigators (e.g., electronic health records from ICU patients accessible only to Mayo Clinic analysts). To measure the extent of author diversity, we characterized gender balance as well as the presence of researchers from low- and middle-income countries (LMIC) and minority-serving institutions (MSI) located in the United States (US). Our comparative analysis revealed a greater contribution of authors from LMICs and MSIs among researchers leveraging open critical care datasets (treatment group) than among those relying exclusively on private data resources (control group). The participation of women was similar between the two groups, albeit slightly larger in the former. Notably, although over 70% of all articles included at least one author inferred to be a woman, less than 25% had a woman as a first or last author. Importantly, we found that the proportion of authors from LMICs was substantially higher in the treatment than in the control group (10.1% vs. 6.2%, p<0.001), including as first and last authors. Moreover, we found that the proportion of US-based authors affiliated with a MSI was 1.5 times higher among articles in the treatment than in the control group, suggesting that open data resources attract a larger pool of participants from minority groups (8.6% vs. 5.6%, p<0.001). Thus, our study highlights the valuable contribution of the Open Data strategy to underrepresented groups, while also quantifying persisting gender gaps in academic and clinical research at the intersection of computer science and healthcare. In doing so, we hope our work points to the importance of extending open data practices in deliberate and systematic ways.

摘要

美国国立卫生研究院最近要求公开分享科学数据,这突显了学术研究的重大转变。自2023年1月起生效,该要求强调数据收集的透明度以及为数据共享所做的专门努力是从实验室到临床的转化研究的先决条件。鉴于数据获取在减轻临床模型中潜在偏差方面的作用,我们假设利用开放获取数据集而非私有数据集的研究人员更加多样化。在本简要报告中,我们提议在跨学科且不断发展的重症监护人工智能(AI)领域验证这一假设。具体而言,我们比较了利用开放数据集(如常用的MIMIC和eICU数据库)的出版物作者与仅依赖其他研究人员无法获取的私有数据集(例如仅梅奥诊所分析师可访问的ICU患者电子健康记录)的出版物作者之间的多样性。为了衡量作者多样性的程度,我们刻画了性别平衡以及来自低收入和中等收入国家(LMIC)的研究人员以及美国少数族裔服务机构(MSI)的研究人员的情况。我们的比较分析显示,与仅依赖私有数据资源的研究人员(对照组)相比,利用开放重症监护数据集的研究人员(治疗组)中来自LMIC和MSI的作者贡献更大。两组中女性的参与度相似,尽管前者略高。值得注意的是,尽管所有文章中超过70%至少有一位被推断为女性的作者,但不到25%的文章以女性为第一或最后作者。重要的是,我们发现治疗组中来自LMIC的作者比例显著高于对照组(10.1%对6.2%,p<0.001),包括第一和最后作者。此外,我们发现治疗组文章中隶属于MSI的美国作者比例比对照组高1.5倍,这表明开放数据资源吸引了更多来自少数群体的参与者(8.6%对5.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb51/11265679/1a2f91ef51d8/pdig.0000486.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb51/11265679/1a2f91ef51d8/pdig.0000486.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb51/11265679/1a2f91ef51d8/pdig.0000486.g001.jpg

相似文献

1
Diversity and inclusion: A hidden additional benefit of Open Data.多样性与包容性:开放数据隐藏的额外益处。
PLOS Digit Health. 2024 Jul 23;3(7):e0000486. doi: 10.1371/journal.pdig.0000486. eCollection 2024 Jul.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
4
Does diversity beget diversity? A scientometric analysis of over 150,000 studies and 49,000 authors published in high-impact medical journals between 2007 and 2022.多样性会催生多样性吗?对2007年至2022年间发表在高影响力医学期刊上的15万多项研究和4.9万多名作者的科学计量分析。
medRxiv. 2024 Mar 22:2024.03.21.24304695. doi: 10.1101/2024.03.21.24304695.
5
Authorship of Publications Supported by NCI-Funded Grants Involving Low- and Middle-Income Countries.由 NCI 资助的涉及中低收入国家的出版物的作者身份。
JAMA Netw Open. 2024 Mar 4;7(3):e243215. doi: 10.1001/jamanetworkopen.2024.3215.
6
Public sector reforms and their impact on the level of corruption: A systematic review.公共部门改革及其对腐败程度的影响:一项系统综述。
Campbell Syst Rev. 2021 May 24;17(2):e1173. doi: 10.1002/cl2.1173. eCollection 2021 Jun.
7
Local and foreign authorship of maternal health interventional research in low- and middle-income countries: systematic mapping of publications 2000-2012.低收入和中等收入国家孕产妇健康干预研究的国内外作者情况:2000 - 2012年出版物的系统梳理
Global Health. 2016 Jun 23;12(1):35. doi: 10.1186/s12992-016-0172-x.
8
Community-level interventions for improving access to food in low- and middle-income countries.在低收入和中等收入国家改善食品获取途径的社区层面干预措施。
Cochrane Database Syst Rev. 2020 Jul 28;7(7):CD011504. doi: 10.1002/14651858.CD011504.pub2.
9
Community-level interventions for improving access to food in low- and middle-income countries.中低收入国家改善粮食获取途径的社区层面干预措施。
Cochrane Database Syst Rev. 2020 Aug 5;8(8):CD011504. doi: 10.1002/14651858.CD011504.pub3.
10
[The different models of scientific journals].[科学期刊的不同模式]
Med Trop Sante Int. 2023 Dec 8;3(4). doi: 10.48327/mtsi.v3i4.2023.454. eCollection 2023 Dec 31.

引用本文的文献

1
Evaluation of machine learning methods for prediction of heart failure mortality and readmission: meta-analysis.用于预测心力衰竭死亡率和再入院的机器学习方法评估:荟萃分析
BMC Cardiovasc Disord. 2025 Apr 7;25(1):264. doi: 10.1186/s12872-025-04700-0.
2
Introducing the Team Card: Enhancing governance for medical Artificial Intelligence (AI) systems in the age of complexity.推出团队卡片:在复杂时代加强对医学人工智能(AI)系统的治理。
PLOS Digit Health. 2025 Mar 4;4(3):e0000495. doi: 10.1371/journal.pdig.0000495. eCollection 2025 Mar.

本文引用的文献

1
Quantifying digital health inequality across a national healthcare system.量化国家医疗体系中的数字健康不平等。
BMJ Health Care Inform. 2023 Nov 24;30(1):e100809. doi: 10.1136/bmjhci-2023-100809.
2
Patient diversity and author representation in clinical studies supporting the Surviving Sepsis Campaign guidelines for management of sepsis and septic shock 2021: a systematic review of citations.支持脓毒症和脓毒性休克管理 2021 年《存活脓毒症运动指南》的临床研究中患者多样性和作者代表性:引文的系统评价。
BMC Infect Dis. 2023 Nov 1;23(1):751. doi: 10.1186/s12879-023-08745-4.
3
Disparity dashboards: an evaluation of the literature and framework for health equity improvement.
差距仪表盘:对文献的评估和改善健康公平的框架。
Lancet Digit Health. 2023 Nov;5(11):e831-e839. doi: 10.1016/S2589-7500(23)00150-4.
4
External validation of AI models in health should be replaced with recurring local validation.健康领域人工智能模型的外部验证应由定期的本地验证取而代之。
Nat Med. 2023 Nov;29(11):2686-2687. doi: 10.1038/s41591-023-02540-z.
5
Mapping and evaluating national data flows: transparency, privacy, and guiding infrastructural transformation.绘制和评估国家数据流:透明度、隐私和指导基础设施转型。
Lancet Digit Health. 2023 Oct;5(10):e737-e748. doi: 10.1016/S2589-7500(23)00157-7.
6
Critical Bias in Critical Care Devices.危重病设备中的关键偏见。
Crit Care Clin. 2023 Oct;39(4):795-813. doi: 10.1016/j.ccc.2023.02.005. Epub 2023 Mar 27.
7
Digital determinants of health: opportunities and risks amidst health inequities.健康的数字决定因素:健康不平等中的机遇与风险。
Nat Rev Nephrol. 2023 Dec;19(12):749-750. doi: 10.1038/s41581-023-00763-4.
8
Bias in artificial intelligence algorithms and recommendations for mitigation.人工智能算法中的偏差及缓解建议。
PLOS Digit Health. 2023 Jun 22;2(6):e0000278. doi: 10.1371/journal.pdig.0000278. eCollection 2023 Jun.
9
Sources of bias in artificial intelligence that perpetuate healthcare disparities-A global review.导致医疗保健差距长期存在的人工智能偏差来源——一项全球综述。
PLOS Digit Health. 2022 Mar 31;1(3):e0000022. doi: 10.1371/journal.pdig.0000022. eCollection 2022 Mar.
10
Systemic Racism in EEG Research: Considerations and Potential Solutions.脑电图研究中的系统性种族主义:考量与潜在解决方案
Affect Sci. 2021 May 26;3(1):14-20. doi: 10.1007/s42761-021-00050-0. eCollection 2022 Mar.