• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

尽责分类:数据科学家的歧视感知分类指南。

Conscientious Classification: A Data Scientist's Guide to Discrimination-Aware Classification.

机构信息

1 Department of Data Science, Zocdoc , New York, New York.

2 NYU Center for Data Science , New York, New York.

出版信息

Big Data. 2017 Jun;5(2):120-134. doi: 10.1089/big.2016.0048.

DOI:10.1089/big.2016.0048
PMID:28632437
Abstract

Recent research has helped to cultivate growing awareness that machine-learning systems fueled by big data can create or exacerbate troubling disparities in society. Much of this research comes from outside of the practicing data science community, leaving its members with little concrete guidance to proactively address these concerns. This article introduces issues of discrimination to the data science community on its own terms. In it, we tour the familiar data-mining process while providing a taxonomy of common practices that have the potential to produce unintended discrimination. We also survey how discrimination is commonly measured, and suggest how familiar development processes can be augmented to mitigate systems' discriminatory potential. We advocate that data scientists should be intentional about modeling and reducing discriminatory outcomes. Without doing so, their efforts will result in perpetuating any systemic discrimination that may exist, but under a misleading veil of data-driven objectivity.

摘要

最近的研究帮助人们越来越意识到,由大数据驱动的机器学习系统可能会在社会中造成或加剧令人不安的差异。这些研究大多来自实践数据科学界之外,使得成员们几乎没有具体的指导来主动解决这些问题。本文以其自身的术语向数据科学界介绍了歧视问题。在本文中,我们介绍了熟悉的数据挖掘过程,同时提供了可能产生意外歧视的常见做法的分类法。我们还调查了如何衡量歧视,并提出了如何增强常见的开发过程以减轻系统的歧视潜力。我们主张数据科学家应该有意地对模型和减少歧视性结果进行建模。如果不这样做,他们的努力将导致可能存在的任何系统性歧视的延续,而这种歧视是在一个具有误导性的数据驱动客观性的面纱下进行的。

相似文献

1
Conscientious Classification: A Data Scientist's Guide to Discrimination-Aware Classification.尽责分类:数据科学家的歧视感知分类指南。
Big Data. 2017 Jun;5(2):120-134. doi: 10.1089/big.2016.0048.
2
Toward Accountable Discrimination-Aware Data Mining: The Importance of Keeping the Human in the Loop-and Under the Looking Glass.迈向负责任的歧视感知数据挖掘:保持人机交互和监督的重要性。
Big Data. 2017 Jun;5(2):135-152. doi: 10.1089/big.2016.0055. Epub 2017 Jun 6.
3
Real alerts and artifact classification in archived multi-signal vital sign monitoring data: implications for mining big data.存档多信号生命体征监测数据中的真实警报与伪迹分类:对大数据挖掘的启示
J Clin Monit Comput. 2016 Dec;30(6):875-888. doi: 10.1007/s10877-015-9788-2. Epub 2015 Oct 5.
4
Values in environmental research: Citizens' views of scientists who acknowledge values.环境研究中的价值观:公民对承认价值观的科学家的看法。
PLoS One. 2017 Oct 25;12(10):e0186049. doi: 10.1371/journal.pone.0186049. eCollection 2017.
5
The Structural Consequences of Big Data-Driven Education.大数据驱动教育的结构后果。
Big Data. 2017 Jun;5(2):164-172. doi: 10.1089/big.2016.0061.
6
The evolution of boosting algorithms. From machine learning to statistical modelling.提升算法的演进。从机器学习到统计建模。
Methods Inf Med. 2014;53(6):419-27. doi: 10.3414/ME13-01-0122. Epub 2014 Aug 12.
7
Clinical chemistry in higher dimensions: Machine-learning and enhanced prediction from routine clinical chemistry data.高维度临床化学:基于常规临床化学数据的机器学习与增强预测
Clin Biochem. 2016 Nov;49(16-17):1213-1220. doi: 10.1016/j.clinbiochem.2016.07.013. Epub 2016 Jul 22.
8
A critical evaluation of science outreach via social media: its role and impact on scientists.对通过社交媒体进行科学推广的批判性评估:其对科学家的作用和影响。
F1000Res. 2014 Dec 9;3:300. doi: 10.12688/f1000research.5918.2. eCollection 2014.
9
Visual Analysis of Discrimination in Machine Learning.机器学习中的歧视可视化分析。
IEEE Trans Vis Comput Graph. 2021 Feb;27(2):1470-1480. doi: 10.1109/TVCG.2020.3030471. Epub 2021 Jan 28.
10
Machine learning: Trends, perspectives, and prospects.机器学习:趋势、观点和展望。
Science. 2015 Jul 17;349(6245):255-60. doi: 10.1126/science.aaa8415.

引用本文的文献

1
Phenotype augmentation using generative AI for isocitrate dehydrogenase mutation prediction in glioma.使用生成式人工智能进行胶质瘤异柠檬酸脱氢酶突变预测的表型增强。
Sci Rep. 2025 Aug 7;15(1):28913. doi: 10.1038/s41598-025-14477-z.
2
Empirical Comparison of Post-processing Debiasing Methods for Machine Learning Classifiers in Healthcare.医疗保健领域机器学习分类器后处理去偏方法的实证比较
J Healthc Inform Res. 2025 Mar 20;9(3):465-493. doi: 10.1007/s41666-025-00196-7. eCollection 2025 Sep.
3
Development and Validation of a Novel Nomogram Risk Prediction Model for In-Hospital Death Following Extended Aortic Arch Repair for Acute Type A Aortic Dissection.
一种新型列线图风险预测模型的开发与验证,用于急性A型主动脉夹层广泛性主动脉弓修复术后院内死亡情况
Rev Cardiovasc Med. 2025 Apr 21;26(4):26943. doi: 10.31083/RCM26943. eCollection 2025 Apr.
4
Bias in medical AI: Implications for clinical decision-making.医学人工智能中的偏差:对临床决策的影响。
PLOS Digit Health. 2024 Nov 7;3(11):e0000651. doi: 10.1371/journal.pdig.0000651. eCollection 2024 Nov.
5
Challenges in Reducing Bias Using Post-Processing Fairness for Breast Cancer Stage Classification with Deep Learning.使用深度学习对乳腺癌分期分类进行后处理公平性以减少偏差时面临的挑战。
Algorithms. 2024 Apr;17(4). doi: 10.3390/a17040141. Epub 2024 Mar 28.
6
Minimizing bias when using artificial intelligence in critical care medicine.在重症医学中使用人工智能时尽量减少偏差。
J Crit Care. 2024 Aug;82:154796. doi: 10.1016/j.jcrc.2024.154796. Epub 2024 Mar 29.
7
Preparing for the bedside-optimizing a postpartum depression risk prediction model for clinical implementation in a health system.为床边准备-优化产后抑郁症风险预测模型,以在卫生系统中临床实施。
J Am Med Inform Assoc. 2024 May 20;31(6):1258-1267. doi: 10.1093/jamia/ocae056.
8
Assessing machine learning for fair prediction of ADHD in school pupils using a retrospective cohort study of linked education and healthcare data.使用回顾性队列研究,结合教育和医疗保健数据,评估机器学习在预测小学生注意力缺陷多动障碍(ADHD)方面的公平性。
BMJ Open. 2022 Dec 5;12(12):e058058. doi: 10.1136/bmjopen-2021-058058.
9
Developing medical imaging AI for emerging infectious diseases.开发用于新发传染病的医学影像 AI。
Nat Commun. 2022 Nov 18;13(1):7060. doi: 10.1038/s41467-022-34234-4.
10
Clinical significance, challenges and limitations in using artificial intelligence for electrocardiography-based diagnosis.基于心电图的人工智能诊断的临床意义、挑战与局限
Int J Arrhythmia. 2022;23(1):24. doi: 10.1186/s42444-022-00075-x. Epub 2022 Oct 1.