Jeske Melanie, Saperstein Aliya, Lee Sandra Soo-Jin, Shim Janet K
Center for Medical Ethics and Health Policy, Baylor College of Medicine, Houston, TX, USA.
Department of Sociology, Stanford University, Stanford, CA, USA.
Soc Stud Sci. 2025 Apr;55(2):178-208. doi: 10.1177/03063127241288498. Epub 2024 Oct 7.
The production of large, shareable datasets is increasingly prioritized for a wide range of research purposes. In biomedicine, especially in the United States, calls to enhance representation of historically underrepresented populations in databases that integrate genomic, health history, demographic and lifestyle data have also increased in order to support the goals of precision medicine. Understanding the assumptions and values that shape the design of such datasets and the practices through which they are constructed are a pressing area of social inquiry. We examine how diversity is conceptualized in U.S. precision medicine research initiatives, specifically attending to how measures of diversity, including race, ethnicity, and medically underserved status, are constructed and harmonized to build commensurate datasets. In three case studies, we show how symbolic embrace of both diversity and harmonization efforts can compromise the utility of diversity data. Although big data and diverse population representation are heralded as the keys to unlocking the promises of precision medicine research, these cases reveal core tensions between what kinds of data are seen as central to 'the science' and which are marginalized.
为了实现广泛的研究目的,越来越重视生成大型、可共享的数据集。在生物医学领域,尤其是在美国,为了支持精准医学的目标,呼吁在整合基因组、健康史、人口统计学和生活方式数据的数据库中,提高历史上代表性不足人群的代表性的呼声也越来越高。理解塑造此类数据集设计的假设和价值观,以及构建这些数据集的实践,是社会调查中一个紧迫的领域。我们研究了美国精准医学研究计划中多样性是如何被概念化的,特别关注包括种族、族裔和医疗服务不足状况在内的多样性衡量标准是如何构建和协调,以建立相应的数据集。在三个案例研究中,我们展示了对多样性和协调努力的象征性接受如何可能损害多样性数据的效用。尽管大数据和多样化的人群代表性被誉为开启精准医学研究前景的关键,但这些案例揭示了在哪些数据被视为 “科学” 的核心数据以及哪些数据被边缘化之间的核心矛盾。