Encoding Collective Knowledge, Instructing Data Reusers: The Collaborative Fixation of a Digital Scientific Data Set.

Author information

Hoeppe Götz

Affiliation

Departments of Anthropology and Sociology & Legal Studies, University of Waterloo, 200 University Avenue West, Waterloo, Ontario N2L 3G1 Canada.

Publication information

Comput Support Coop Work. 2021;30(4):463-505. doi: 10.1007/s10606-021-09407-2. Epub 2021 Oct 25.

Abstract

This article offers a novel perspective on the use and reuse of scientific data through a chronological ethnographic account and analysis of how a team of researchers prepared an astronomical catalogue (a table of measured properties of galaxies) for public release. Whereas much existing work on data reuse has focused on information about data (such as metadata), whose form or absence has been described as a hurdle to reusing data successfully, I describe how data makers tried to instruct users through the processed data themselves. The fixation of this catalogue was a negotiation, resulting in what was acceptable to team members and coherent with the diverse data uses pertinent to their completed work. It was through preparing their catalogue as an 'instructing data object' that this team sought to encode its members' knowledge of how the data were processed and to make it consequential for users by devising methodical ways to structure anticipated uses. These methods included introducing redundancies that would help users to self-correct mistaken uses, selectively deleting data, and deflecting accountability through notational choices. They rest on an understanding of knowledge not as exclusively propositional (such as the belief in propositions), but as embedded in witnessable activities and the products of these activities. I discuss the implications of this account for philosophical notions of collective knowledge and for theorizing coordinative artifacts in CSCW. Finally, I identify a tension between 'using algorithms' and 'doing science' in preparing data sets and show how it was resolved in this case.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/986a/8608782/b8e78a0b5935/10606_2021_9407_Fig1_HTML.jpg
