Seghier Mohamed L
Department of Biomedical Engineering, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates.
F1000Res. 2021 Nov 8;10:1127. doi: 10.12688/f1000research.73876.1. eCollection 2021.
Big data is transforming many sectors, with far-reaching consequences to how decisions are made and how knowledge is produced and shared. In the current move toward more data-led decisions and data-intensive science, we aim here to examine three issues that are changing the way data are read and used. First, there is a shift toward paradigms that involve a large amount of data. In such paradigms, the creation of complex data-led models becomes tractable and appealing to generate predictions and explanations. This necessitates for instance a rethinking of Occam's razor principle in the context of knowledge discovery. Second, there is a growing erosion of the human role in decision making and knowledge discovery processes. Human users' involvement is decreasing at an alarming rate, with no say on how to read, process, and summarize data. This makes legal responsibility and accountability hard to define. Third, thanks to its increasing popularity, big data is gaining a seductive allure, where volume and complexity of big data can de facto confer more persuasion and significance to knowledge or decisions that result from big-data-based processes. These issues call for an active human role by creating opportunities to incorporate, in the most unbiased way, human expertise and prior knowledge in decision making and knowledge production. This also requires putting in place robust monitoring and appraisal mechanisms to ensure that relevant data is answering the right questions. As the proliferation of data continues to grow, we need to rethink the way we interact with data to serve human needs.
大数据正在改变许多领域,对决策方式以及知识的产生和共享产生深远影响。在当前朝着更多以数据为导向的决策和数据密集型科学发展的趋势下,我们在此旨在探讨正在改变数据读取和使用方式的三个问题。首先,正朝着涉及大量数据的范式转变。在这种范式中,创建复杂的数据驱动模型变得可行且有吸引力,以生成预测和解释。例如,这就需要在知识发现的背景下重新思考奥卡姆剃刀原理。其次,在决策和知识发现过程中,人类的作用正在日益受到侵蚀。人类用户的参与度正以惊人的速度下降,对如何读取、处理和总结数据没有发言权。这使得法律责任和问责制难以界定。第三,由于大数据越来越受欢迎,它正具有一种诱人的魅力,即大数据的体量和复杂性实际上可以赋予基于大数据的过程所产生的知识或决策更多的说服力和重要性。这些问题要求人类积极发挥作用,通过创造机会以最公正的方式将人类专业知识和先验知识纳入决策和知识生产中。这还需要建立强大的监测和评估机制,以确保相关数据能回答正确的问题。随着数据的不断激增,我们需要重新思考我们与数据交互以满足人类需求的方式。