Blei David M, Smyth Padhraic
Department of Computer Science, Columbia University, New York, NY 10027;
Department of Statistics, Columbia University, New York, NY 10027.
Proc Natl Acad Sci U S A. 2017 Aug 15;114(33):8689-8692. doi: 10.1073/pnas.1702076114. Epub 2017 Aug 7.
Data science has attracted a lot of attention, promising to turn vast amounts of data into useful predictions and insights. In this article, we ask why scientists should care about data science. To answer, we discuss data science from three perspectives: statistical, computational, and human. Although each of the three is a critical component of data science, we argue that the effective combination of all three components is the essence of what data science is about.