Pentland Alex, Lazer David, Brewer Devon, Heibeck Tracy
Massachusetts Institute of Technology, USA.
Stud Health Technol Inform. 2009;149:93-102.
We live our lives in digital networks. We wake up in the morning, check our e-mail, make a quick phone call, commute to work, buy lunch. Many of these transactions leave digital breadcrumbs--tiny records of our daily experiences. Reality mining, which pulls together these crumbs using statistical analysis and machine learning methods, offers an increasingly comprehensive picture of our lives, both individually and collectively, with the potential of transforming our understanding of ourselves, our organizations, and our society in a fashion that was barely conceivable just a few years ago. It is for this reason that reality mining was recently identified by Technology Review as one of "10 emerging technologies that could change the world". Many everyday devices provide the raw database upon which reality mining builds; sensors in mobile phones, cars, security cameras, RFID ('smart card') readers, and others, all allow for the measurement of human physical and social activity. Computational models based on such data have the potential to dramatically transform the arenas of both individual and community health. Reality mining can provide new opportunities with respect to diagnosis, patient and treatment monitoring, health services planning, surveillance of disease and risk factors, and public health investigation and disease control. Currently, the single most important source of reality mining data is the ubiquitous mobile phone. Every time a person uses a mobile phone, a few bits of information are left behind. The phone pings the nearest mobile-phone towers, revealing its location. The mobile phone service provider records the duration of the call and the number dialed. In the near future, mobile phones and other technologies will collect even more information about their users, recording everything from their physical activity to their conversational cadences. While such data pose a potential threat to individual privacy, they also offer great potential value both to individuals and communities. With the aid of data-mining algorithms, these data could shed light on individual patterns of behavior and even on the well-being of communities, creating new ways to improve public health and medicine. To illustrate, consider two examples of how reality mining may benefit individual health care. By taking advantage of special sensors in mobile phones, such as the microphone or the accelerometers built into newer devices such as Apple's iPhone, important diagnostic data can be captured. Clinical pilot data demonstrate that it may be possible to diagnose depression from the way a person talks--a depressed person tends to speak more slowly, a change that speech analysis software on a phone might recognize more readily than friends or family do. Similarly, monitoring a phone's motion sensors can also reveal small changes in gait, which could be an early indicator of ailments such as Parkinson's disease. Within the next few years reality mining will become more common, thanks in part to the proliferation and increasing sophistication of mobile phones. Many handheld devices now have the processing power of low-end desktop computers, and they can also collect more varied data, due to components such as GPS chips that track location. The Chief Technology Officer of EMC, a large digital storage company, estimates that this sort of personal sensor data will balloon from 10% of all stored information to 90% within the next decade. While the promise of reality mining is great, the idea of collecting so much personal information naturally raises many questions about privacy. It is crucial that behavior-logging technology not be forced on anyone. But legal statutes are lagging behind data collection capabilities, making it particularly important to begin discussing how the technology will and should be used. Therefore, an additional focus of this chapter will be the development of a legal and ethical framework concerning the data used by reality mining techniques.
我们生活在数字网络之中。清晨醒来,我们查看电子邮件,打个简短的电话,乘车上班,购买午餐。这些活动大多会留下数字踪迹——我们日常经历的微小记录。现实挖掘利用统计分析和机器学习方法收集这些踪迹,正为我们个人和集体的生活描绘出一幅日益全面的图景,其有可能以一种就在几年前还几乎无法想象的方式改变我们对自身、组织及社会的理解。正因如此,现实挖掘最近被《技术评论》列为“能够改变世界的10项新兴技术”之一。许多日常设备提供了现实挖掘所依赖的原始数据库;手机、汽车、安全摄像头、射频识别(“智能卡”)读卡器及其他设备中的传感器,都能对人类的身体活动和社交活动进行测量。基于此类数据的计算模型有可能极大地改变个人健康和社区健康领域。现实挖掘在诊断、患者及治疗监测、卫生服务规划、疾病及风险因素监测以及公共卫生调查与疾病控制等方面能够提供新的机遇。目前,现实挖掘数据最重要的单一来源是无处不在的手机。每当一个人使用手机时,就会留下一些信息。手机会向最近的手机信号塔发送信号,从而显示其位置。移动电话服务提供商记录通话时长和拨打的号码。在不久的将来,手机及其他技术将收集更多有关用户的信息,记录从身体活动到谈话节奏等所有信息。虽然此类数据对个人隐私构成潜在威胁,但它们对个人和社区也具有巨大的潜在价值。借助数据挖掘算法,这些数据能够揭示个人行为模式乃至社区的健康状况,创造改善公共卫生和医学的新途径。举例来说,考虑两个现实挖掘可能有益于个人医疗保健的例子。通过利用手机中的特殊传感器,比如苹果iPhone等较新设备内置的麦克风或加速度计,能够获取重要的诊断数据。临床试点数据表明,有可能从一个人的说话方式诊断出抑郁症——抑郁的人往往说话较慢,手机上的语音分析软件可能比朋友或家人更容易识别这种变化。同样,监测手机的运动传感器也能揭示步态的微小变化,这可能是帕金森病等疾病的早期指标。在未来几年,现实挖掘将变得更加普遍,这在一定程度上得益于手机的普及和日益复杂。现在许多手持设备具备低端台式电脑的处理能力,而且由于诸如用于跟踪位置的全球定位系统芯片等组件,它们还能收集更多种类的数据。大型数字存储公司EMC的首席技术官估计,在未来十年内,这类个人传感器数据将从所有存储信息的10%激增至90%。虽然现实挖掘前景广阔,但收集如此多个人信息的想法自然引发了许多关于隐私的问题。至关重要的是,行为记录技术不能强加于任何人。但法律法规滞后于数据收集能力,因此开始讨论该技术将如何以及应该如何使用就显得尤为重要。所以,本章的另一个重点将是制定关于现实挖掘技术所使用数据的法律和道德框架。