Stark Anthony, Shafran Izhak, Kaye Jeffrey
Biomedical Engineering, OHSU, Portland, USA
Proc IEEE Workshop Autom Speech Recognit Underst. 2008 Apr 3;1:378-384. doi: 10.1109/ICCV.2003.1238369. Epub 2003 Oct 13.
The ability to reliably infer the nature of telephone conversations opens up a variety of applications, ranging from designing context-sensitive user interfaces on smartphones, to providing new tools for social psychologists and social scientists to study and understand social life of different subpopulations within different contexts. Using a unique corpus of everyday telephone conversations collected from eight residences over the duration of a year, we investigate the utility of popular features, extracted solely from the content, in classifying business-oriented calls from others. Through feature selection experiments, we find that the discrimination can be performed robustly for a majority of the calls using a small set of features. Remarkably, features learned from unsupervised methods, specifically latent Dirichlet allocation, perform almost as well as with as those from supervised methods. The unsupervised clusters learned in this task shows promise of finer grain inference of social nature of telephone conversations.
可靠推断电话对话性质的能力开启了各种应用,从在智能手机上设计上下文敏感的用户界面,到为社会心理学家和社会科学家提供新工具,以研究和理解不同背景下不同亚群体的社会生活。我们使用从八个住所收集的为期一年的独特日常电话对话语料库,研究仅从内容中提取的流行特征在区分商务电话和其他电话方面的效用。通过特征选择实验,我们发现使用一小部分特征就能对大多数电话进行稳健的区分。值得注意的是,从无监督方法(特别是潜在狄利克雷分配)中学习到的特征,其表现几乎与从监督方法中学习到的特征一样好。在这项任务中学习到的无监督聚类显示出对电话对话社会性质进行更精细推断的前景。