Dürr Oliver, Sick Beate
Zurich University of Applied Sciences, School of Engineering, Winterthur, Switzerland
Zurich University of Applied Sciences, School of Engineering, Winterthur, Switzerland.
J Biomol Screen. 2016 Oct;21(9):998-1003. doi: 10.1177/1087057116631284. Epub 2016 Feb 12.
Deep learning methods are currently outperforming traditional state-of-the-art computer vision algorithms in diverse applications and recently even surpassed human performance in object recognition. Here we demonstrate the potential of deep learning methods to high-content screening-based phenotype classification. We trained a deep learning classifier in the form of convolutional neural networks with approximately 40,000 publicly available single-cell images from samples treated with compounds from four classes known to lead to different phenotypes. The input data consisted of multichannel images. The construction of appropriate feature definitions was part of the training and carried out by the convolutional network, without the need for expert knowledge or handcrafted features. We compare our results against the recent state-of-the-art pipeline in which predefined features are extracted from each cell using specialized software and then fed into various machine learning algorithms (support vector machine, Fisher linear discriminant, random forest) for classification. The performance of all classification approaches is evaluated on an untouched test image set with known phenotype classes. Compared to the best reference machine learning algorithm, the misclassification rate is reduced from 8.9% to 6.6%.
深度学习方法目前在各种应用中优于传统的最先进计算机视觉算法,最近甚至在目标识别方面超越了人类表现。在此,我们展示了深度学习方法在基于高内涵筛选的表型分类中的潜力。我们以卷积神经网络的形式训练了一个深度学习分类器,使用了来自四类已知会导致不同表型的化合物处理样本的约40,000张公开可用的单细胞图像。输入数据由多通道图像组成。构建适当的特征定义是训练的一部分,由卷积网络执行,无需专家知识或手工制作的特征。我们将我们的结果与最近的最先进流程进行比较,在该流程中,使用专门软件从每个细胞中提取预定义特征,然后将其输入到各种机器学习算法(支持向量机、Fisher线性判别、随机森林)中进行分类。所有分类方法的性能在具有已知表型类别的未触及测试图像集上进行评估。与最佳参考机器学习算法相比,误分类率从8.9%降至6.6%。