Hamilton Nicholas A, Teasdale Rohan D
ARC Centre of Excellence in Bioinformatics, The University of Queensland, Brisbane, Queensland 4072, Australia.
BMC Bioinformatics. 2008 Feb 4;9:81. doi: 10.1186/1471-2105-9-81.
BACKGROUND: The expansion of automatic imaging technologies has created a need to be able to efficiently compare and review large sets of image data. To enable comparisons of image data between samples we need to define the normal variation within distinct images of the same sample. Even with tightly controlled experimental conditions, protein expression can vary widely between cells, and because of the difficulty in viewing and comparing large image sets this might not be observed. Here we introduce a novel methodology, iCluster, for visualizing, clustering and comparing large sub-cellular localization image sets. For each member of an image set, iCluster generates statistics that have been found to be useful in distinguishing sub-cellular localization. The statistics are mapped into two or three dimensions such as to preserve distances between the statistics vectors. The complete image set is then visualized in two or three dimensions using the coordinates so determined. The result is images that are statistically similar are spatially close in the visualization allowing for easy comparison of images that are similar and distinguishment of dissimilar images into distinct clusters. RESULTS: The methodology was tested on a set of 502 previously published images containing 10 known sub-cellular localizations. The clustering of images of like type was evaluated both by examining the classes of nearest neighbors to each image and by visual inspection. In three dimensions, 3-neighbor classification accuracy was 83.2%. Visually, each class clustered well with the majority of classes localizing to distinct regions of the space. In two dimensions, 3-neighbor classification accuracy was 68.9%, though visually clustering into classes could be readily discerned. Computational expense was found to be relatively low, and sets of up to 1400 images visualized and interacted with in real time. CONCLUSION: The feasibility of automated spatial layout to allow comparison and discrimination of high throughput sub-cellular imaging has been demonstrated. There are many potential applications such as image database curation, semi-automated interactive classification, outlier detection and reference image comparison. By allowing the observation of the full range of imaging data available using modern microscopes these methods will provide an invaluable tool for cell biologists.
背景:自动成像技术的扩展使得有必要能够高效地比较和审查大量图像数据。为了能够比较不同样本之间的图像数据,我们需要定义同一样本不同图像中的正常变异。即使在严格控制的实验条件下,蛋白质表达在细胞之间也可能有很大差异,并且由于查看和比较大型图像集存在困难,这种差异可能无法被观察到。在此,我们引入一种新方法iCluster,用于可视化、聚类和比较大型亚细胞定位图像集。对于图像集的每个成员,iCluster生成已被发现有助于区分亚细胞定位的统计数据。这些统计数据被映射到二维或三维空间,以保留统计向量之间的距离。然后使用如此确定的坐标在二维或三维空间中可视化完整的图像集。结果是,在可视化中,统计上相似的图像在空间上接近,从而便于相似图像的轻松比较以及将不同图像区分为不同的聚类。 结果:该方法在一组502张先前发表的包含10种已知亚细胞定位的图像上进行了测试。通过检查每个图像的最近邻类别以及视觉检查来评估同类图像聚类情况。在三维空间中,3近邻分类准确率为83.2%。从视觉上看,每个类别聚类良好,大多数类别定位到空间的不同区域。在二维空间中,3近邻分类准确率为68.9%,不过从视觉上仍可容易地辨别出聚类成不同类别。发现计算成本相对较低,并且能够实时可视化和交互处理多达1400张图像的集合。 结论:已经证明了自动空间布局用于高通量亚细胞成像比较和辨别的可行性。存在许多潜在应用,如图像数据库管理、半自动交互式分类、异常值检测和参考图像比较。通过允许观察使用现代显微镜可获得的全部成像数据,这些方法将为细胞生物学家提供一个宝贵工具。
BMC Bioinformatics. 2008-2-4
BMC Bioinformatics. 2009-3-22
BMC Bioinformatics. 2011-4-30
BMC Bioinformatics. 2007-3-30
IEEE Trans Vis Comput Graph. 2005
IEEE Trans Biomed Eng. 2005-12
J Nucl Med. 2004-1
BMC Bioinformatics. 2014-10-3
Cell Commun Signal. 2013-4-11
Circ Res. 2011-2-18
BMC Bioinformatics. 2009-3-22
IEEE Trans Neural Netw. 1995
Nucleic Acids Res. 2008-1
J Cell Sci. 2007-11-1
BMC Bioinformatics. 2007-3-30
Nat Biotechnol. 2006-10
Nat Rev Drug Discov. 2006-4
Nucleic Acids Res. 2006-1-1
J Biomed Biotechnol. 2005-6-30
Science. 2004-11-12