Department of Informatics and Telecommunications, University of Athens, Athens, Greece.
BMC Bioinformatics. 2010 Jan 25;11:49. doi: 10.1186/1471-2105-11-49.
Complementary DNA (cDNA) microarrays are a well established technology for studying gene expression. A microarray image is obtained by laser scanning a hybridized cDNA microarray, which consists of thousands of spots representing chains of cDNA sequences, arranged in a two-dimensional array. The separation of the spots into distinct cells is widely known as microarray image gridding.
In this paper we propose M3G, a novel method for automatic gridding of cDNA microarray images based on the maximization of the margin between the rows and the columns of the spots. Initially the microarray image rotation is estimated and then a pre-processing algorithm is applied for a rough spot detection. In order to diminish the effect of artefacts, only a subset of the detected spots is selected by matching the distribution of the spot sizes to the normal distribution. Then, a set of grid lines is placed on the image in order to separate each pair of consecutive rows and columns of the selected spots. The optimal positioning of the lines is determined by maximizing the margin between these rows and columns by using a maximum margin linear classifier, effectively facilitating the localization of the spots.
The experimental evaluation was based on a reference set of microarray images containing more than two million spots in total. The results show that M3G outperforms state of the art methods, demonstrating robustness in the presence of noise and artefacts. More than 98% of the spots reside completely inside their respective grid cells, whereas the mean distance between the spot center and the grid cell center is 1.2 pixels.
The proposed method performs highly accurate gridding in the presence of noise and artefacts, while taking into account the input image rotation. Thus, it provides the potential of achieving perfect gridding for the vast majority of the spots.
互补 DNA(cDNA)微阵列是一种用于研究基因表达的成熟技术。通过激光扫描杂交 cDNA 微阵列获得微阵列图像,该微阵列由数千个代表 cDNA 序列链的斑点组成,排列在二维阵列中。斑点分离成不同的细胞通常被称为微阵列图像网格划分。
在本文中,我们提出了 M3G,这是一种基于最大化斑点行和列之间边界的 cDNA 微阵列图像自动网格划分的新方法。最初估计微阵列图像的旋转,然后应用预处理算法进行粗略斑点检测。为了减少伪影的影响,仅通过将斑点大小的分布与正态分布匹配来选择检测到的斑点的子集。然后,在图像上放置一组网格线,以便分离所选斑点的每一对连续行和列。通过使用最大边界线性分类器最大化这些行和列之间的边界来确定线的最佳定位,有效地促进了斑点的定位。
实验评估基于包含总共超过两百万个斑点的参考集微阵列图像。结果表明,M3G 优于最先进的方法,在存在噪声和伪影的情况下表现出稳健性。超过 98%的斑点完全位于各自的网格单元内,而斑点中心和网格单元中心之间的平均距离为 1.2 像素。
该方法在存在噪声和伪影的情况下,同时考虑输入图像旋转,实现了高精度的网格划分。因此,它为绝大多数斑点提供了实现完美网格划分的潜力。