Memon Farhat N, Owen Anne M, Sanchez-Graillet Olivia, Upton Graham J G, Harrison Andrew P
Departments of Mathematical Sciences and Biological Sciences, University of Essex, Wivenhoe Park, Essex, United Kingdom.
J Integr Bioinform. 2010 Jan 15;7(2):111. doi: 10.2390/biecoll-jib-2010-111.
A tetramer quadruplex structure is formed by four parallel strands of DNA/ RNA containing runs of guanine. These quadruplexes are able to form because guanine can Hoogsteen hydrogen bond to other guanines, and a tetrad of guanines can form a stable arrangement. Recently we have discovered that probes on Affymetrix GeneChips that contain runs of guanine do not measure gene expression reliably. We associate this finding with the likelihood that quadruplexes are forming on the surface of GeneChips. In order to cope with the rapidly expanding size of GeneChip array datasets in the public domain, we are exploring the use of cloud computing to replicate our experiments on 3' arrays to look at the effect of the location of G-spots (runs of guanines). Cloud computing is a recently introduced high-performance solution that takes advantage of the computational infrastructure of large organisations such as Amazon and Google. We expect that cloud computing will become widely adopted because it enables bioinformaticians to avoid capital expenditure on expensive computing resources and to only pay a cloud computing provider for what is used. Moreover, as well as financial efficiency, cloud computing is an ecologically-friendly technology, it enables efficient data-sharing and we expect it to be faster for development purposes. Here we propose the advantageous use of cloud computing to perform a large data-mining analysis of public domain 3' arrays.
四聚体四重结构由四条含有鸟嘌呤序列的平行DNA/RNA链形成。这些四重结构能够形成是因为鸟嘌呤可以通过Hoogsteen氢键与其他鸟嘌呤结合,并且四个鸟嘌呤可以形成稳定的排列。最近我们发现,Affymetrix基因芯片上含有鸟嘌呤序列的探针不能可靠地测量基因表达。我们将这一发现与基因芯片表面形成四重结构的可能性联系起来。为了应对公共领域中基因芯片阵列数据集迅速扩大的规模,我们正在探索使用云计算在3'阵列上重复我们的实验,以研究G位点(鸟嘌呤序列)位置的影响。云计算是最近引入的一种高性能解决方案,它利用了亚马逊和谷歌等大型组织的计算基础设施。我们预计云计算将被广泛采用,因为它使生物信息学家能够避免在昂贵的计算资源上进行资本支出,而只需向云计算提供商支付所使用资源的费用。此外,除了财务效率外,云计算还是一种生态友好型技术,它能够实现高效的数据共享,并且我们预计它在开发目的上会更快。在此,我们提出利用云计算对公共领域的3'阵列进行大数据挖掘分析的优势。