Mohammadi-Kambs Mina, Hölz Kathrin, Somoza Mark M, Ott Albrecht
Biological Experimental Physics, Saarland University, Campus B2.1, 66123 Saarbrücken, Germany.
Institute of Inorganic Chemistry, Faculty of Chemistry, University of Vienna, Althanstraße 14 (UZA II), 1090 Vienna, Austria.
ACS Omega. 2017 Apr 30;2(4):1302-1308. doi: 10.1021/acsomega.7b00053. Epub 2017 Apr 5.
DNA microarrays constitute an in vitro example system of a highly crowded molecular recognition environment. Although they are widely applied in many biological applications, some of the basic mechanisms of the hybridization processes of DNA remain poorly understood. On a microarray, cross-hybridization arises from similarities of sequences that may introduce errors during the transmission of information. Experimentally, we determine an appropriate distance, called minimum Hamming distance, in which the sequences of a set differ. By applying an algorithm based on a graph-theoretical method, we find large orthogonal sets of sequences that are sufficiently different not to exhibit any cross-hybridization. To create such a set, we first derive an analytical solution for the number of sequences that include at least four guanines in a row for a given sequence length and eliminate them from the list of candidate sequences. We experimentally confirm the orthogonality of the largest possible set with a size of 23 for the length of 7. We anticipate our work to be a starting point toward the study of signal propagation in highly competitive environments, besides its obvious application in DNA high throughput experiments.
DNA微阵列构成了一个高度拥挤的分子识别环境的体外示例系统。尽管它们在许多生物学应用中被广泛应用,但DNA杂交过程的一些基本机制仍知之甚少。在微阵列上,交叉杂交源于序列的相似性,这可能在信息传递过程中引入错误。通过实验,我们确定了一个合适的距离,称为最小汉明距离,其中一组序列存在差异。通过应用基于图论方法的算法,我们找到了足够不同以至于不会出现任何交叉杂交的大的正交序列集。为了创建这样一个集合,我们首先针对给定序列长度推导了连续包含至少四个鸟嘌呤的序列数量的解析解,并从候选序列列表中消除它们。我们通过实验证实了长度为7时最大可能集合(大小为23)的正交性。除了在DNA高通量实验中的明显应用外,我们预计我们的工作将成为研究高竞争环境中信号传播的起点。