College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, PR China.
Anal Chim Acta. 2012 Aug 31;740:20-6. doi: 10.1016/j.aca.2012.06.031. Epub 2012 Jun 28.
The identification of disease-relevant genes represents a challenge in microarray-based disease diagnosis where the sample size is often limited. Among established methods, reversible jump Markov Chain Monte Carlo (RJMCMC) methods have proven to be quite promising for variable selection. However, the design and application of an RJMCMC algorithm requires, for example, special criteria for prior distributions. Also, the simulation from joint posterior distributions of models is computationally extensive, and may even be mathematically intractable. These disadvantages may limit the applications of RJMCMC algorithms. Therefore, the development of algorithms that possess the advantages of RJMCMC methods and are also efficient and easy to follow for selecting disease-associated genes is required. Here we report a RJMCMC-like method, called random frog that possesses the advantages of RJMCMC methods and is much easier to implement. Using the colon and the estrogen gene expression datasets, we show that random frog is effective in identifying discriminating genes. The top 2 ranked genes for colon and estrogen are Z50753, U00968, and Y10871_at, Z22536_at, respectively. (The source codes with GNU General Public License Version 2.0 are freely available to non-commercial users at: http://code.google.com/p/randomfrog/.).
在基于微阵列的疾病诊断中,样本量通常有限,因此鉴定与疾病相关的基因是一项挑战。在已建立的方法中,可逆跳跃马尔可夫链蒙特卡罗(RJMCMC)方法已被证明在变量选择方面非常有前途。然而,RJMCMC 算法的设计和应用需要例如先验分布的特殊标准。此外,从模型的联合后验分布进行模拟在计算上是广泛的,甚至可能在数学上是难以处理的。这些缺点可能会限制 RJMCMC 算法的应用。因此,需要开发具有 RJMCMC 方法优势且高效且易于遵循的算法,用于选择与疾病相关的基因。在这里,我们报告了一种类似于 RJMCMC 的方法,称为随机青蛙(random frog),它具有 RJMCMC 方法的优势,并且更容易实现。使用结肠和雌激素基因表达数据集,我们表明随机青蛙在识别区分基因方面是有效的。对于结肠和雌激素,排名前 2 的基因分别为 Z50753、U00968 和 Y10871_at、Z22536_at。(带有 GNU 通用公共许可证版本 2.0 的源代码可供非商业用户免费使用:http://code.google.com/p/randomfrog/。)。