Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China.
Bioinformatics. 2015 Mar 1;31(5):634-41. doi: 10.1093/bioinformatics/btu702. Epub 2014 Oct 22.
The existing methods for genetic-interaction detection in genome-wide association studies are designed from different paradigms, and their performances vary considerably for different disease models. One important reason for this variability is that their construction is based on a single-correlation model between SNPs and disease. Due to potential model preference and disease complexity, a single-objective method will therefore not work well in general, resulting in low power and a high false-positive rate.
In this work, we present a multi-objective heuristic optimization methodology named MACOED for detecting genetic interactions. In MACOED, we combine both logistical regression and Bayesian network methods, which are from opposing schools of statistics. The combination of these two evaluation objectives proved to be complementary, resulting in higher power with a lower false-positive rate than observed for optimizing either objective independently. To solve the space and time complexity for high-dimension problems, a memory-based multi-objective ant colony optimization algorithm is designed in MACOED that is able to retain non-dominated solutions found in past iterations.
We compared MACOED with other recent algorithms using both simulated and real datasets. The experimental results demonstrate that our method outperforms others in both detection power and computational feasibility for large datasets.
Codes and datasets are available at: www.csbio.sjtu.edu.cn/bioinf/MACOED/.
全基因组关联研究中现有的遗传相互作用检测方法是基于不同的范式设计的,它们在不同的疾病模型中的性能差异很大。造成这种可变性的一个重要原因是,它们的构建是基于 SNP 与疾病之间的单一相关模型。由于潜在的模型偏好和疾病的复杂性,因此单一目标方法通常不能很好地发挥作用,导致低功效和高假阳性率。
在这项工作中,我们提出了一种名为 MACOED 的用于检测遗传相互作用的多目标启发式优化方法。在 MACOED 中,我们结合了逻辑回归和贝叶斯网络方法,它们分别来自统计学的两个对立学派。这两种评估目标的结合被证明是互补的,与独立优化任何一个目标相比,其功效更高,假阳性率更低。为了解决高维问题的空间和时间复杂度,我们在 MACOED 中设计了一种基于记忆的多目标蚁群优化算法,能够保留过去迭代中找到的非支配解。
我们使用模拟数据集和真实数据集将 MACOED 与其他最近的算法进行了比较。实验结果表明,我们的方法在检测功效和计算大型数据集的可行性方面都优于其他方法。
代码和数据集可在:www.csbio.sjtu.edu.cn/bioinf/MACOED/ 获取。