Sharifi Noghabi Hossein, Mohammadi Majid, Tan Yao-Hua
The Center of Excellence of Soft Computing and Intelligent Information Processing (SCIIP), Ferdowsi University of Mashhad, Iran.
Department of Technology, Policy and Management, Delft University of Technology, Netherlands.
IET Syst Biol. 2016 Dec;10(6):229-236. doi: 10.1049/iet-syb.2015.0081.
One of the most important needs in the post-genome era is providing the researchers with reliable and efficient computational tools to extract and analyse this huge amount of biological data, in which DNA copy number variation (CNV) is a vitally important one. Array-based comparative genomic hybridisation (aCGH) is a common approach in order to detect CNVs. Most of methods for this purpose were proposed for one-dimensional profiles. However, slightly this focus has moved from one- to multi-dimensional signals. In addition, since contamination of these profiles with noise is always an issue, it is highly important to have a robust method for analysing multi-sample aCGH profiles. In this study, the authors propose robust group fused lasso which utilises the robust group total variations. Instead of norm, the - M-estimator is used which is more robust in dealing with non-Gaussian noise and high corruption. More importantly, Correntropy (Welsch M-estimator) is also applied for fitting error. Extensive experiments indicate that the proposed method outperforms the state-of-the art algorithms and techniques under a wide range of scenarios with diverse noises.
后基因组时代最重要的需求之一是为研究人员提供可靠且高效的计算工具,以提取和分析这类海量的生物数据,其中DNA拷贝数变异(CNV)是极为重要的一种。基于阵列的比较基因组杂交(aCGH)是检测CNV的常用方法。为此目的的大多数方法都是针对一维图谱提出的。然而,这种关注点已从一维信号略微转向了多维信号。此外,由于这些图谱总是存在噪声污染问题,拥有一种稳健的方法来分析多样本aCGH图谱非常重要。在本研究中,作者提出了利用稳健组总变差的稳健组融合套索。使用了 - M估计器而非范数,它在处理非高斯噪声和高噪声干扰时更稳健。更重要的是,相关熵(韦尔施M估计器)也被用于拟合误差。大量实验表明,所提出的方法在各种具有不同噪声的场景下优于现有算法和技术。