School of Computer Science and Technology, Xidian University, Xi'an, PR China.
PLoS One. 2012;7(12):e52516. doi: 10.1371/journal.pone.0052516. Epub 2012 Dec 20.
Recurrent copy number alterations (CNAs) play an important role in cancer genesis. While a number of computational methods have been proposed for identifying such CNAs, their relative merits remain largely unknown in practice since very few efforts have been focused on comparative analysis of the methods. To facilitate studies of recurrent CNA identification in cancer genome, it is imperative to conduct a comprehensive comparison of performance and limitations among existing methods. In this paper, six representative methods proposed in the latest six years are compared. These include one-stage and two-stage approaches, working with raw intensity ratio data and discretized data respectively. They are based on various techniques such as kernel regression, correlation matrix diagonal segmentation, semi-parametric permutation and cyclic permutation schemes. We explore multiple criteria including type I error rate, detection power, Receiver Operating Characteristics (ROC) curve and the area under curve (AUC), and computational complexity, to evaluate performance of the methods under multiple simulation scenarios. We also characterize their abilities on applications to two real datasets obtained from cancers with lung adenocarcinoma and glioblastoma. This comparison study reveals general characteristics of the existing methods for identifying recurrent CNAs, and further provides new insights into their strengths and weaknesses. It is believed helpful to accelerate the development of novel and improved methods.
复发性拷贝数改变(CNAs)在癌症发生中起着重要作用。虽然已经提出了许多用于识别此类 CNAs 的计算方法,但由于很少有研究关注方法的比较分析,因此它们在实践中的相对优势在很大程度上仍然未知。为了促进癌症基因组中复发性 CNA 识别的研究,有必要对现有方法的性能和局限性进行全面比较。在本文中,我们比较了最近六年提出的六种代表性方法。这些方法包括分别使用原始强度比数据和离散数据的一阶段和两阶段方法。它们基于核回归、相关矩阵对角分割、半参数置换和循环置换方案等各种技术。我们探索了多种标准,包括误报率、检测能力、接收器操作特性(ROC)曲线和曲线下面积(AUC)以及计算复杂度,以评估方法在多种模拟场景下的性能。我们还描述了它们在应用于从肺腺癌和胶质母细胞瘤获得的两个真实数据集的能力。这项比较研究揭示了现有方法识别复发性 CNA 的一般特征,并进一步深入了解了它们的优缺点。我们相信这有助于加速新型和改进方法的发展。