Stability-based comparison of class discovery methods for DNA copy number profiles.

Authors

Brito Isabel, Hupé Philippe, Neuvial Pierre, Barillot Emmanuel

Affiliations

Institut Curie, Paris, France ; INSERM, U900, Paris, France ; Mines ParisTech, Fontainebleau, France.

Publication information

PLoS One. 2013 Dec 5;8(12):e81458. doi: 10.1371/journal.pone.0081458. eCollection 2013.

DOI:10.1371/journal.pone.0081458

PMID:24339933

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC3855312/

Abstract

MOTIVATION

Array-CGH can be used to determine DNA copy number, imbalances in which are a fundamental factor in the genesis and progression of tumors. The discovery of classes with similar patterns of array-CGH profiles therefore adds to our understanding of cancer and the treatment of patients. Various input data representations for array-CGH, dissimilarity measures between tumor samples and clustering algorithms may be used for this purpose. The choice between procedures is often difficult. An evaluation procedure is therefore required to select the best class discovery method (combination of one input data representation, one dissimilarity measure and one clustering algorithm) for array-CGH. Robustness of the resulting classes is a common requirement, but no stability-based comparison of class discovery methods for array-CGH profiles has ever been reported.

RESULTS

We applied several class discovery methods and evaluated the stability of their solutions with a modified version of Bertoni's χ²-based test [1]. Our version relaxes the independence assumption required by Bertoni's original χ²-based test. We conclude that the most robust classes of array-CGH profiles are obtained with Minimal Regions of alteration (a concept introduced by [2]) as the input data representation, sim [3] or agree [4] as the dissimilarity measure, and average group distance in the clustering algorithm.
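
To make the notion of stability concrete, the following is a minimal sketch of a generic subsampling-based stability score. It is not the modified χ²-based test used in the paper: it clusters pairs of random subsamples with average-linkage agglomeration and averages the Rand index of the two labelings on the samples they share. The mismatch-fraction dissimilarity, the toy profiles and all function names are assumptions made for illustration; they stand in for the sim and agree measures and the Minimal Regions representation, which are not defined in this abstract.

```python
import random
from itertools import combinations

def mismatch(a, b):
    """Assumed dissimilarity: fraction of positions where two called
    profiles (loss = -1, normal = 0, gain = +1) differ."""
    return sum(x != y for x, y in zip(a, b)) / len(a)

def average_linkage(profiles, k):
    """Agglomerative clustering into k groups, merging at each step the
    two groups with the smallest average pairwise dissimilarity."""
    clusters = [[i] for i in range(len(profiles))]
    while len(clusters) > k:
        best = None
        for (i, ci), (j, cj) in combinations(enumerate(clusters), 2):
            d = sum(mismatch(profiles[p], profiles[q])
                    for p in ci for q in cj) / (len(ci) * len(cj))
            if best is None or d < best[0]:
                best = (d, i, j)
        _, i, j = best
        clusters[i] += clusters[j]   # i < j, so deleting j keeps i valid
        del clusters[j]
    labels = [0] * len(profiles)
    for lab, members in enumerate(clusters):
        for m in members:
            labels[m] = lab
    return labels

def rand_index(l1, l2):
    """Fraction of sample pairs on which two labelings agree
    (both together or both apart)."""
    pairs = list(combinations(range(len(l1)), 2))
    agree = sum((l1[a] == l1[b]) == (l2[a] == l2[b]) for a, b in pairs)
    return agree / len(pairs)

def stability(profiles, k, n_pairs=20, frac=0.8, seed=0):
    """Mean Rand index between clusterings of two random subsamples,
    compared on the samples the two subsamples have in common."""
    rng = random.Random(seed)
    n = len(profiles)
    m = min(n, max(k + 1, int(frac * n)))
    scores = []
    for _ in range(n_pairs):
        s1 = rng.sample(range(n), m)
        s2 = rng.sample(range(n), m)
        lab1 = dict(zip(s1, average_linkage([profiles[i] for i in s1], k)))
        lab2 = dict(zip(s2, average_linkage([profiles[i] for i in s2], k)))
        shared = sorted(set(s1) & set(s2))
        if len(shared) < 2:
            continue
        scores.append(rand_index([lab1[i] for i in shared],
                                 [lab2[i] for i in shared]))
    return sum(scores) / len(scores)

# Toy data (hypothetical): two groups of six called profiles each.
base_a = [1, 1, 1, 1, 0, 0, 0, 0]      # gains on the first four regions
base_b = [0, 0, 0, 0, -1, -1, -1, -1]  # losses on the last four regions
group_a, group_b = [], []
for i in range(6):
    a = list(base_a); a[i] = 0
    b = list(base_b); b[i] = 0
    group_a.append(a)
    group_b.append(b)
profiles = group_a + group_b

for k in (2, 3, 4):
    print(k, stability(profiles, k))
```

A score close to 1 means that the partition into k classes is reproduced across subsamples. Comparing scores across values of k, or across different choices of representation, dissimilarity and linkage, gives an informal counterpart to the formal comparison described above.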

AVAILABILITY

The software is available from http://bioinfo.curie.fr/projects/cgh-clustering. It has also been partly integrated into "Visualization and analysis of array-CGH" (VAMP) [5]. The data sets used are publicly available from ACTuDB [6].

Similar articles

Stability-based comparison of class discovery methods for DNA copy number profiles.

PLoS One. 2013 Dec 5;8(12):e81458. doi: 10.1371/journal.pone.0081458. eCollection 2013.

ACTuDB, a new database for the integrated analysis of array-CGH and clinical data for tumors.

Oncogene. 2007 Oct 11;26(46):6641-52. doi: 10.1038/sj.onc.1210488. Epub 2007 May 14.

VAMP: visualization and analysis of array-CGH, transcriptome and other molecular profiles.

Bioinformatics. 2006 Sep 1;22(17):2066-73. doi: 10.1093/bioinformatics/btl359. Epub 2006 Jul 4.

Distance-based clustering of CGH data.

Bioinformatics. 2006 Aug 15;22(16):1971-8. doi: 10.1093/bioinformatics/btl185. Epub 2006 May 16.

CAPweb: a bioinformatics CGH array Analysis Platform.

Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W477-81. doi: 10.1093/nar/gkl215.

Smoothing waves in array CGH tumor profiles.

Bioinformatics. 2009 May 1;25(9):1099-104. doi: 10.1093/bioinformatics/btp132. Epub 2009 Mar 10.

Evaluation of copy number variation detection for a SNP array platform.

BMC Bioinformatics. 2014 Feb 21;15:50. doi: 10.1186/1471-2105-15-50.

The use of ultra-dense array CGH analysis for the discovery of micro-copy number alterations and gene fusions in the cancer genome.

BMC Med Genomics. 2011 Jan 27;4:16. doi: 10.1186/1755-8794-4-16.

Spatial normalization of array-CGH data.

BMC Bioinformatics. 2006 May 22;7:264. doi: 10.1186/1471-2105-7-264.

Algorithms for calling gains and losses in array CGH data.

Methods Mol Biol. 2009;556:99-116. doi: 10.1007/978-1-60327-192-9_8.

Cited by

Unsupervised Algorithms for Microarray Sample Stratification.

Methods Mol Biol. 2022;2401:121-146. doi: 10.1007/978-1-0716-1839-4_9.

References

Normalized, segmented or called aCGH data?

Cancer Inform. 2007 Sep 17;3:321-7.

CGHregions: dimension reduction for array CGH data with minimal information loss.

Cancer Inform. 2007 Feb 8;3:55-63.

Copy number alterations that predict metastatic capability of human breast cancer.

Cancer Res. 2009 May 1;69(9):3795-801. doi: 10.1158/0008-5472.CAN-08-4596. Epub 2009 Mar 31.

Genomic profiling and identification of high-risk uveal melanoma by array CGH analysis of primary tumors and liver metastases.

Invest Ophthalmol Vis Sci. 2009 Jun;50(6):2572-80. doi: 10.1167/iovs.08-2296. Epub 2009 Jan 17.

Weighted clustering of called array CGH data.

Biostatistics. 2008 Jul;9(3):484-500. doi: 10.1093/biostatistics/kxm048. Epub 2007 Dec 22.

SIRAC: Supervised Identification of Regions of Aberration in aCGH datasets.

BMC Bioinformatics. 2007 Oct 30;8:422. doi: 10.1186/1471-2105-8-422.

ACTuDB, a new database for the integrated analysis of array-CGH and clinical data for tumors.

Oncogene. 2007 Oct 11;26(46):6641-52. doi: 10.1038/sj.onc.1210488. Epub 2007 May 14.

Model order selection for bio-molecular data clustering.

BMC Bioinformatics. 2007 May 3;8 Suppl 2(Suppl 2):S7. doi: 10.1186/1471-2105-8-S2-S7.

Markers improve clustering of CGH data.

Bioinformatics. 2007 Feb 15;23(4):450-7. doi: 10.1093/bioinformatics/btl624. Epub 2006 Dec 6.

Computational approaches to analysis of DNA microarray data.

Yearb Med Inform. 2006:91-103.
