Genomic sweeping for hypermethylated genes.

Author information

Goh Liang, Murphy Susan K, Muhkerjee Sayan, Furey Terrence S

Affiliation

Institute for Genome Sciences Policy, Duke University, USA.

Publication information

Bioinformatics. 2007 Feb 1;23(3):281-8. doi: 10.1093/bioinformatics/btl620. Epub 2006 Dec 5.

DOI:10.1093/bioinformatics/btl620

PMID:17148511

Abstract

MOTIVATION

Genes silenced by the aberrant methylation of nearby CpG islands can contribute to the onset or progression of cancer and represent potential biomarkers for diagnosis and prognosis. Relatively few have thus far been validated as hypermethylated in cancer among over 14,000 candidates with promoter region CpG islands. A descriptive set of genes known to be unmethylated in cancer does not exist. This lack of a negative set and a large number of candidates necessitated the development of a new approach to identify novel genes hypermethylated in cancer.

RESULTS

We developed a general method, cluster_boost, that in an imbalanced data setting predicts new minority class members given limited known samples and a large set of unlabeled samples. Synthetic datasets modeled after the hypermethylated genes data show that cluster_boost can successfully identify minority samples within unlabeled data. Using genome sequence features, cluster_boost predicted candidate hypermethylated genes among 14,000 genes of unknown status. In primary ovarian cancers, we determined the methylation status for 15 genes with different levels of support for being hypermethylated. Results indicate cluster_boost can accurately identify novel genes hypermethylated in cancer.
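The problem setting described above (a few known minority samples, a large unlabeled pool, and no confirmed negatives) can be illustrated with a toy example. The following is a minimal sketch only: it is not cluster_boost, whose algorithm is not specified in this abstract. It swaps in a simple centroid-based ranking on synthetic 2-D data, and every function name, parameter, and data distribution in it is a hypothetical chosen for illustration.

```python
import random


def make_synthetic(n_known=20, n_hidden=30, n_majority=950, seed=0):
    """Build a toy imbalanced dataset (all sizes and distributions are assumptions).

    Returns a small list of known minority points and a shuffled unlabeled
    pool of (point, is_minority) pairs. The flag is kept only for evaluation.
    """
    rng = random.Random(seed)

    def draw(center, n):
        return [(rng.gauss(center, 1.0), rng.gauss(center, 1.0)) for _ in range(n)]

    known_pos = draw(3.0, n_known)    # labeled minority samples
    hidden = draw(3.0, n_hidden)      # minority samples hidden in the pool
    majority = draw(0.0, n_majority)  # majority samples, never labeled
    unlabeled = [(p, True) for p in hidden] + [(p, False) for p in majority]
    rng.shuffle(unlabeled)
    return known_pos, unlabeled


def centroid(points):
    """Coordinate-wise mean of a list of equal-length tuples."""
    n = len(points)
    return tuple(sum(c) / n for c in zip(*points))


def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def rank_unlabeled(known_pos, unlabeled_points):
    """Rank unlabeled points from most to least minority-like.

    Score = (squared distance to the unlabeled-pool centroid)
          - (squared distance to the known-minority centroid).
    The pool centroid stands in for the missing negative set, which is
    reasonable only because the pool is dominated by the majority class.
    """
    pos_c = centroid(known_pos)
    bg_c = centroid(unlabeled_points)
    scores = [sq_dist(p, bg_c) - sq_dist(p, pos_c) for p in unlabeled_points]
    return sorted(range(len(unlabeled_points)), key=lambda i: scores[i], reverse=True)


def precision_at_k(order, truth, k):
    """Fraction of the top-k ranked points that are truly minority."""
    return sum(truth[i] for i in order[:k]) / k


if __name__ == "__main__":
    known_pos, unlabeled = make_synthetic()
    points = [p for p, _ in unlabeled]
    truth = [t for _, t in unlabeled]
    order = rank_unlabeled(known_pos, points)
    print("precision@30:", precision_at_k(order, truth, 30))
```

The sketch uses the unlabeled pool itself as a background model because no negative set exists, which mirrors the constraint stated in the Motivation section. With real data, the 2-D points would be replaced by genome sequence feature vectors, and the ranking step by the published method.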

AVAILABILITY

Software and datasets are freely available at http://labs.genome.duke.edu/FureyLab/cluster_boost.php.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Similar articles

Predicting methylation status of CpG islands in the human brain.

Bioinformatics. 2006 Sep 15;22(18):2204-9. doi: 10.1093/bioinformatics/btl377. Epub 2006 Jul 12.

PRIMEGENS-v2: genome-wide primer design for analyzing DNA methylation patterns of CpG islands.

Bioinformatics. 2008 Sep 1;24(17):1837-42. doi: 10.1093/bioinformatics/btn320. Epub 2008 Jun 25.

The CpG island methylator phenotype correlates with long-range epigenetic silencing in colorectal cancer.

Mol Cancer Res. 2008 Apr;6(4):585-91. doi: 10.1158/1541-7786.MCR-07-2158.

Detecting novel hypermethylated genes in breast cancer benefiting from feature selection.

Comput Biol Med. 2010 Feb;40(2):159-67. doi: 10.1016/j.compbiomed.2009.11.012. Epub 2009 Dec 31.

Formamide as a denaturant for bisulfite conversion of genomic DNA: Bisulfite sequencing of the GSTPi and RARbeta2 genes of 43 formalin-fixed paraffin-embedded prostate cancer specimens.

Anal Biochem. 2009 Sep 15;392(2):117-25. doi: 10.1016/j.ab.2009.06.001. Epub 2009 Jun 6.

DNA motifs associated with aberrant CpG island methylation.

Genomics. 2006 May;87(5):572-9. doi: 10.1016/j.ygeno.2005.12.016. Epub 2006 Feb 17.

Methylated genes as new cancer biomarkers.

Eur J Cancer. 2009 Feb;45(3):335-46. doi: 10.1016/j.ejca.2008.12.008. Epub 2009 Jan 12.

CpG island mapping by epigenome prediction.

PLoS Comput Biol. 2007 Jun;3(6):e110. doi: 10.1371/journal.pcbi.0030110. Epub 2007 May 2.

Identification of PRTFDC1 silencing and aberrant promoter methylation of GPR150, ITGA8 and HOXD11 in ovarian cancers.

Life Sci. 2007 Mar 27;80(16):1458-65. doi: 10.1016/j.lfs.2007.01.015. Epub 2007 Jan 20.

Cited by

Changes in Methylation across Structural and MicroRNA Genes Relevant for Progression and Metastasis in Colorectal Cancer.

Cancers (Basel). 2021 Nov 26;13(23):5951. doi: 10.3390/cancers13235951.

Methylation of PRDM2, PRDM5 and PRDM16 genes in lung cancer cells.

Int J Clin Exp Pathol. 2014 Apr 15;7(5):2305-11. eCollection 2014.

Diagnostic and prognostic utility of a DNA hypermethylated gene signature in prostate cancer.

PLoS One. 2014 Mar 13;9(3):e91666. doi: 10.1371/journal.pone.0091666. eCollection 2014.

Empirical bayes model comparisons for differential methylation analysis.

Comp Funct Genomics. 2012;2012:376706. doi: 10.1155/2012/376706. Epub 2012 Aug 22.

A novel k-mer mixture logistic regression for methylation susceptibility modeling of CpG dinucleotides in human gene promoters.

BMC Bioinformatics. 2012 Mar 21;13 Suppl 3(Suppl 3):S15. doi: 10.1186/1471-2105-13-S3-S15.

Linking genome to epigenome.

Wiley Interdiscip Rev Syst Biol Med. 2012 May-Jun;4(3):297-309. doi: 10.1002/wsbm.1165. Epub 2012 Feb 17.

Correlating CpG islands, motifs, and sequence variants in human chromosome 21.

BMC Genomics. 2011;12 Suppl 2(Suppl 2):S10. doi: 10.1186/1471-2164-12-S2-S10. Epub 2011 Jul 27.

SPG20, a novel biomarker for early detection of colorectal cancer, encodes a regulator of cytokinesis.

Oncogene. 2011 Sep 15;30(37):3967-78. doi: 10.1038/onc.2011.109. Epub 2011 Apr 18.

Epigenetic suppression of the TGF-beta pathway revealed by transcriptome profiling in ovarian cancer.

Genome Res. 2011 Jan;21(1):74-82. doi: 10.1101/gr.108803.110. Epub 2010 Dec 14.

Cancer DNA methylation: molecular mechanisms and clinical implications.

Clin Cancer Res. 2009 Jun 15;15(12):3927-37. doi: 10.1158/1078-0432.CCR-08-2784. Epub 2009 Jun 9.
