Suppr超能文献

使用单核苷酸多态性(SNP)芯片和基因表达芯片在骨髓增生异常综合征模型中鉴定含SNP的调控基序。

Identification of SNP-containing regulatory motifs in the myelodysplastic syndromes model using SNP arrays and gene expression arrays.

作者信息

Fan Jing, Dy Jennifer G, Chang Chung-Che, Zhou Xiaobo

机构信息

Department of Electrical and Computer Engineering, Northeastern University, Boston, MA 02115, USA.

出版信息

Chin J Cancer. 2013 Apr;32(4):170-85. doi: 10.5732/cjc.012.10113. Epub 2013 Jan 18.

Abstract

Myelodysplastic syndromes have increased in frequency and incidence in the American population, but patient prognosis has not significantly improved over the last decade. Such improvements could be realized if biomarkers for accurate diagnosis and prognostic stratification were successfully identified. In this study, we propose a method that associates two state-of-the-art array technologies--single nucleotide polymor-phism(SNP) array and gene expression array--with gene motifs considered transcription factor-binding sites (TFBS). We are particularly interested in SNP-containing motifs introduced by genetic variation and mutation as TFBS. The potential regulation of SNP-containing motifs affects only when certain mutations occur. These motifs can be identified from a group of co-expressed genes with copy number variation. Then, we used a sliding window to identify motif candidates near SNPs on gene sequences. The candidates were filtered by coarse thresholding and fine statistical testing. Using the regression-based LARS-EN algorithm and a level-wise sequence combination procedure, we identified 28 SNP-containing motifs as candidate TFBS. We confirmed 21 of the 28 motifs with ChIP-chip fragments in the TRANSFAC database. Another six motifs were validated by TRANSFAC via searching binding fragments on co-regulated genes. The identified motifs and their location genes can be considered potential biomarkers for myelodysplastic syndromes. Thus, our proposed method, a novel strategy for associating two data categories, is capable of integrating information from different sources to identify reliable candidate regulatory SNP-containing motifs introduced by genetic variation and mutation.

摘要

骨髓增生异常综合征在美国人群中的发病频率和发病率均有所上升,但在过去十年中患者的预后并未得到显著改善。如果能够成功识别出用于准确诊断和预后分层的生物标志物,就有可能实现这种改善。在本研究中,我们提出了一种方法,将两种最先进的阵列技术——单核苷酸多态性(SNP)阵列和基因表达阵列——与被视为转录因子结合位点(TFBS)的基因基序相关联。我们特别关注由遗传变异和突变引入的作为TFBS的含SNP基序。含SNP基序的潜在调控仅在某些突变发生时才会产生影响。这些基序可以从一组具有拷贝数变异的共表达基因中识别出来。然后,我们使用滑动窗口在基因序列上的SNP附近识别基序候选物。通过粗阈值筛选和精细统计测试对候选物进行过滤。使用基于回归的LARS-EN算法和逐级序列组合程序,我们确定了28个含SNP基序作为候选TFBS。我们在TRANSFAC数据库中用ChIP-chip片段证实了28个基序中的21个。另外六个基序通过TRANSFAC在共调控基因上搜索结合片段得到了验证。所识别的基序及其定位基因可被视为骨髓增生异常综合征的潜在生物标志物。因此,我们提出的方法,一种关联两种数据类别的新策略,能够整合来自不同来源的信息,以识别由遗传变异和突变引入的可靠候选调控含SNP基序。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dacf/3845573/62ad9029cbcc/cjc-32-04-170-g001.jpg

相似文献

2
Meta-analysis discovery of tissue-specific DNA sequence motifs from mammalian gene expression data.
BMC Bioinformatics. 2006 Apr 27;7:229. doi: 10.1186/1471-2105-7-229.
4
SNP arrays: comparing diagnostic yields for four platforms in children with developmental delay.
BMC Med Genomics. 2014 Dec 24;7:70. doi: 10.1186/s12920-014-0070-0.
5
Dynamic variable selection in SNP genotype autocalling from APEX microarray data.
BMC Bioinformatics. 2006 Nov 30;7:521. doi: 10.1186/1471-2105-7-521.
6
The use of cytogenetic microarrays in myelodysplastic syndrome characterization.
Methods Mol Biol. 2013;973:69-85. doi: 10.1007/978-1-62703-281-0_5.
8
SNiPer-HD: improved genotype calling accuracy by an expectation-maximization algorithm for high-density SNP arrays.
Bioinformatics. 2007 Jan 1;23(1):57-63. doi: 10.1093/bioinformatics/btl536. Epub 2006 Oct 24.
9
A genotype calling algorithm for affymetrix SNP arrays.
Bioinformatics. 2006 Jan 1;22(1):7-12. doi: 10.1093/bioinformatics/bti741. Epub 2005 Nov 2.

引用本文的文献

1
Identification and characteristic analysis of enhancers across 13 major cancer types.
Precis Clin Med. 2021 Aug 2;4(3):204-208. doi: 10.1093/pcmedi/pbab019. eCollection 2021 Sep.
2
3
Cancer bioinformatics: detection of chromatin states, SNP-containing motifs, and functional enrichment modules.
Chin J Cancer. 2013 Apr;32(4):153-4. doi: 10.5732/cjc.013.10045. Epub 2013 Apr 1.

本文引用的文献

1
Will Chinese ovarian cancer patients benefit from knowing the BRCA2 mutation status?
Chin J Cancer. 2012 Jan;31(1):1-4. doi: 10.5732/cjc.011.10432. Epub 2011 Dec 16.
3
Association between p21 Ser31Arg polymorphism and cancer risk: a meta-analysis.
Chin J Cancer. 2011 Apr;30(4):254-63. doi: 10.5732/cjc.010.10587.
6
Mapping the genetic architecture of gene expression in human liver.
PLoS Biol. 2008 May 6;6(5):e107. doi: 10.1371/journal.pbio.0060107.
10
Chromosomal lesions and uniparental disomy detected by SNP arrays in MDS, MDS/MPD, and MDS-derived AML.
Blood. 2008 Feb 1;111(3):1534-42. doi: 10.1182/blood-2007-05-092304. Epub 2007 Oct 22.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验