从头鉴定基序可提高 ChIP-Seq 数据分析中预测转录因子结合位点的准确性。

De novo motif identification improves the accuracy of predicting transcription factor binding sites in ChIP-Seq data analysis.

机构信息

Institut Curie, 26 rue d'Ulm, Paris, France.

出版信息

Nucleic Acids Res. 2010 Jun;38(11):e126. doi: 10.1093/nar/gkq217. Epub 2010 Apr 7.

DOI:10.1093/nar/gkq217

PMID:20375099

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2887977/

Abstract

Dramatic progress in the development of next-generation sequencing technologies has enabled accurate genome-wide characterization of the binding sites of DNA-associated proteins. This technique, baptized as ChIP-Seq, uses a combination of chromatin immunoprecipitation and massively parallel DNA sequencing. Other published tools that predict binding sites from ChIP-Seq data use only positional information of mapped reads. In contrast, our algorithm MICSA (Motif Identification for ChIP-Seq Analysis) combines this source of positional information with information on motif occurrences to better predict binding sites of transcription factors (TFs). We proved the greater accuracy of MICSA with respect to several other tools by running them on datasets for the TFs NRSF, GABP, STAT1 and CTCF. We also applied MICSA on a dataset for the oncogenic TF EWS-FLI1. We discovered >2000 binding sites and two functionally different binding motifs. We observed that EWS-FLI1 can activate gene transcription when (i) its binding site is located in close proximity to the gene transcription start site (up to approximately 150 kb), and (ii) it contains a microsatellite sequence. Furthermore, we observed that sites without microsatellites can also induce regulation of gene expression--positively as often as negatively--and at much larger distances (up to approximately 1 Mb).

摘要

下一代测序技术的飞速发展使得对 DNA 相关蛋白的结合位点进行全基因组精确描绘成为可能。这种技术被称为 ChIP-Seq，它结合了染色质免疫沉淀和大规模平行 DNA 测序。其他从 ChIP-Seq 数据预测结合位点的已发表工具仅使用映射读取的位置信息。相比之下，我们的算法 MICSA（ChIP-Seq 分析中的基序识别）将这种位置信息与基序出现的信息相结合，以更好地预测转录因子 (TF) 的结合位点。我们通过在 NRSF、GABP、STAT1 和 CTCF 的 TF 数据集上运行这些工具，证明了 MICSA 相对于其他几个工具具有更高的准确性。我们还将 MICSA 应用于致癌 TF EWS-FLI1 的数据集。我们发现了 >2000 个结合位点和两个功能不同的结合基序。我们观察到，当 EWS-FLI1 的结合位点 (i) 位于基因转录起始位点附近（最多约 150kb），和 (ii) 包含微卫星序列时，它可以激活基因转录。此外，我们还观察到没有微卫星的位点也可以诱导基因表达的调控——积极的和消极的——并且在更大的距离（最多约 1Mb）。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5be/2887977/1656e64a2858/gkq217f1.jpg

相似文献

De novo motif identification improves the accuracy of predicting transcription factor binding sites in ChIP-Seq data analysis.

Nucleic Acids Res. 2010 Jun;38(11):e126. doi: 10.1093/nar/gkq217. Epub 2010 Apr 7.

Genome-wide identification of in vivo protein-DNA binding sites from ChIP-Seq data.

Nucleic Acids Res. 2008 Sep;36(16):5221-31. doi: 10.1093/nar/gkn488. Epub 2008 Aug 6.

FisherMP: fully parallel algorithm for detecting combinatorial motifs from large ChIP-seq datasets.

DNA Res. 2019 Jun 1;26(3):231-242. doi: 10.1093/dnares/dsz004.

On the detection and refinement of transcription factor binding sites using ChIP-Seq data.

Nucleic Acids Res. 2010 Apr;38(7):2154-67. doi: 10.1093/nar/gkp1180. Epub 2010 Jan 6.

SLFN11 Is a Transcriptional Target of EWS-FLI1 and a Determinant of Drug Response in Ewing Sarcoma.

Clin Cancer Res. 2015 Sep 15;21(18):4184-93. doi: 10.1158/1078-0432.CCR-14-2112. Epub 2015 Mar 16.

MEME-ChIP: motif analysis of large DNA datasets.

Bioinformatics. 2011 Jun 15;27(12):1696-7. doi: 10.1093/bioinformatics/btr189. Epub 2011 Apr 12.

Microsatellites as EWS/FLI response elements in Ewing's sarcoma.

Proc Natl Acad Sci U S A. 2008 Jul 22;105(29):10149-54. doi: 10.1073/pnas.0801073105. Epub 2008 Jul 14.

Role of ChIP-seq in the discovery of transcription factor binding sites, differential gene regulation mechanism, epigenetic marks and beyond.

Cell Cycle. 2014;13(18):2847-52. doi: 10.4161/15384101.2014.949201.

EWS-FLI1 regulates a transcriptional program in cooperation with Foxq1 in mouse Ewing sarcoma.

Cancer Sci. 2018 Sep;109(9):2907-2918. doi: 10.1111/cas.13710. Epub 2018 Jul 18.

DREME: motif discovery in transcription factor ChIP-seq data.

Bioinformatics. 2011 Jun 15;27(12):1653-9. doi: 10.1093/bioinformatics/btr261. Epub 2011 May 4.

引用本文的文献

Chimeric protein EWS::FLI1 drives cell proliferation in Ewing Sarcoma via aberrant expression of KCNN1/SK1 and dysregulation of calcium signaling.

Oncogene. 2025 Jan;44(2):79-91. doi: 10.1038/s41388-024-03199-7. Epub 2024 Nov 1.

REST Is Not Resting: REST/NRSF in Health and Disease.

Biomolecules. 2023 Oct 2;13(10):1477. doi: 10.3390/biom13101477.

Ewing sarcoma from molecular biology to the clinic.

Front Cell Dev Biol. 2023 Sep 11;11:1248753. doi: 10.3389/fcell.2023.1248753. eCollection 2023.

Unraveling Ewing Sarcoma Tumorigenesis Originating from Patient-Derived Mesenchymal Stem Cells.

Cancer Res. 2021 Oct 1;81(19):4994-5006. doi: 10.1158/0008-5472.CAN-20-3837. Epub 2021 Aug 2.

Antiparkinson Drug Benztropine Suppresses Tumor Growth, Circulating Tumor Cells, and Metastasis by Acting on SLC6A3/DAT and Reducing STAT3.

Cancers (Basel). 2020 Feb 24;12(2):523. doi: 10.3390/cancers12020523.

EWSR1-FLI1 Activation of the Cancer/Testis Antigen FATE1 Promotes Ewing Sarcoma Survival.

Mol Cell Biol. 2019 Jun 27;39(14). doi: 10.1128/MCB.00138-19. Print 2019 Jul 15.

Computer-aided prediction of antigen presenting cell modulators for designing peptide-based vaccine adjuvants.

J Transl Med. 2018 Jul 3;16(1):181. doi: 10.1186/s12967-018-1560-1.

Ritornello: high fidelity control-free chromatin immunoprecipitation peak calling.

Nucleic Acids Res. 2017 Dec 1;45(21):e173. doi: 10.1093/nar/gkx799.

Analysis of Genomic Sequence Motifs for Deciphering Transcription Factor Binding and Transcriptional Regulation in Eukaryotic Cells.

Front Genet. 2016 Feb 23;7:24. doi: 10.3389/fgene.2016.00024. eCollection 2016.

RNA:DNA hybrids in the human genome have distinctive nucleotide characteristics, chromatin composition, and transcriptional relationships.

Epigenetics Chromatin. 2015 Nov 16;8:46. doi: 10.1186/s13072-015-0040-6. eCollection 2015.

本文引用的文献

The oncogenic EWS-FLI1 protein binds in vivo GGAA microsatellite sequences with potential transcriptional activation function.

PLoS One. 2009;4(3):e4932. doi: 10.1371/journal.pone.0004932. Epub 2009 Mar 23.

Genome-wide analysis of transcription factor binding sites based on ChIP-Seq data.

Nat Methods. 2008 Sep;5(9):829-34. doi: 10.1038/nmeth.1246.

PeakSeq enables systematic scoring of ChIP-seq experiments relative to controls.

Nat Biotechnol. 2009 Jan;27(1):66-75. doi: 10.1038/nbt.1518. Epub 2009 Jan 4.

Empirical methods for controlling false positives and estimating confidence in ChIP-Seq peaks.

BMC Bioinformatics. 2008 Dec 5;9:523. doi: 10.1186/1471-2105-9-523.

Design and analysis of ChIP-seq experiments for DNA-binding proteins.

Nat Biotechnol. 2008 Dec;26(12):1351-9. doi: 10.1038/nbt.1508. Epub 2008 Nov 16.

An integrated software system for analyzing ChIP-chip and ChIP-seq data.

Nat Biotechnol. 2008 Nov;26(11):1293-300. doi: 10.1038/nbt.1505. Epub 2008 Nov 2.

Model-based analysis of ChIP-Seq (MACS).

Genome Biol. 2008;9(9):R137. doi: 10.1186/gb-2008-9-9-r137. Epub 2008 Sep 17.

F-Seq: a feature density estimator for high-throughput sequence tags.

Bioinformatics. 2008 Nov 1;24(21):2537-8. doi: 10.1093/bioinformatics/btn480. Epub 2008 Sep 10.

Mapping short DNA sequencing reads and calling variants using mapping quality scores.

Genome Res. 2008 Nov;18(11):1851-8. doi: 10.1101/gr.078212.108. Epub 2008 Aug 19.

Genome-wide identification of in vivo protein-DNA binding sites from ChIP-Seq data.

Nucleic Acids Res. 2008 Sep;36(16):5221-31. doi: 10.1093/nar/gkn488. Epub 2008 Aug 6.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

从头鉴定基序可提高 ChIP-Seq 数据分析中预测转录因子结合位点的准确性。

De novo motif identification improves the accuracy of predicting transcription factor binding sites in ChIP-Seq data analysis.

机构信息

Institut Curie, 26 rue d'Ulm, Paris, France.

出版信息

Nucleic Acids Res. 2010 Jun;38(11):e126. doi: 10.1093/nar/gkq217. Epub 2010 Apr 7.

DOI:10.1093/nar/gkq217

PMID:20375099

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2887977/

Abstract

摘要

从头鉴定基序可提高 ChIP-Seq 数据分析中预测转录因子结合位点的准确性。

De novo motif identification improves the accuracy of predicting transcription factor binding sites in ChIP-Seq data analysis.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

从头鉴定基序可提高 ChIP-Seq 数据分析中预测转录因子结合位点的准确性。

De novo motif identification improves the accuracy of predicting transcription factor binding sites in ChIP-Seq data analysis.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献