• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

AIControl:用机器学习替代匹配对照实验可提高 ChIP-seq 峰识别。

AIControl: replacing matched control experiments with machine learning improves ChIP-seq peak identification.

机构信息

Paul G. Allen School of Computer Science and Engineering, University of Washington, WA, USA, 98195-2350.

出版信息

Nucleic Acids Res. 2019 Jun 4;47(10):e58. doi: 10.1093/nar/gkz156.

DOI:10.1093/nar/gkz156
PMID:30869146
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6547432/
Abstract

ChIP-seq is a technique to determine binding locations of transcription factors, which remains a central challenge in molecular biology. Current practice is to use a 'control' dataset to remove background signals from a immunoprecipitation (IP) 'target' dataset. We introduce the AIControl framework, which eliminates the need to obtain a control dataset and instead identifies binding peaks by estimating the distributions of background signals from many publicly available control ChIP-seq datasets. We thereby avoid the cost of running control experiments while simultaneously increasing the accuracy of binding location identification. Specifically, AIControl can (i) estimate background signals at fine resolution, (ii) systematically weigh the most appropriate control datasets in a data-driven way, (iii) capture sources of potential biases that may be missed by one control dataset and (iv) remove the need for costly and time-consuming control experiments. We applied AIControl to 410 IP datasets in the ENCODE ChIP-seq database, using 440 control datasets from 107 cell types to impute background signal. Without using matched control datasets, AIControl identified peaks that were more enriched for putative binding sites than those identified by other popular peak callers that used a matched control dataset. We also demonstrated that our framework identifies binding sites that recover documented protein interactions more accurately.

摘要

ChIP-seq 是一种用于确定转录因子结合位置的技术,这在分子生物学中仍然是一个核心挑战。目前的做法是使用“对照”数据集来去除免疫沉淀 (IP) “靶”数据集的背景信号。我们引入了 AIControl 框架,该框架无需获取对照数据集,而是通过从许多公开可用的对照 ChIP-seq 数据集中估计背景信号的分布来识别结合峰。这样,我们在避免运行对照实验成本的同时,还提高了结合位置识别的准确性。具体来说,AIControl 可以 (i) 以精细的分辨率估计背景信号,(ii) 以数据驱动的方式系统地权衡最合适的对照数据集,(iii) 捕捉可能被一个对照数据集遗漏的潜在偏差源,以及 (iv) 无需进行昂贵且耗时的对照实验。我们将 AIControl 应用于 ENCODE ChIP-seq 数据库中的 410 个 IP 数据集,使用来自 107 种细胞类型的 440 个对照数据集来估算背景信号。在不使用匹配对照数据集的情况下,AIControl 识别出的峰比其他使用匹配对照数据集的流行峰调用者识别出的峰更富集假定的结合位点。我们还证明了我们的框架可以更准确地识别出可恢复已记录蛋白相互作用的结合位点。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05e0/6547432/a1b0aac053d3/gkz156fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05e0/6547432/dda19aa2829d/gkz156fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05e0/6547432/1e9f7ec78dc9/gkz156fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05e0/6547432/7a6f48c99760/gkz156fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05e0/6547432/f7b780bf38d8/gkz156fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05e0/6547432/634f0ff9aed4/gkz156fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05e0/6547432/a1b0aac053d3/gkz156fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05e0/6547432/dda19aa2829d/gkz156fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05e0/6547432/1e9f7ec78dc9/gkz156fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05e0/6547432/7a6f48c99760/gkz156fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05e0/6547432/f7b780bf38d8/gkz156fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05e0/6547432/634f0ff9aed4/gkz156fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05e0/6547432/a1b0aac053d3/gkz156fig6.jpg

相似文献

1
AIControl: replacing matched control experiments with machine learning improves ChIP-seq peak identification.AIControl:用机器学习替代匹配对照实验可提高 ChIP-seq 峰识别。
Nucleic Acids Res. 2019 Jun 4;47(10):e58. doi: 10.1093/nar/gkz156.
2
A novel statistical method for quantitative comparison of multiple ChIP-seq datasets.一种用于多个ChIP-seq数据集定量比较的新型统计方法。
Bioinformatics. 2015 Jun 15;31(12):1889-96. doi: 10.1093/bioinformatics/btv094. Epub 2015 Feb 13.
3
Collaborative Completion of Transcription Factor Binding Profiles via Local Sensitive Unified Embedding.通过局部敏感统一嵌入实现转录因子结合谱的协同完成
IEEE Trans Nanobioscience. 2016 Dec;15(8):946-958. doi: 10.1109/TNB.2016.2625823. Epub 2016 Nov 7.
4
Ritornello: high fidelity control-free chromatin immunoprecipitation peak calling.利托内洛:高保真无对照染色质免疫沉淀峰检测
Nucleic Acids Res. 2017 Dec 1;45(21):e173. doi: 10.1093/nar/gkx799.
5
FisherMP: fully parallel algorithm for detecting combinatorial motifs from large ChIP-seq datasets.FisherMP:一种用于从大型 ChIP-seq 数据集中检测组合基序的完全并行算法。
DNA Res. 2019 Jun 1;26(3):231-242. doi: 10.1093/dnares/dsz004.
6
Unified Analysis of Multiple ChIP-Seq Datasets.多个 ChIP-Seq 数据集的统一分析。
Methods Mol Biol. 2021;2198:451-465. doi: 10.1007/978-1-0716-0876-0_33.
7
Characterising ChIP-seq binding patterns by model-based peak shape deconvolution.基于模型的峰形反卷积分析 ChIP-seq 结合模式。
BMC Genomics. 2013 Nov 26;14(1):834. doi: 10.1186/1471-2164-14-834.
8
ChIP-BIT: Bayesian inference of target genes using a novel joint probabilistic model of ChIP-seq profiles.ChIP-BIT:使用一种新型的ChIP-seq图谱联合概率模型对靶基因进行贝叶斯推断。
Nucleic Acids Res. 2016 Apr 20;44(7):e65. doi: 10.1093/nar/gkv1491. Epub 2015 Dec 23.
9
Direct ChIP-Seq significance analysis improves target prediction.直接的染色质免疫沉淀测序(ChIP-Seq)显著性分析可改善靶点预测。
BMC Genomics. 2015;16 Suppl 5(Suppl 5):S4. doi: 10.1186/1471-2164-16-S5-S4. Epub 2015 May 26.
10
ChIPulate: A comprehensive ChIP-seq simulation pipeline.ChIPulate:一个全面的 ChIP-seq 模拟管道。
PLoS Comput Biol. 2019 Mar 21;15(3):e1006921. doi: 10.1371/journal.pcbi.1006921. eCollection 2019 Mar.

引用本文的文献

1
Exploitation of epigenetic variation of crop wild relatives for crop improvement and agrobiodiversity preservation.利用作物野生近缘种的表观遗传变异进行作物改良和农业生物多样性保护。
Theor Appl Genet. 2022 Nov;135(11):3987-4003. doi: 10.1007/s00122-022-04122-y. Epub 2022 Jun 9.
2
AI applications in functional genomics.人工智能在功能基因组学中的应用。
Comput Struct Biotechnol J. 2021 Oct 11;19:5762-5790. doi: 10.1016/j.csbj.2021.10.009. eCollection 2021.
3
Genome-Wide Chromatin Analysis of FFPE Tissues Using a Dual-Arm Robot with Clinical Potential.

本文引用的文献

1
JASPAR 2018: update of the open-access database of transcription factor binding profiles and its web framework.JASPAR 2018:转录因子结合谱的开放获取数据库及其网络框架的更新。
Nucleic Acids Res. 2018 Jan 4;46(D1):D260-D266. doi: 10.1093/nar/gkx1126.
2
A graphical model approach visualizes regulatory relationships between genome-wide transcription factor binding profiles.图形模型方法直观地展示了全基因组转录因子结合谱之间的调控关系。
Brief Bioinform. 2018 Jan 1;19(1):162-173. doi: 10.1093/bib/bbw102.
3
Features that define the best ChIP-seq peak calling algorithms.
使用具有临床潜力的双臂机器人对福尔马林固定石蜡包埋组织进行全基因组染色质分析。
Cancers (Basel). 2021 Apr 28;13(9):2126. doi: 10.3390/cancers13092126.
4
F-Seq2: improving the feature density based peak caller with dynamic statistics.F-Seq2:利用动态统计改进基于特征密度的峰检测工具
NAR Genom Bioinform. 2021 Feb 23;3(1):lqab012. doi: 10.1093/nargab/lqab012. eCollection 2021 Mar.
5
WACS: improving ChIP-seq peak calling by optimally weighting controls.WACS:通过最优加权对照来提高 ChIP-seq 峰调用。
BMC Bioinformatics. 2021 Feb 15;22(1):69. doi: 10.1186/s12859-020-03927-2.
6
RECAP reveals the true statistical significance of ChIP-seq peak calls.RECAP 揭示了 ChIP-seq 峰调用的真实统计意义。
Bioinformatics. 2019 Oct 1;35(19):3592-3598. doi: 10.1093/bioinformatics/btz150.
7
Machine Learning for Integrating Data in Biology and Medicine: Principles, Practice, and Opportunities.用于整合生物学和医学数据的机器学习:原理、实践与机遇
Inf Fusion. 2019 Oct;50:71-91. doi: 10.1016/j.inffus.2018.09.012. Epub 2018 Sep 21.
定义最佳ChIP-seq峰检测算法的特征。
Brief Bioinform. 2017 May 1;18(3):441-450. doi: 10.1093/bib/bbw035.
4
ChromNet: Learning the human chromatin network from all ENCODE ChIP-seq data.ChromNet:从所有ENCODE ChIP-seq数据中学习人类染色质网络。
Genome Biol. 2016 Apr 30;17:82. doi: 10.1186/s13059-016-0925-0.
5
A Statistical Framework for the Analysis of ChIP-Seq Data.用于ChIP-Seq数据分析的统计框架
J Am Stat Assoc. 2011;106(495):891-903. doi: 10.1198/jasa.2011.ap09706. Epub 2012 Jan 24.
6
BIDCHIPS: bias decomposition and removal from ChIP-seq data clarifies true binding signal and its functional correlates.BIDCHIPS:从ChIP-seq数据中进行偏差分解和去除,可阐明真实的结合信号及其功能相关性。
Epigenetics Chromatin. 2015 Sep 17;8:33. doi: 10.1186/s13072-015-0028-2. eCollection 2015.
7
ChIPmentation: fast, robust, low-input ChIP-seq for histones and transcription factors.染色质免疫沉淀片段化测序法(ChIPmentation):用于组蛋白和转录因子的快速、稳健、低输入量的染色质免疫沉淀测序技术
Nat Methods. 2015 Oct;12(10):963-965. doi: 10.1038/nmeth.3542. Epub 2015 Aug 17.
8
Zinc finger independent genome-wide binding of Sp2 potentiates recruitment of histone-fold protein Nf-y distinguishing it from Sp1 and Sp3.Sp2不依赖锌指的全基因组结合增强了组蛋白折叠蛋白Nf-y的募集,使其区别于Sp1和Sp3。
PLoS Genet. 2015 Mar 20;11(3):e1005102. doi: 10.1371/journal.pgen.1005102. eCollection 2015 Mar.
9
Integrative analysis of 111 reference human epigenomes.111 个人类参考基因组的综合分析。
Nature. 2015 Feb 19;518(7539):317-30. doi: 10.1038/nature14248.
10
Large-scale imputation of epigenomic datasets for systematic annotation of diverse human tissues.用于多种人类组织系统注释的表观基因组数据集的大规模插补
Nat Biotechnol. 2015 Apr;33(4):364-76. doi: 10.1038/nbt.3157. Epub 2015 Feb 18.