• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

具有可变邻域的自组织映射促进了与调控元件相关的染色质可及性信号形状的学习。

Self-organizing maps with variable neighborhoods facilitate learning of chromatin accessibility signal shapes associated with regulatory elements.

机构信息

Department of Biomedical Informatics, The Ohio State University College of Medicine, 370 W. 9th Avenue, Columbus, OH, 43210, USA.

Department of Computer Science and Engineering, The Ohio State University College of Engineering, 2015 Neil Avenue, Columbus, OH, 43210, USA.

出版信息

BMC Bioinformatics. 2021 Jan 30;22(1):35. doi: 10.1186/s12859-021-03976-1.

DOI:10.1186/s12859-021-03976-1
PMID:33516170
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7847148/
Abstract

BACKGROUND

Assigning chromatin states genome-wide (e.g. promoters, enhancers, etc.) is commonly performed to improve functional interpretation of these states. However, computational methods to assign chromatin state suffer from the following drawbacks: they typically require data from multiple assays, which may not be practically feasible to obtain, and they depend on peak calling algorithms, which require careful parameterization and often exclude the majority of the genome. To address these drawbacks, we propose a novel learning technique built upon the Self-Organizing Map (SOM), Self-Organizing Map with Variable Neighborhoods (SOM-VN), to learn a set of representative shapes from a single, genome-wide, chromatin accessibility dataset to associate with a chromatin state assignment in which a particular RE is prevalent. These shapes can then be used to assign chromatin state using our workflow.

RESULTS

We validate the performance of the SOM-VN workflow on 14 different samples of varying quality, namely one assay each of A549 and GM12878 cell lines and two each of H1 and HeLa cell lines, primary B-cells, and brain, heart, and stomach tissue. We show that SOM-VN learns shapes that are (1) non-random, (2) associated with known chromatin states, (3) generalizable across sets of chromosomes, and (4) associated with magnitude and multimodality. We compare the accuracy of SOM-VN chromatin states against the Clustering Aggregation Tool (CAGT), an unsupervised method that learns chromatin accessibility signal shapes but does not associate these shapes with REs, and we show that overall precision and recall is increased when learning shapes using SOM-VN as compared to CAGT. We further compare enhancer state assignments from SOM-VN in signals above a set threshold to enhancer state assignments from Predicting Enhancers from ATAC-seq Data (PEAS), a deep learning method that assigns enhancer chromatin states to peaks. We show that the precision-recall area under the curve for the assignment of enhancer states is comparable to PEAS.

CONCLUSIONS

Our work shows that the SOM-VN workflow can learn relationships between REs and chromatin accessibility signal shape, which is an important step toward the goal of assigning and comparing enhancer state across multiple experiments and phenotypic states.

摘要

背景

对全基因组的染色质状态(例如启动子、增强子等)进行分配通常用于提高对这些状态的功能解释。 然而,分配染色质状态的计算方法存在以下缺点:它们通常需要来自多个测定的数据集,这在实际中可能无法获得,并且它们依赖于峰调用算法,这些算法需要仔细的参数化,并且通常排除基因组的大部分。 为了解决这些缺点,我们提出了一种基于自组织图(SOM)的新学习技术,该技术基于自组织图具有可变邻域(SOM-VN),从单个全基因组染色质可及性数据集中学习一组代表性形状,以关联与特定 RE 普遍存在的染色质状态分配。 然后可以使用我们的工作流程使用这些形状来分配染色质状态。

结果

我们在 14 个不同质量的样本上验证了 SOM-VN 工作流程的性能,即 A549 和 GM12878 细胞系的每种测定各一个,H1 和 HeLa 细胞系的每种测定各两个,以及原代 B 细胞和脑,心脏和胃组织。 我们表明,SOM-VN 学习的形状是(1)非随机的,(2)与已知的染色质状态相关,(3)可在染色体组之间推广,以及(4)与幅度和多模态相关。 我们将 SOM-VN 染色质状态的准确性与聚类聚合工具(CAGT)进行了比较,CAGT 是一种学习染色质可及性信号形状但不将这些形状与 RE 相关联的无监督方法,我们表明,与 CAGT 相比,使用 SOM-VN 学习形状可以提高整体精度和召回率。 我们进一步将 SOM-VN 在设定阈值以上的信号中的增强子状态分配与从 ATAC-seq 数据预测增强子(PEAS)的增强子状态分配进行了比较,PEAS 是一种将增强子染色质状态分配给峰的深度学习方法。 我们表明,分配增强子状态的精度-召回曲线下面积与 PEAS 相当。

结论

我们的工作表明,SOM-VN 工作流程可以学习 RE 和染色质可及性信号形状之间的关系,这是在多个实验和表型状态之间分配和比较增强子状态的目标的重要步骤。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7554/7847148/98a177f8f519/12859_2021_3976_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7554/7847148/20aed94f80c0/12859_2021_3976_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7554/7847148/833169913884/12859_2021_3976_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7554/7847148/46008a434711/12859_2021_3976_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7554/7847148/98a177f8f519/12859_2021_3976_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7554/7847148/20aed94f80c0/12859_2021_3976_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7554/7847148/833169913884/12859_2021_3976_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7554/7847148/46008a434711/12859_2021_3976_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7554/7847148/98a177f8f519/12859_2021_3976_Fig4_HTML.jpg

相似文献

1
Self-organizing maps with variable neighborhoods facilitate learning of chromatin accessibility signal shapes associated with regulatory elements.具有可变邻域的自组织映射促进了与调控元件相关的染色质可及性信号形状的学习。
BMC Bioinformatics. 2021 Jan 30;22(1):35. doi: 10.1186/s12859-021-03976-1.
2
Integrating and mining the chromatin landscape of cell-type specificity using self-organizing maps.利用自组织映射整合和挖掘细胞类型特异性的染色质景观。
Genome Res. 2013 Dec;23(12):2136-48. doi: 10.1101/gr.158261.113. Epub 2013 Oct 29.
3
Differential analysis of chromatin accessibility and histone modifications for predicting mouse developmental enhancers.预测小鼠发育增强子的染色质可及性和组蛋白修饰的差异分析。
Nucleic Acids Res. 2018 Nov 30;46(21):11184-11201. doi: 10.1093/nar/gky753.
4
A neural network based model effectively predicts enhancers from clinical ATAC-seq samples.基于神经网络的模型可有效预测临床 ATAC-seq 样本中的增强子。
Sci Rep. 2018 Oct 30;8(1):16048. doi: 10.1038/s41598-018-34420-9.
5
Efficient chromatin accessibility mapping in situ by nucleosome-tethered tagmentation.通过核小体连接的标签酶切技术进行高效的染色质可及性原位作图。
Elife. 2020 Nov 16;9:e63274. doi: 10.7554/eLife.63274.
6
Inferring time series chromatin states for promoter-enhancer pairs based on Hi-C data.基于 Hi-C 数据推断启动子-增强子对的时序染色质状态。
BMC Genomics. 2021 Jan 28;22(1):84. doi: 10.1186/s12864-021-07373-z.
7
Enhancer prediction in the human genome by probabilistic modelling of the chromatin feature patterns.通过对染色质特征模式的概率建模来预测人类基因组中的增强子。
BMC Bioinformatics. 2020 Jul 20;21(1):317. doi: 10.1186/s12859-020-03621-3.
8
HiCoP, a simple and robust method for detecting interactions of regulatory regions.HiCoP,一种简单而强大的检测调控区域相互作用的方法。
Epigenetics Chromatin. 2020 Jul 1;13(1):27. doi: 10.1186/s13072-020-00348-6.
9
Ubiquitous heterogeneity and asymmetry of the chromatin environment at regulatory elements.染色质环境在调控元件处的普遍异质性和不对称性。
Genome Res. 2012 Sep;22(9):1735-47. doi: 10.1101/gr.136366.111.
10
Genome-wide analysis of chromatin accessibility using ATAC-seq.使用ATAC-seq进行全基因组染色质可及性分析。
Methods Cell Biol. 2019;151:219-235. doi: 10.1016/bs.mcb.2018.11.002. Epub 2018 Dec 21.

本文引用的文献

1
TERT promoter mutations and telomeres during tumorigenesis.端粒酶启动子突变与肿瘤发生过程中的端粒
Curr Opin Genet Dev. 2020 Feb;60:56-62. doi: 10.1016/j.gde.2020.02.001. Epub 2020 Mar 9.
2
Neurofibromin Is an Estrogen Receptor-α Transcriptional Co-repressor in Breast Cancer.神经纤维瘤蛋白是乳腺癌中雌激素受体-α的转录共抑制子。
Cancer Cell. 2020 Mar 16;37(3):387-402.e7. doi: 10.1016/j.ccell.2020.02.003. Epub 2020 Mar 5.
3
Urinary TERT promoter mutations are detectable up to 10 years prior to clinical diagnosis of bladder cancer: Evidence from the Golestan Cohort Study.
尿路上皮 TERT 启动子突变可在膀胱癌临床诊断前 10 年检测到:来自戈勒斯坦队列研究的证据。
EBioMedicine. 2020 Mar;53:102643. doi: 10.1016/j.ebiom.2020.102643. Epub 2020 Feb 17.
4
Hypomethylation of C1q/tumor necrosis factor-related protein-1 promoter region in whole blood and risks for coronary artery aneurysms in Kawasaki disease.全血中C1q/肿瘤坏死因子相关蛋白-1启动子区域的低甲基化与川崎病冠状动脉瘤的风险
Int J Cardiol. 2020 May 15;307:159-163. doi: 10.1016/j.ijcard.2020.02.002. Epub 2020 Feb 3.
5
Familial analysis reveals rare risk variants for migraine in regulatory regions.家族分析揭示了调节区域中偏头痛的罕见风险变异。
Neurogenetics. 2020 Jul;21(3):149-157. doi: 10.1007/s10048-020-00606-5. Epub 2020 Feb 19.
6
Analysis of , , and as potential disease severity modifiers in spinal muscular atrophy.分析、和作为脊髓性肌萎缩症潜在疾病严重程度调节因子的情况。
Neurol Genet. 2019 Jan 3;6(1):e386. doi: 10.1212/NXG.0000000000000386. eCollection 2020 Feb.
7
Chromatin accessibility and the regulatory epigenome.染色质可及性和调控表观基因组。
Nat Rev Genet. 2019 Apr;20(4):207-220. doi: 10.1038/s41576-018-0089-8.
8
Classifying human promoters by occupancy patterns identifies recurring sequence elements, combinatorial binding, and spatial interactions.通过占据模式对人类启动子进行分类可识别出重复的序列元件、组合结合和空间相互作用。
BMC Biol. 2018 Nov 15;16(1):138. doi: 10.1186/s12915-018-0585-5.
9
A neural network based model effectively predicts enhancers from clinical ATAC-seq samples.基于神经网络的模型可有效预测临床 ATAC-seq 样本中的增强子。
Sci Rep. 2018 Oct 30;8(1):16048. doi: 10.1038/s41598-018-34420-9.
10
Ten principles of heterochromatin formation and function.异染色质形成和功能的十个原则。
Nat Rev Mol Cell Biol. 2018 Apr;19(4):229-244. doi: 10.1038/nrm.2017.119. Epub 2017 Dec 13.