Singh Ritambhara, Lanchantin Jack, Sekhon Arshdeep, Qi Yanjun
Department of Computer Science, University of Virginia.
Adv Neural Inf Process Syst. 2017 Dec;30:6785-6795.
The past decade has seen a revolution in genomic technologies that enabled a flood of genome-wide profiling of chromatin marks. Recent literature tried to understand gene regulation by predicting gene expression from large-scale chromatin measurements. Two fundamental challenges exist for such learning tasks: (1) genome-wide chromatin signals are spatially structured, high-dimensional and highly modular; and (2) the core aim is to understand what the relevant factors are and how they work together. Previous studies either failed to model complex dependencies among input signals or relied on separate feature analysis to explain the decisions. This paper presents an attention-based deep learning approach, AttentiveChrome, that uses a unified architecture to model and to interpret dependencies among chromatin factors for controlling gene regulation. AttentiveChrome uses a hierarchy of multiple Long Short-Term Memory (LSTM) modules to encode the input signals and to model how various chromatin marks cooperate automatically. AttentiveChrome trains two levels of attention jointly with the target prediction, enabling it to attend differentially to relevant marks and to locate important positions per mark. We evaluate the model across 56 different cell types (tasks) in humans. Not only is the proposed architecture more accurate, but its attention scores provide a better interpretation than state-of-the-art feature visualization methods such as saliency maps.
过去十年见证了基因组技术的一场革命,这场革命使得大量全基因组染色质标记谱得以实现。近期的文献试图通过从大规模染色质测量中预测基因表达来理解基因调控。此类学习任务存在两个基本挑战:(1)全基因组染色质信号具有空间结构、高维度且高度模块化;(2)核心目标是理解相关因素是什么以及它们如何共同起作用。先前的研究要么未能对输入信号之间的复杂依赖关系进行建模,要么依赖于单独的特征分析来解释决策。本文提出了一种基于注意力的深度学习方法AttentiveChrome,该方法使用统一架构对染色质因子之间的依赖关系进行建模和解释,以控制基因调控。AttentiveChrome使用多个长短期记忆(LSTM)模块的层次结构来编码输入信号,并对各种染色质标记如何自动协作进行建模。AttentiveChrome与目标预测一起联合训练两个层次的注意力,使其能够对相关标记进行差异化关注,并定位每个标记的重要位置。我们在人类的56种不同细胞类型(任务)上评估了该模型。所提出的架构不仅更准确,而且其注意力得分比诸如显著性图等现有最先进的特征可视化方法提供了更好的解释。