Meuleman Wouter, Muratov Alexander, Rynes Eric, Halow Jessica, Lee Kristen, Bates Daniel, Diegel Morgan, Dunn Douglas, Neri Fidencio, Teodosiadis Athanasios, Reynolds Alex, Haugen Eric, Nelson Jemma, Johnson Audra, Frerker Mark, Buckley Michael, Sandstrom Richard, Vierstra Jeff, Kaul Rajinder, Stamatoyannopoulos John
Altius Institute for Biomedical Sciences, Seattle, WA, USA.
Department of Genome Sciences, University of Washington, Seattle, WA, USA.
Nature. 2020 Aug;584(7820):244-251. doi: 10.1038/s41586-020-2559-3. Epub 2020 Jul 29.
DNase I hypersensitive sites (DHSs) are generic markers of regulatory DNA and contain genetic variations associated with diseases and phenotypic traits. We created high-resolution maps of DHSs from 733 human biosamples encompassing 438 cell and tissue types and states, and integrated these to delineate and numerically index approximately 3.6 million DHSs within the human genome sequence, providing a common coordinate system for regulatory DNA. Here we show that these maps highly resolve the cis-regulatory compartment of the human genome, which encodes unexpectedly diverse cell- and tissue-selective regulatory programs at very high density. These programs can be captured comprehensively by a simple vocabulary that enables the assignment to each DHS of a regulatory barcode that encapsulates its tissue manifestations, and global annotation of protein-coding and non-coding RNA genes in a manner orthogonal to gene expression. Finally, we show that sharply resolved DHSs markedly enhance the genetic association and heritability signals of diseases and traits. Rather than being confined to a small number of distal elements or promoters, we find that genetic signals converge on congruently regulated sets of DHSs that decorate entire gene bodies. Together, our results create a universal, extensible coordinate system and vocabulary for human regulatory DNA marked by DHSs, and provide a new global perspective on the architecture of human gene regulation.
脱氧核糖核酸酶I超敏位点(DHSs)是调控性DNA的通用标记,包含与疾病和表型特征相关的遗传变异。我们从733份人类生物样本中创建了DHSs的高分辨率图谱,这些样本涵盖438种细胞和组织类型及状态,并将这些图谱整合起来,在人类基因组序列中描绘并数字化索引了约360万个DHSs,为调控性DNA提供了一个通用坐标系。我们在此表明,这些图谱高度解析了人类基因组的顺式调控区室,该区室以非常高的密度编码了意想不到的多种细胞和组织选择性调控程序。这些程序可以通过一个简单的词汇表全面捕获,该词汇表能够为每个DHS分配一个调控条形码,该条形码概括了其组织表现,并以一种与基因表达正交的方式对蛋白质编码和非编码RNA基因进行全局注释。最后,我们表明,清晰解析的DHSs显著增强了疾病和性状的遗传关联及遗传力信号。我们发现,遗传信号并非局限于少数远端元件或启动子,而是汇聚在装饰整个基因体的协调调控的DHSs集合上。总之,我们的结果为以DHSs标记的人类调控性DNA创建了一个通用、可扩展的坐标系和词汇表,并为人类基因调控的架构提供了一个新的全局视角。