Suppr超能文献

合成细胞类型特异性调控元件的机器引导设计

Machine-guided design of synthetic cell type-specific -regulatory elements.

作者信息

Gosai S J, Castro R I, Fuentes N, Butts J C, Kales S, Noche R R, Mouri K, Sabeti P C, Reilly S K, Tewhey R

机构信息

Broad Institute of MIT and Harvard, Cambridge, MA, USA.

Harvard Graduate Program in Biological and Biomedical Science, Boston MA.

出版信息

bioRxiv. 2023 Aug 9:2023.08.08.552077. doi: 10.1101/2023.08.08.552077.

Abstract

-regulatory elements (CREs) control gene expression, orchestrating tissue identity, developmental timing, and stimulus responses, which collectively define the thousands of unique cell types in the body. While there is great potential for strategically incorporating CREs in therapeutic or biotechnology applications that require tissue specificity, there is no guarantee that an optimal CRE for an intended purpose has arisen naturally through evolution. Here, we present a platform to engineer and validate synthetic CREs capable of driving gene expression with programmed cell type specificity. We leverage innovations in deep neural network modeling of CRE activity across three cell types, efficient optimization, and massively parallel reporter assays (MPRAs) to design and empirically test thousands of CREs. Through and validation, we show that synthetic sequences outperform natural sequences from the human genome in driving cell type-specific expression. Synthetic sequences leverage unique sequence syntax to promote activity in the on-target cell type and simultaneously reduce activity in off-target cells. Together, we provide a generalizable framework to prospectively engineer CREs and demonstrate the required literacy to write regulatory code that is fit-for-purpose across vertebrates.

摘要

调控元件(CREs)控制基因表达,协调组织特性、发育时间和刺激反应,这些共同定义了体内数千种独特的细胞类型。虽然在需要组织特异性的治疗或生物技术应用中战略性地整合CREs具有巨大潜力,但不能保证针对特定目的的最佳CRE是通过进化自然产生的。在此,我们展示了一个平台,用于设计和验证能够以编程的细胞类型特异性驱动基因表达的合成CREs。我们利用跨三种细胞类型的CRE活性深度神经网络建模、高效优化和大规模平行报告基因检测(MPRAs)方面的创新,来设计并实证测试数千个CREs。通过设计和验证,我们表明合成序列在驱动细胞类型特异性表达方面优于人类基因组中的天然序列。合成序列利用独特的序列语法来促进在靶细胞类型中的活性,同时降低在非靶细胞中的活性。我们共同提供了一个可推广的框架,用于前瞻性地设计CREs,并展示编写适用于整个脊椎动物的调控代码所需的能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b7e4/10441439/e55bfb675f4b/nihpp-2023.08.08.552077v1-f0001.jpg

相似文献

1
Machine-guided design of synthetic cell type-specific -regulatory elements.
bioRxiv. 2023 Aug 9:2023.08.08.552077. doi: 10.1101/2023.08.08.552077.
2
Machine-guided design of cell-type-targeting cis-regulatory elements.
Nature. 2024 Oct;634(8036):1211-1220. doi: 10.1038/s41586-024-08070-z. Epub 2024 Oct 23.
4
Massively parallel cis-regulatory analysis in the mammalian central nervous system.
Genome Res. 2016 Feb;26(2):238-55. doi: 10.1101/gr.193789.115. Epub 2015 Nov 17.
6
Sequence-based correction of barcode bias in massively parallel reporter assays.
Genome Res. 2021 Sep;31(9):1638-1645. doi: 10.1101/gr.268599.120. Epub 2021 Jul 20.
7
Designing realistic regulatory DNA with autoregressive language models.
Genome Res. 2024 Oct 11;34(9):1411-1420. doi: 10.1101/gr.279142.124.
10
Identification, Design, and Application of Noncoding Cis-Regulatory Elements.
Biomolecules. 2024 Aug 5;14(8):945. doi: 10.3390/biom14080945.

本文引用的文献

1
Hold out the genome: a roadmap to solving the cis-regulatory code.
Nature. 2024 Jan;625(7993):41-50. doi: 10.1038/s41586-023-06661-w. Epub 2023 Dec 13.
2
LegNet: a best-in-class deep learning model for short DNA regulatory regions.
Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad457.
3
The functional and evolutionary impacts of human-specific deletions in conserved elements.
Science. 2023 Apr 28;380(6643):eabn2253. doi: 10.1126/science.abn2253.
4
A -regulatory lexicon of DNA motif combinations mediating cell-type-specific gene regulation.
Cell Genom. 2022 Nov 9;2(11). doi: 10.1016/j.xgen.2022.100191. Epub 2022 Oct 5.
5
maxATAC: Genome-scale transcription-factor binding prediction from ATAC-seq with deep neural networks.
PLoS Comput Biol. 2023 Jan 31;19(1):e1010863. doi: 10.1371/journal.pcbi.1010863. eCollection 2023 Jan.
7
Controlling gene expression with deep generative design of regulatory DNA.
Nat Commun. 2022 Aug 30;13(1):5099. doi: 10.1038/s41467-022-32818-8.
8
Machine learning sequence prioritization for cell type-specific enhancer design.
Elife. 2022 May 16;11:e69571. doi: 10.7554/eLife.69571.
9
DeepSTARR predicts enhancer activity from DNA sequence and enables the de novo design of synthetic enhancers.
Nat Genet. 2022 May;54(5):613-624. doi: 10.1038/s41588-022-01048-5. Epub 2022 May 12.
10
RSAT 2022: regulatory sequence analysis tools.
Nucleic Acids Res. 2022 Jul 5;50(W1):W670-W676. doi: 10.1093/nar/gkac312.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验