• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过深度生成式调控 DNA 设计来控制基因表达。

Controlling gene expression with deep generative design of regulatory DNA.

机构信息

Department of Biology and Biological Engineering, Chalmers University of Technology, Kemivägen 10, SE41296, Gothenburg, Sweden.

Department of Biotechnology and Systems Biology, National Institute of Biology, Večna pot 111, SI1000, Ljubljana, Slovenia.

出版信息

Nat Commun. 2022 Aug 30;13(1):5099. doi: 10.1038/s41467-022-32818-8.

DOI:10.1038/s41467-022-32818-8
PMID:36042233
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9427793/
Abstract

Design of de novo synthetic regulatory DNA is a promising avenue to control gene expression in biotechnology and medicine. Using mutagenesis typically requires screening sizable random DNA libraries, which limits the designs to span merely a short section of the promoter and restricts their control of gene expression. Here, we prototype a deep learning strategy based on generative adversarial networks (GAN) by learning directly from genomic and transcriptomic data. Our ExpressionGAN can traverse the entire regulatory sequence-expression landscape in a gene-specific manner, generating regulatory DNA with prespecified target mRNA levels spanning the whole gene regulatory structure including coding and adjacent non-coding regions. Despite high sequence divergence from natural DNA, in vivo measurements show that 57% of the highly-expressed synthetic sequences surpass the expression levels of highly-expressed natural controls. This demonstrates the applicability and relevance of deep generative design to expand our knowledge and control of gene expression regulation in any desired organism, condition or tissue.

摘要

从头设计合成调控 DNA 是生物技术和医学中控制基因表达的一种很有前途的方法。使用诱变通常需要筛选相当大的随机 DNA 文库,这限制了设计只能跨越启动子的一小段,限制了它们对基因表达的控制。在这里,我们通过直接从基因组和转录组数据中学习,提出了一种基于生成对抗网络(GAN)的深度学习策略。我们的 ExpressionGAN 可以以基因特异性的方式遍历整个调控序列-表达景观,生成具有预定目标 mRNA 水平的调控 DNA,这些 DNA 跨越整个基因调控结构,包括编码区和相邻的非编码区。尽管与天然 DNA 的序列差异很大,但体内测量表明,57%的高表达合成序列超过了高表达天然对照的表达水平。这证明了深度生成设计在任何所需的生物体、条件或组织中扩展我们对基因表达调控的知识和控制的适用性和相关性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/590e/9427793/d6a3524ef76a/41467_2022_32818_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/590e/9427793/42af0d7ce188/41467_2022_32818_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/590e/9427793/eaf0269af7a7/41467_2022_32818_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/590e/9427793/057a041ba0f6/41467_2022_32818_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/590e/9427793/2ffd1b8f87bc/41467_2022_32818_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/590e/9427793/0bdbd591b781/41467_2022_32818_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/590e/9427793/d6a3524ef76a/41467_2022_32818_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/590e/9427793/42af0d7ce188/41467_2022_32818_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/590e/9427793/eaf0269af7a7/41467_2022_32818_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/590e/9427793/057a041ba0f6/41467_2022_32818_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/590e/9427793/2ffd1b8f87bc/41467_2022_32818_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/590e/9427793/0bdbd591b781/41467_2022_32818_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/590e/9427793/d6a3524ef76a/41467_2022_32818_Fig6_HTML.jpg

相似文献

1
Controlling gene expression with deep generative design of regulatory DNA.通过深度生成式调控 DNA 设计来控制基因表达。
Nat Commun. 2022 Aug 30;13(1):5099. doi: 10.1038/s41467-022-32818-8.
2
De Novo Peptide and Protein Design Using Generative Adversarial Networks: An Update.使用生成对抗网络的从头肽和蛋白质设计:最新进展
J Chem Inf Model. 2022 Feb 28;62(4):761-774. doi: 10.1021/acs.jcim.1c01361. Epub 2022 Feb 7.
3
Genome-wide prediction of cis-regulatory regions using supervised deep learning methods.基于监督深度学习方法的全基因组顺式调控区预测。
BMC Bioinformatics. 2018 May 31;19(1):202. doi: 10.1186/s12859-018-2187-1.
4
iEnhancer-GAN: A Deep Learning Framework in Combination with Word Embedding and Sequence Generative Adversarial Net to Identify Enhancers and Their Strength.iEnhancer-GAN:一种结合词嵌入和序列生成对抗网络以识别增强子及其强度的深度学习框架。
Int J Mol Sci. 2021 Mar 30;22(7):3589. doi: 10.3390/ijms22073589.
5
Generative Deep Learning for Targeted Compound Design.生成式深度学习在靶向化合物设计中的应用。
J Chem Inf Model. 2021 Nov 22;61(11):5343-5361. doi: 10.1021/acs.jcim.0c01496. Epub 2021 Oct 26.
6
Generative Adversarial Networks for Creating Synthetic Nucleic Acid Sequences of Cat Genome.用于创建猫基因组合成核酸序列的生成对抗网络。
Int J Mol Sci. 2022 Mar 28;23(7):3701. doi: 10.3390/ijms23073701.
7
Characterization of 954 bovine full-CDS cDNA sequences.954条牛全长编码序列(CDS)cDNA序列的特征分析
BMC Genomics. 2005 Nov 23;6:166. doi: 10.1186/1471-2164-6-166.
8
A Generative Neural Network for Maximizing Fitness and Diversity of Synthetic DNA and Protein Sequences.用于最大化合成DNA和蛋白质序列适应性与多样性的生成神经网络。
Cell Syst. 2020 Jul 22;11(1):49-62.e16. doi: 10.1016/j.cels.2020.05.007. Epub 2020 Jun 25.
9
Emergence of Peptides That Confer Antibiotic Resistance.抗生素耐药性相关肽的出现
mBio. 2019 Jun 4;10(3):e00837-19. doi: 10.1128/mBio.00837-19.
10
Comparative genomic and transcriptomic characterization of the toxigenic marine dinoflagellate Alexandrium ostenfeldii.海洋产毒甲藻亚历山大藻的比较基因组和转录组特征分析。
PLoS One. 2011;6(12):e28012. doi: 10.1371/journal.pone.0028012. Epub 2011 Dec 2.

引用本文的文献

1
De novo promoter design method based on deep generative and dynamic evolution algorithm.基于深度生成与动态进化算法的从头启动子设计方法
Nucleic Acids Res. 2025 Aug 27;53(16). doi: 10.1093/nar/gkaf833.
2
Computational nanobody design through deep generative modeling and epitope landscape profiling.通过深度生成建模和表位景观分析进行计算纳米抗体设计。
Comput Struct Biotechnol J. 2025 Jul 30;27:3443-3455. doi: 10.1016/j.csbj.2025.07.052. eCollection 2025.
3
UTRGAN: learning to generate 5' UTR sequences for optimized translation efficiency and gene expression.

本文引用的文献

1
The evolution, evolvability and engineering of gene regulatory DNA.基因调控 DNA 的进化、可进化性与工程。
Nature. 2022 Mar;603(7901):455-463. doi: 10.1038/s41586-022-04506-6. Epub 2022 Mar 9.
2
De novo protein design by deep network hallucination.基于深度网络幻觉的从头设计蛋白质。
Nature. 2021 Dec;600(7889):547-552. doi: 10.1038/s41586-021-04184-w. Epub 2021 Dec 1.
3
Effects of sequence motifs in the yeast 3' untranslated region determined from massively parallel assays of random sequences.从大规模平行随机序列测定中确定的酵母 3'非翻译区序列模体的影响。
UTRGAN:学习生成5'非翻译区序列以优化翻译效率和基因表达。
Bioinform Adv. 2025 Jun 10;5(1):vbaf134. doi: 10.1093/bioadv/vbaf134. eCollection 2025.
4
De novo design of insulated cis-regulatory elements based on deep learning-predicted fitness landscape.基于深度学习预测的适应度景观对绝缘顺式调控元件进行从头设计。
Nucleic Acids Res. 2025 Jun 20;53(12). doi: 10.1093/nar/gkaf611.
5
Polygraph: a software framework for the systematic assessment of synthetic regulatory DNA elements.多聚体记录仪:用于系统评估合成调控DNA元件的软件框架。
Genome Biol. 2025 May 6;26(1):114. doi: 10.1186/s13059-025-03584-9.
6
Improving the generalization of protein expression models with mechanistic sequence information.利用机制性序列信息提高蛋白质表达模型的泛化能力。
Nucleic Acids Res. 2025 Jan 24;53(3). doi: 10.1093/nar/gkaf020.
7
Synthetic promoter design in based on multinomial diffusion model.基于多项式扩散模型的合成启动子设计。
iScience. 2024 Oct 18;27(11):111207. doi: 10.1016/j.isci.2024.111207. eCollection 2024 Nov 15.
8
The design and engineering of synthetic genomes.合成基因组的设计与工程
Nat Rev Genet. 2025 May;26(5):298-319. doi: 10.1038/s41576-024-00786-y. Epub 2024 Nov 6.
9
Machine-guided design of cell-type-targeting cis-regulatory elements.机器引导的细胞类型靶向顺式调控元件设计。
Nature. 2024 Oct;634(8036):1211-1220. doi: 10.1038/s41586-024-08070-z. Epub 2024 Oct 23.
10
Artificial intelligence and machine learning applications for cultured meat.用于培养肉的人工智能和机器学习应用。
Front Artif Intell. 2024 Sep 24;7:1424012. doi: 10.3389/frai.2024.1424012. eCollection 2024.
Genome Biol. 2021 Oct 18;22(1):293. doi: 10.1186/s13059-021-02509-6.
4
Synthetic promoter designs enabled by a comprehensive analysis of plant core promoters.通过对植物核心启动子的全面分析,实现了合成启动子的设计。
Nat Plants. 2021 Jun;7(6):842-855. doi: 10.1038/s41477-021-00932-y. Epub 2021 Jun 3.
5
DIVERSIFY: A Fungal Multispecies Gene Expression Platform.DIVERSIFY:真菌多物种基因表达平台。
ACS Synth Biol. 2021 Mar 19;10(3):579-588. doi: 10.1021/acssynbio.0c00587. Epub 2021 Mar 2.
6
Deep learning suggests that gene expression is encoded in all parts of a co-evolving interacting gene regulatory structure.深度学习表明,基因表达是由共同进化的相互作用基因调控结构的所有部分编码的。
Nat Commun. 2020 Dec 1;11(1):6141. doi: 10.1038/s41467-020-19921-4.
7
The InterPro protein families and domains database: 20 years on.The InterPro 蛋白质家族和结构域数据库:20 年的发展历程。
Nucleic Acids Res. 2021 Jan 8;49(D1):D344-D354. doi: 10.1093/nar/gkaa977.
8
Exploring the synthetic biology potential of bacteriophages for engineering non-model bacteria.探索噬菌体在工程非模型细菌中的合成生物学潜力。
Nat Commun. 2020 Oct 20;11(1):5294. doi: 10.1038/s41467-020-19124-x.
9
Rational design of minimal synthetic promoters for plants.植物最小合成启动子的合理设计。
Nucleic Acids Res. 2020 Dec 2;48(21):11845-11856. doi: 10.1093/nar/gkaa682.
10
A Generative Neural Network for Maximizing Fitness and Diversity of Synthetic DNA and Protein Sequences.用于最大化合成DNA和蛋白质序列适应性与多样性的生成神经网络。
Cell Syst. 2020 Jul 22;11(1):49-62.e16. doi: 10.1016/j.cels.2020.05.007. Epub 2020 Jun 25.