Frontiers Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China.
Frontiers Research Institute for Synthetic Biology, Tianjin University, Tianjin 300072, China.
Nucleic Acids Res. 2023 Nov 27;51(21):11967-11979. doi: 10.1093/nar/gkad930.
Synthetic biology and deep learning synergistically revolutionize our ability for decoding and recoding DNA regulatory grammar. The B-cell-specific transcriptional regulation is intricate, and unlock the potential of B-cell-specific promoters as synthetic elements is important for B-cell engineering. Here, we designed and pooled synthesized 23 640 B-cell-specific promoters that exhibit larger sequence space, B-cell-specific expression, and enable diverse transcriptional patterns in B-cells. By MPRA (Massively parallel reporter assays), we deciphered the sequence features that regulate promoter transcriptional, including motifs and motif syntax (their combination and distance). Finally, we built and trained a deep learning model capable of predicting the transcriptional strength of the immunoglobulin V gene promoter directly from sequence. Prediction of thousands of promoter variants identified in the global human population shows that polymorphisms in promoters influence the transcription of immunoglobulin V genes, which may contribute to individual differences in adaptive humoral immune responses. Our work helps to decipher the transcription mechanism in immunoglobulin genes and offers thousands of non-similar promoters for B-cell engineering.
合成生物学和深度学习协同彻底改变了我们解码和重写 DNA 调控语法的能力。B 细胞特异性转录调控错综复杂,挖掘 B 细胞特异性启动子作为合成元件的潜力对于 B 细胞工程至关重要。在这里,我们设计并汇集了 23640 个合成的 B 细胞特异性启动子,它们具有更大的序列空间、B 细胞特异性表达,并能够在 B 细胞中实现多样化的转录模式。通过大规模平行报告基因分析 (MPRA),我们解析了调节启动子转录的序列特征,包括基序和基序语法(它们的组合和距离)。最后,我们构建并训练了一个深度学习模型,能够直接从序列预测免疫球蛋白 V 基因启动子的转录强度。对全球人类群体中鉴定的数千个启动子变体的预测表明,启动子中的多态性影响免疫球蛋白 V 基因的转录,这可能导致适应性体液免疫反应的个体差异。我们的工作有助于解析免疫球蛋白基因的转录机制,并为 B 细胞工程提供数千个非相似启动子。