• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用机器学习生成实验上不相关的靶分子结合的高功能化核酸聚合物。

Generating experimentally unrelated target molecule-binding highly functionalized nucleic-acid polymers using machine learning.

机构信息

Merkin Institute of Transformative Technologies in Healthcare, Broad Institute of Harvard and MIT, Cambridge, MA, USA.

Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA, USA.

出版信息

Nat Commun. 2022 Aug 4;13(1):4541. doi: 10.1038/s41467-022-31955-4.

DOI:10.1038/s41467-022-31955-4
PMID:35927274
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9352670/
Abstract

In vitro selection queries large combinatorial libraries for sequence-defined polymers with target binding and reaction catalysis activity. While the total sequence space of these libraries can extend beyond 10 sequences, practical considerations limit starting sequences to ≤~10 distinct molecules. Selection-induced sequence convergence and limited sequencing depth further constrain experimentally observable sequence space. To address these limitations, we integrate experimental and machine learning approaches to explore regions of sequence space unrelated to experimentally derived variants. We perform in vitro selections to discover highly side-chain-functionalized nucleic acid polymers (HFNAPs) with potent affinities for a target small molecule (daunomycin K = 5-65 nM). We then use the selection data to train a conditional variational autoencoder (CVAE) machine learning model to generate diverse and unique HFNAP sequences with high daunomycin affinities (K = 9-26 nM), even though they are unrelated in sequence to experimental polymers. Coupling in vitro selection with a machine learning model thus enables direct generation of active variants, demonstrating a new approach to the discovery of functional biopolymers.

摘要

体外选择从具有目标结合和反应催化活性的序列定义聚合物的大型组合文库中查询序列。虽然这些文库的总序列空间可以扩展到超过 10 个序列,但实际考虑因素将起始序列限制在≤~10 个不同的分子。选择诱导的序列收敛和有限的测序深度进一步限制了实验可观察到的序列空间。为了解决这些限制,我们整合了实验和机器学习方法来探索与实验衍生变体无关的序列空间区域。我们进行体外选择以发现具有强靶小分子(柔红霉素 K=5-65 nM)亲和力的高度侧链官能化核酸聚合物(HFNAP)。然后,我们使用选择数据来训练条件变分自动编码器(CVAE)机器学习模型,以生成具有高柔红霉素亲和力(K=9-26 nM)的多样化和独特的 HFNAP 序列,即使它们在序列上与实验聚合物无关。因此,将体外选择与机器学习模型相结合可以直接生成活性变体,展示了发现功能生物聚合物的新方法。

相似文献

1
Generating experimentally unrelated target molecule-binding highly functionalized nucleic-acid polymers using machine learning.利用机器学习生成实验上不相关的靶分子结合的高功能化核酸聚合物。
Nat Commun. 2022 Aug 4;13(1):4541. doi: 10.1038/s41467-022-31955-4.
2
Evolution of sequence-defined highly functionalized nucleic acid polymers.序列定义的高度功能化核酸聚合物的演变。
Nat Chem. 2018 Apr;10(4):420-427. doi: 10.1038/s41557-018-0008-9. Epub 2018 Mar 5.
3
Side chain determinants of biopolymer function during selection and replication.侧链决定生物聚合物在选择和复制过程中的功能。
Nat Chem Biol. 2019 Apr;15(4):419-426. doi: 10.1038/s41589-019-0229-2. Epub 2019 Feb 11.
4
Machine learning-assisted directed protein evolution with combinatorial libraries.机器学习辅助的组合文库定向蛋白质进化。
Proc Natl Acad Sci U S A. 2019 Apr 30;116(18):8852-8858. doi: 10.1073/pnas.1901979116. Epub 2019 Apr 12.
5
Enzyme-free translation of DNA into sequence-defined synthetic polymers structurally unrelated to nucleic acids.无酶将 DNA 翻译成与核酸在结构上无关联的序列定义的合成聚合物。
Nat Chem. 2013 Apr;5(4):282-92. doi: 10.1038/nchem.1577. Epub 2013 Mar 3.
6
Comprehensive Prediction of Molecular Recognition in a Combinatorial Chemical Space Using Machine Learning.使用机器学习全面预测组合化学空间中的分子识别。
ACS Comb Sci. 2020 Oct 12;22(10):500-508. doi: 10.1021/acscombsci.0c00003. Epub 2020 Aug 17.
7
From Drug Molecules to Thermoset Shape Memory Polymers: A Machine Learning Approach.从药物分子到热固性形状记忆聚合物:机器学习方法。
ACS Appl Mater Interfaces. 2021 Dec 22;13(50):60508-60521. doi: 10.1021/acsami.1c20947. Epub 2021 Dec 8.
8
Generation of Synthetic Copolymer Libraries by Combinatorial Assembly on Nucleic Acid Templates.通过在核酸模板上进行组合组装生成合成共聚物文库。
ACS Comb Sci. 2016 Jul 11;18(7):355-70. doi: 10.1021/acscombsci.6b00059. Epub 2016 Jun 17.
9
Evolution of Functionally Enhanced α-l-Threofuranosyl Nucleic Acid Aptamers.功能增强的α-L-呋喃核糖核酸适体的演变。
ACS Synth Biol. 2021 Nov 19;10(11):3190-3199. doi: 10.1021/acssynbio.1c00481. Epub 2021 Nov 5.
10
Enzymatic Synthesis of Sequence-Defined Synthetic Nucleic Acid Polymers with Diverse Functional Groups.用具有不同官能团的酶合成序列定义的合成核酸聚合物。
Angew Chem Int Ed Engl. 2016 Oct 10;55(42):13164-13168. doi: 10.1002/anie.201607538.

引用本文的文献

1
Sequence-selective duplex formation and template effect in recognition-encoded oligoanilines.识别编码的寡苯胺中的序列选择性双链体形成和模板效应。
Chem Sci. 2023 Jul 26;14(33):8878-8888. doi: 10.1039/d3sc00880k. eCollection 2023 Aug 23.
2
A Sensitive and Nonoptical CRISPR Detection Mechanism by Sizing Double-Stranded λ DNA Reporter.基于双链 λ DNA 报告分子大小的灵敏非光学 CRISPR 检测机制。
Angew Chem Int Ed Engl. 2022 Dec 12;61(50):e202213920. doi: 10.1002/anie.202213920. Epub 2022 Nov 10.

本文引用的文献

1
Low-N protein engineering with data-efficient deep learning.低蛋白工程与数据高效深度学习。
Nat Methods. 2021 Apr;18(4):389-396. doi: 10.1038/s41592-021-01100-y. Epub 2021 Apr 7.
2
2'-fluoro-modified pyrimidines enhance affinity of RNA oligonucleotides to HIV-1 reverse transcriptase.2'-氟修饰的嘧啶可增强 RNA 寡核苷酸对 HIV-1 逆转录酶的亲和力。
RNA. 2020 Nov;26(11):1667-1679. doi: 10.1261/rna.077008.120. Epub 2020 Jul 30.
3
Determinants of Base Editing Outcomes from Target Library Analysis and Machine Learning.从目标文库分析和机器学习角度看碱基编辑结果的决定因素。
Cell. 2020 Jul 23;182(2):463-480.e30. doi: 10.1016/j.cell.2020.05.037. Epub 2020 Jun 12.
4
A Deep Learning Approach to Antibiotic Discovery.深度学习在抗生素发现中的应用。
Cell. 2020 Feb 20;180(4):688-702.e13. doi: 10.1016/j.cell.2020.01.021.
5
A Style-Based Generator Architecture for Generative Adversarial Networks.基于风格的生成对抗网络生成器架构。
IEEE Trans Pattern Anal Mach Intell. 2021 Dec;43(12):4217-4228. doi: 10.1109/TPAMI.2020.2970919. Epub 2021 Nov 3.
6
Machine learning-guided channelrhodopsin engineering enables minimally invasive optogenetics.机器学习指导的通道蛋白工程实现微创光遗传学。
Nat Methods. 2019 Nov;16(11):1176-1184. doi: 10.1038/s41592-019-0583-8. Epub 2019 Oct 14.
7
Sequenceserver: A Modern Graphical User Interface for Custom BLAST Databases.序列服务器:用于定制 BLAST 数据库的现代图形用户界面。
Mol Biol Evol. 2019 Dec 1;36(12):2922-2924. doi: 10.1093/molbev/msz185.
8
Side chain determinants of biopolymer function during selection and replication.侧链决定生物聚合物在选择和复制过程中的功能。
Nat Chem Biol. 2019 Apr;15(4):419-426. doi: 10.1038/s41589-019-0229-2. Epub 2019 Feb 11.
9
Design of metalloproteins and novel protein folds using variational autoencoders.利用变分自动编码器设计金属蛋白和新型蛋白质折叠。
Sci Rep. 2018 Nov 1;8(1):16189. doi: 10.1038/s41598-018-34533-1.
10
Efficiency and fidelity of T3 DNA ligase in ligase-catalysed oligonucleotide polymerisations.T3 DNA 连接酶在连接酶催化的寡核苷酸聚合反应中的效率和保真度。
Org Biomol Chem. 2019 Feb 13;17(7):1962-1965. doi: 10.1039/c8ob01958d.