• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

生成式和可解释机器学习在适体设计和体外序列选择分析中的应用。

Generative and interpretable machine learning for aptamer design and analysis of in vitro sequence selection.

机构信息

Laboratoire de Physique de l'Ecole Normale Supérieure, PSL & CNRS UMR8063, Sorbonne Université, Université de Paris, Paris, France.

School of Molecular Sciences and Center for Molecular Design and Biomimetics, The Biodesign Institute, Arizona State University, Tempe, Arizona, United States of America.

出版信息

PLoS Comput Biol. 2022 Sep 29;18(9):e1010561. doi: 10.1371/journal.pcbi.1010561. eCollection 2022 Sep.

DOI:10.1371/journal.pcbi.1010561
PMID:36174101
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9553063/
Abstract

Selection protocols such as SELEX, where molecules are selected over multiple rounds for their ability to bind to a target of interest, are popular methods for obtaining binders for diagnostic and therapeutic purposes. We show that Restricted Boltzmann Machines (RBMs), an unsupervised two-layer neural network architecture, can successfully be trained on sequence ensembles from single rounds of SELEX experiments for thrombin aptamers. RBMs assign scores to sequences that can be directly related to their fitnesses estimated through experimental enrichment ratios. Hence, RBMs trained from sequence data at a given round can be used to predict the effects of selection at later rounds. Moreover, the parameters of the trained RBMs are interpretable and identify functional features contributing most to sequence fitness. To exploit the generative capabilities of RBMs, we introduce two different training protocols: one taking into account sequence counts, capable of identifying the few best binders, and another based on unique sequences only, generating more diverse binders. We then use RBMs model to generate novel aptamers with putative disruptive mutations or good binding properties, and validate the generated sequences with gel shift assay experiments. Finally, we compare the RBM's performance with different supervised learning approaches that include random forests and several deep neural network architectures.

摘要

选择方案,如 SELEX,其中分子在多个轮次中被选择,以评估它们与感兴趣的靶标的结合能力,是获得用于诊断和治疗目的的配体的流行方法。我们表明,受限玻尔兹曼机(RBM),一种无监督的两层神经网络架构,可以成功地对来自 SELEX 实验单轮的序列集合进行训练,以获得凝血酶适体。RBM 为序列分配分数,这些分数可以直接与其通过实验富集比估计的适应性相关。因此,从给定轮次的序列数据中训练的 RBM 可以用于预测后续轮次的选择效果。此外,训练后的 RBM 参数是可解释的,并确定对序列适应性贡献最大的功能特征。为了利用 RBM 的生成能力,我们引入了两种不同的训练方案:一种考虑序列计数,能够识别出少数最佳配体,另一种仅基于独特序列,生成更多样化的配体。然后,我们使用 RBM 模型生成具有潜在破坏性突变或良好结合特性的新适体,并通过凝胶迁移实验验证生成的序列。最后,我们将 RBM 的性能与包括随机森林和几种深度神经网络架构在内的不同监督学习方法进行了比较。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/9553063/64ef80299cc5/pcbi.1010561.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/9553063/d23caadd3ff4/pcbi.1010561.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/9553063/3fbf88dcb88f/pcbi.1010561.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/9553063/2195d7e4c8a8/pcbi.1010561.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/9553063/4c1f654e8472/pcbi.1010561.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/9553063/bab7108a1818/pcbi.1010561.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/9553063/0f9f7f4f7ac4/pcbi.1010561.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/9553063/d91e81327efc/pcbi.1010561.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/9553063/6cb20ff780f7/pcbi.1010561.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/9553063/64ef80299cc5/pcbi.1010561.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/9553063/d23caadd3ff4/pcbi.1010561.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/9553063/3fbf88dcb88f/pcbi.1010561.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/9553063/2195d7e4c8a8/pcbi.1010561.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/9553063/4c1f654e8472/pcbi.1010561.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/9553063/bab7108a1818/pcbi.1010561.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/9553063/0f9f7f4f7ac4/pcbi.1010561.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/9553063/d91e81327efc/pcbi.1010561.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/9553063/6cb20ff780f7/pcbi.1010561.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/9553063/64ef80299cc5/pcbi.1010561.g009.jpg

相似文献

1
Generative and interpretable machine learning for aptamer design and analysis of in vitro sequence selection.生成式和可解释机器学习在适体设计和体外序列选择分析中的应用。
PLoS Comput Biol. 2022 Sep 29;18(9):e1010561. doi: 10.1371/journal.pcbi.1010561. eCollection 2022 Sep.
2
Approximation properties of Gaussian-binary restricted Boltzmann machines and Gaussian-binary deep belief networks.高斯-二项式受限玻尔兹曼机和高斯-二项式深度置信网络的逼近性质。
Neural Netw. 2022 Sep;153:49-63. doi: 10.1016/j.neunet.2022.05.020. Epub 2022 Jun 2.
3
The emergence of a concept in shallow neural networks.浅层神经网络中概念的出现。
Neural Netw. 2022 Apr;148:232-253. doi: 10.1016/j.neunet.2022.01.017. Epub 2022 Feb 4.
4
Generative Modeling of RNA Sequence Families with Restricted Boltzmann Machines.用受限玻尔兹曼机对 RNA 序列家族进行生成建模。
Methods Mol Biol. 2025;2847:163-175. doi: 10.1007/978-1-0716-4079-1_11.
5
Accelerating deep learning with memcomputing.利用忆阻器计算加速深度学习。
Neural Netw. 2019 Feb;110:1-7. doi: 10.1016/j.neunet.2018.10.012. Epub 2018 Nov 3.
6
The Role of Architectural and Learning Constraints in Neural Network Models: A Case Study on Visual Space Coding.架构和学习约束在神经网络模型中的作用:视觉空间编码的案例研究
Front Comput Neurosci. 2017 Mar 21;11:13. doi: 10.3389/fncom.2017.00013. eCollection 2017.
7
Creating artificial human genomes using generative neural networks.使用生成式神经网络创建人工人类基因组。
PLoS Genet. 2021 Feb 4;17(2):e1009303. doi: 10.1371/journal.pgen.1009303. eCollection 2021 Feb.
8
A simple method for eliminating fixed-region interference of aptamer binding during SELEX.一种在指数富集的配体系统进化(SELEX)过程中消除适体结合固定区域干扰的简单方法。
Biotechnol Bioeng. 2014 Nov;111(11):2265-79. doi: 10.1002/bit.25294. Epub 2014 Jul 14.
9
On the effective initialisation for restricted Boltzmann machines via duality with Hopfield model.通过对偶性与 Hopfield 模型实现受限 Boltzmann 机的有效初始化。
Neural Netw. 2021 Nov;143:314-326. doi: 10.1016/j.neunet.2021.06.017. Epub 2021 Jun 23.
10
A novel artificial intelligence-based approach for identification of deoxynucleotide aptamers.一种基于新型人工智能的脱氧核苷酸适体鉴定方法。
PLoS Comput Biol. 2021 Aug 3;17(8):e1009247. doi: 10.1371/journal.pcbi.1009247. eCollection 2021 Aug.

引用本文的文献

1
Optimal sequencing depth for measuring the concentrations of molecular barcodes.用于测量分子条形码浓度的最佳测序深度。
Nucleic Acids Res. 2025 Aug 27;53(16). doi: 10.1093/nar/gkaf793.
2
Multiplexed Biomarker Detection Using DNA Payloads: Design, Assembly, and Analysis.使用DNA负载的多重生物标志物检测:设计、组装与分析
Methods Mol Biol. 2025;2901:203-226. doi: 10.1007/978-1-0716-4394-5_16.
3
Combination of Coevolutionary Information and Supervised Learning Enables Generation of Cyclic Peptide Inhibitors with Enhanced Potency from a Small Data Set.

本文引用的文献

1
Modeling Sequence-Space Exploration and Emergence of Epistatic Signals in Protein Evolution.蛋白质进化中序列空间探索和上位信号涌现的建模。
Mol Biol Evol. 2022 Jan 7;39(1). doi: 10.1093/molbev/msab321.
2
AMaLa: Analysis of Directed Evolution Experiments via Annealed Mutational Approximated Landscape.AMaLa:通过退火突变逼近景观分析定向进化实验。
Int J Mol Sci. 2021 Oct 9;22(20):10908. doi: 10.3390/ijms222010908.
3
Barriers and dynamical paths in alternating Gibbs sampling of restricted Boltzmann machines.受限玻尔兹曼机交替吉布斯采样中的障碍与动态路径
共进化信息与监督学习相结合可从小数据集生成具有更高效力的环肽抑制剂。
ACS Cent Sci. 2024 Nov 20;10(12):2242-2252. doi: 10.1021/acscentsci.4c01428. eCollection 2024 Dec 25.
4
Machine Learning for RNA Design: LEARNA.机器学习在 RNA 设计中的应用:LEARNA。
Methods Mol Biol. 2025;2847:63-93. doi: 10.1007/978-1-0716-4079-1_5.
5
Computational Frontiers in Aptamer-Based Nanomedicine for Precision Therapeutics: A Comprehensive Review.基于适配体的纳米医学用于精准治疗的计算前沿:综述
ACS Omega. 2024 Jun 10;9(25):26838-26862. doi: 10.1021/acsomega.4c02466. eCollection 2024 Jun 25.
6
Inference of annealed protein fitness landscapes with AnnealDCA.利用 AnnealDCA 推断退火蛋白质适应度景观。
PLoS Comput Biol. 2024 Feb 20;20(2):e1011812. doi: 10.1371/journal.pcbi.1011812. eCollection 2024 Feb.
7
ACIDES: on-line monitoring of forward genetic screens for protein engineering.酸:用于蛋白质工程正向遗传筛选的在线监测。
Nat Commun. 2023 Dec 26;14(1):8504. doi: 10.1038/s41467-023-43967-9.
8
Interpretable Machine Learning of Amino Acid Patterns in Proteins: A Statistical Ensemble Approach.蛋白质中氨基酸模式的可解释机器学习:一种统计集成方法。
J Chem Theory Comput. 2023 Sep 12;19(17):6011-6022. doi: 10.1021/acs.jctc.3c00383. Epub 2023 Aug 8.
9
Recent trends in RNA informatics: a review of machine learning and deep learning for RNA secondary structure prediction and RNA drug discovery.RNA 信息学的最新趋势:机器学习和深度学习在 RNA 二级结构预测和 RNA 药物发现中的应用综述。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad186.
10
Random and Natural Non-Coding RNA Have Similar Structural Motif Patterns but Differ in Bulge, Loop, and Bond Counts.随机和天然非编码RNA具有相似的结构基序模式,但在凸起、环和键的数量上有所不同。
Life (Basel). 2023 Mar 6;13(3):708. doi: 10.3390/life13030708.
Phys Rev E. 2021 Sep;104(3-1):034109. doi: 10.1103/PhysRevE.104.034109.
4
Geometric deep learning of RNA structure.RNA 结构的几何深度学习。
Science. 2021 Aug 27;373(6558):1047-1051. doi: 10.1126/science.abe5650.
5
Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.
6
Learning the Regulatory Code of Gene Expression.学习基因表达的调控密码。
Front Mol Biosci. 2021 Jun 10;8:673363. doi: 10.3389/fmolb.2021.673363. eCollection 2021.
7
Machine learning directed drug formulation development.机器学习指导药物制剂开发。
Adv Drug Deliv Rev. 2021 Aug;175:113806. doi: 10.1016/j.addr.2021.05.016. Epub 2021 May 19.
8
A synthetic RNA-based biosensor for fructose-1,6-bisphosphate that reports glycolytic flux.用于报告糖酵解通量的果糖-1,6-二磷酸的合成 RNA 基生物传感器。
Cell Chem Biol. 2021 Nov 18;28(11):1554-1568.e8. doi: 10.1016/j.chembiol.2021.04.006. Epub 2021 Apr 28.
9
A SARS-CoV-2 Spike Binding DNA Aptamer that Inhibits Pseudovirus Infection by an RBD-Independent Mechanism*.一种通过非 RBD 依赖机制抑制假病毒感染的 SARS-CoV-2 刺突结合 DNA 适体*。
Angew Chem Int Ed Engl. 2021 Apr 26;60(18):10279-10285. doi: 10.1002/anie.202100316. Epub 2021 Mar 23.
10
Deep diversification of an AAV capsid protein by machine learning.机器学习深度多样化 AAV 衣壳蛋白。
Nat Biotechnol. 2021 Jun;39(6):691-696. doi: 10.1038/s41587-020-00793-4. Epub 2021 Feb 11.