Suppr超能文献

基于保守一级序列和二级结构的机器学习定向适体搜索

Machine Learning Directed Aptamer Search from Conserved Primary Sequences and Secondary Structures.

作者信息

Perez Tobia Javier, Huang Po-Jung Jimmy, Ding Yuzhe, Saran Narayan Runjhun, Narayan Apurva, Liu Juewen

机构信息

Department of Computer Science, University of British Columbia, Kelowna, British Columbia V1V 1V7, Canada.

Department of Chemistry, Waterloo Institute for Nanotechnology, Water Institute, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada.

出版信息

ACS Synth Biol. 2023 Jan 20;12(1):186-195. doi: 10.1021/acssynbio.2c00462. Epub 2023 Jan 3.

Abstract

Computer-aided prediction of aptamer sequences has been focused on primary sequence alignment and motif comparison. We observed that many aptamers have a conserved hairpin, yet the sequence of the hairpin can be highly variable. Taking such secondary structure information into consideration, a new algorithm combining conserved primary sequences and secondary structures is developed, which combines three scores based on sequence abundance, stability, and structure, respectively. This algorithm was used in the prediction of aptamers from the caffeine and theophylline selections. In the late rounds of the selections, when the libraries were converged, the predicted sequences matched well with the most abundant sequences. When the libraries were far from convergence and the sequences were deemed challenging for traditional analysis methods, this algorithm still predicted aptamer sequences that were experimentally verified by isothermal titration calorimetry. This algorithm paves a new way to look for patterns in aptamer selection libraries and mimics the sequence evolution process. It will help shorten the aptamer selection time and promote the biosensor and chemical biology applications of aptamers.

摘要

适体序列的计算机辅助预测一直集中在一级序列比对和基序比较上。我们观察到许多适体都有一个保守的发夹结构,但其序列可能高度可变。考虑到这种二级结构信息,开发了一种结合保守一级序列和二级结构的新算法,该算法分别结合了基于序列丰度、稳定性和结构的三个分数。该算法用于从咖啡因和茶碱筛选中预测适体。在筛选的后期,当文库收敛时,预测序列与最丰富的序列匹配良好。当文库远未收敛且序列被认为对传统分析方法具有挑战性时,该算法仍然可以预测通过等温滴定量热法实验验证的适体序列。该算法为在适体选择文库中寻找模式并模拟序列进化过程开辟了一条新途径。它将有助于缩短适体选择时间,并促进适体在生物传感器和化学生物学中的应用。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验