• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

epLSAP对齐:一种基于熵正则化部分线性和分配问题公式化的非顺序蛋白质结构对齐求解器。

epLSAP-Align: a non-sequential protein structural alignment solver with entropy-regularized partial linear sum assignment problem formulation.

作者信息

Zhang Xuechen, Chen Zhuoyang, Li Junyu, Luo Qiong, Wu Longjun, Yu Weichuan

机构信息

Department of Electronic and Computational Engineering, The Hong Kong University of Science and Technology, Hong Kong SAR, China.

Data Science and Analytics Thrust, Information Hub, The Hong Kong University of Science and Technology (Guangzhou), Guangzhou, Guangdong 511400, China.

出版信息

Bioinformatics. 2025 Jun 2;41(6). doi: 10.1093/bioinformatics/btaf309.

DOI:10.1093/bioinformatics/btaf309
PMID:40394847
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12137893/
Abstract

MOTIVATION

The three-dimensional protein tertiary structure alignment is a fundamental problem that seeks insights into functions and evolution. Previous structure alignment algorithms have adopted the sequential assumption and used dynamic programming solvers. However, many distantly related structures exhibit non-sequential similarities, and non-sequential alignment tools are less efficient and accurate than sequential ones. In this paper, we formulate the non-sequential alignment as the Entropy-regularized Partial Linear Sum Assignment Problem (epLSAP) and propose a solver based on Sinkhorn algorithms, referred to as epLSAP-Align.

RESULTS

Compared with existing non-sequential alignment solvers, our epLSAP-Align can explicitly model the gap penalty, efficiently achieve global optimality and balance coverage and fidelity. We show that epLSAP-Align can be easily integrated into the existing frameworks, such as TM-align and MICAN, resulting in the non-sequential alignment tool epLSAP-TM and epLSAP-MICAN, respectively. Both epLSAP-TM and epLSAP-MICAN achieve better performance than the existing non-sequential alignment tools in terms of biologically meaningful structure overlaps on two sequential alignment test sets MALIDUP and MALISAM, and four non-sequential alignment test sets MALIDUP-ns, MALISAM-ns, 64-difficult-case and RIPC datasets. Also, compared with the most recent non-sequential alignment tool USalign2, our epLSAP-TM is at least 22% faster under the same setting.

AVAILABILITY AND IMPLEMENTATION

Our source code is available at https://github.com/xzhangem/epLSAP-align.

摘要

动机

三维蛋白质三级结构比对是一个基础性问题,旨在深入了解蛋白质的功能和进化。以往的结构比对算法采用了序列假设并使用动态规划求解器。然而,许多远缘相关结构表现出非序列相似性,并且非序列比对工具在效率和准确性方面都不如序列比对工具。在本文中,我们将非序列比对问题表述为熵正则化部分线性和分配问题(epLSAP),并提出一种基于Sinkhorn算法的求解器,称为epLSAP-Align。

结果

与现有的非序列比对求解器相比,我们的epLSAP-Align能够明确地对空位罚分进行建模,有效地实现全局最优性,并在覆盖度和保真度之间取得平衡。我们表明,epLSAP-Align可以很容易地集成到现有框架中,如TM-align和MICAN,分别产生非序列比对工具epLSAP-TM和epLSAP-MICAN。在两个序列比对测试集MALIDUP和MALISAM以及四个非序列比对测试集MALIDUP-ns、MALISAM-ns、64难例和RIPC数据集上,就具有生物学意义的结构重叠而言,epLSAP-TM和epLSAP-MICAN的性能均优于现有的非序列比对工具。此外,与最新的非序列比对工具USalign2相比,在相同设置下,我们的epLSAP-TM速度至少快22%。

可用性和实现

我们的源代码可在https://github.com/xzhangem/epLSAP-align获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5577/12137893/3cf43a5aca37/btaf309f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5577/12137893/3cf43a5aca37/btaf309f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5577/12137893/3cf43a5aca37/btaf309f1.jpg

相似文献

1
epLSAP-Align: a non-sequential protein structural alignment solver with entropy-regularized partial linear sum assignment problem formulation.epLSAP对齐:一种基于熵正则化部分线性和分配问题公式化的非顺序蛋白质结构对齐求解器。
Bioinformatics. 2025 Jun 2;41(6). doi: 10.1093/bioinformatics/btaf309.
2
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
3
Home treatment for mental health problems: a systematic review.心理健康问题的居家治疗:一项系统综述
Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.
4
Assessing the comparative effects of interventions in COPD: a tutorial on network meta-analysis for clinicians.评估慢性阻塞性肺疾病干预措施的比较效果:面向临床医生的网状Meta分析教程
Respir Res. 2024 Dec 21;25(1):438. doi: 10.1186/s12931-024-03056-x.
5
Systemic treatments for metastatic cutaneous melanoma.转移性皮肤黑色素瘤的全身治疗
Cochrane Database Syst Rev. 2018 Feb 6;2(2):CD011123. doi: 10.1002/14651858.CD011123.pub2.
6
Behavioral interventions to reduce risk for sexual transmission of HIV among men who have sex with men.降低男男性行为者中艾滋病毒性传播风险的行为干预措施。
Cochrane Database Syst Rev. 2008 Jul 16(3):CD001230. doi: 10.1002/14651858.CD001230.pub2.
7
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
8
Immunogenicity and seroefficacy of pneumococcal conjugate vaccines: a systematic review and network meta-analysis.肺炎球菌结合疫苗的免疫原性和血清效力:系统评价和网络荟萃分析。
Health Technol Assess. 2024 Jul;28(34):1-109. doi: 10.3310/YWHA3079.
9
Initial arch wires used in orthodontic treatment with fixed appliances.固定矫治器正畸治疗中使用的初始弓丝。
Cochrane Database Syst Rev. 2018 Jul 31;7(7):CD007859. doi: 10.1002/14651858.CD007859.pub4.
10
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.

本文引用的文献

1
GTalign: spatial index-driven protein structure alignment, superposition, and search.GTalign:基于空间索引的蛋白质结构比对、叠加和搜索。
Nat Commun. 2024 Aug 24;15(1):7305. doi: 10.1038/s41467-024-51669-z.
2
Clustering predicted structures at the scale of the known protein universe.对已知蛋白质宇宙尺度的预测结构进行聚类。
Nature. 2023 Oct;622(7983):637-645. doi: 10.1038/s41586-023-06510-w. Epub 2023 Sep 13.
3
Fast and accurate protein structure search with Foldseek.使用 Foldseek 进行快速准确的蛋白质结构搜索。
Nat Biotechnol. 2024 Feb;42(2):243-246. doi: 10.1038/s41587-023-01773-0. Epub 2023 May 8.
4
AlphaFold2 protein structure prediction: Implications for drug discovery.AlphaFold2 蛋白质结构预测:对药物发现的影响。
Curr Opin Struct Biol. 2023 Feb;78:102526. doi: 10.1016/j.sbi.2022.102526. Epub 2023 Jan 6.
5
A unified approach to sequential and non-sequential structure alignment of proteins, RNAs, and DNAs.一种用于蛋白质、RNA和DNA的序列和非序列结构比对的统一方法。
iScience. 2022 Sep 28;25(10):105218. doi: 10.1016/j.isci.2022.105218. eCollection 2022 Oct 21.
6
US-align: universal structure alignments of proteins, nucleic acids, and macromolecular complexes.US-align:蛋白质、核酸和大分子复合物的通用结构比对。
Nat Methods. 2022 Sep;19(9):1109-1115. doi: 10.1038/s41592-022-01585-1. Epub 2022 Aug 29.
7
Dali server: structural unification of protein families.达尔服务器:蛋白质家族的结构统一。
Nucleic Acids Res. 2022 Jul 5;50(W1):W210-W215. doi: 10.1093/nar/gkac387.
8
Structure-based protein function prediction using graph convolutional networks.基于结构的蛋白质功能预测使用图卷积网络。
Nat Commun. 2021 May 26;12(1):3168. doi: 10.1038/s41467-021-23303-9.
9
Topology-independent and global protein structure alignment through an FFT-based algorithm.基于 FFT 的算法实现拓扑无关和全局的蛋白质结构比对。
Bioinformatics. 2020 Jan 15;36(2):478-486. doi: 10.1093/bioinformatics/btz609.
10
MICAN-SQ: a sequential protein structure alignment program that is applicable to monomers and all types of oligomers.MICAN-SQ:一种适用于单体和各种类型寡聚物的顺序蛋白质结构比对程序。
Bioinformatics. 2018 Oct 1;34(19):3324-3331. doi: 10.1093/bioinformatics/bty369.