• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PairK:用于量化无序区域中蛋白质基序保守性的成对k-mer比对

PairK: Pairwise k-mer alignment for quantifying protein motif conservation in disordered regions.

作者信息

Halpin Jackson C, Keating Amy E

机构信息

MIT Department of Biology, 77 Massachusetts Ave., Cambridge, MA 02139.

MIT Department of Biological Engineering, 77 Massachusetts Ave., Cambridge, MA 02139.

出版信息

bioRxiv. 2024 Jul 24:2024.07.23.604860. doi: 10.1101/2024.07.23.604860.

DOI:10.1101/2024.07.23.604860
PMID:39091826
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11291154/
Abstract

Protein-protein interactions are often mediated by a modular peptide recognition domain binding to a short linear motif (SLiM) in the disordered region of another protein. The ability to predict domain-SLiM interactions would allow researchers to map protein interaction networks, predict the effects of perturbations to those networks, and develop biologically meaningful hypotheses. Unfortunately, sequence database searches for SLiMs generally yield mostly biologically irrelevant motif matches or false positives. To improve the prediction of novel SLiM interactions, researchers employ filters to discriminate between biologically relevant and improbable motif matches. One promising criterion for identifying biologically relevant SLiMs is the sequence conservation of the motif, exploiting the fact that functional motifs are more likely to be conserved than spurious motif matches. However, the difficulty of aligning disordered regions has significantly hampered the utility of this approach. We present PairK (pairwise k-mer alignment), an MSA-free method to quantify motif conservation in disordered regions. PairK outperforms both standard MSA-based conservation scores and a modern LLM-based conservation score predictor on the task of identifying biologically important motif instances. PairK can quantify conservation over wider phylogenetic distances than MSAs, indicating that SLiMs may be more conserved than is implied by MSA-based metrics. PairK is available as open-source code at https://github.com/jacksonh1/pairk.

摘要

蛋白质-蛋白质相互作用通常由一个模块化的肽识别结构域介导,该结构域与另一种蛋白质无序区域中的短线性基序(SLiM)结合。预测结构域-SLiM相互作用的能力将使研究人员能够绘制蛋白质相互作用网络,预测对这些网络的扰动影响,并提出具有生物学意义的假设。不幸的是,在序列数据库中搜索SLiM通常会产生大多与生物学无关的基序匹配或假阳性结果。为了改进对新型SLiM相互作用的预测,研究人员采用过滤器来区分生物学相关和不太可能的基序匹配。一种有前景的识别生物学相关SLiM的标准是基序的序列保守性,利用功能基序比虚假基序匹配更可能保守这一事实。然而,比对无序区域的困难严重阻碍了这种方法的实用性。我们提出了PairK(成对k-mer比对),一种无需多序列比对(MSA)的方法来量化无序区域中的基序保守性。在识别生物学上重要的基序实例任务中,PairK的表现优于基于标准MSA的保守性评分和基于现代语言模型的保守性评分预测器。与MSA相比,PairK可以在更广泛的系统发育距离上量化保守性,这表明SLiM可能比基于MSA的指标所暗示的更保守。PairK可在https://github.com/jacksonh1/pairk上作为开源代码获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e24/11291154/5219bbdcd4d1/nihpp-2024.07.23.604860v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e24/11291154/61b9b657ee22/nihpp-2024.07.23.604860v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e24/11291154/1d5fc731ef86/nihpp-2024.07.23.604860v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e24/11291154/071e7ffee16a/nihpp-2024.07.23.604860v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e24/11291154/ff99c5c46e8a/nihpp-2024.07.23.604860v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e24/11291154/5219bbdcd4d1/nihpp-2024.07.23.604860v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e24/11291154/61b9b657ee22/nihpp-2024.07.23.604860v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e24/11291154/1d5fc731ef86/nihpp-2024.07.23.604860v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e24/11291154/071e7ffee16a/nihpp-2024.07.23.604860v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e24/11291154/ff99c5c46e8a/nihpp-2024.07.23.604860v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e24/11291154/5219bbdcd4d1/nihpp-2024.07.23.604860v1-f0005.jpg

相似文献

1
PairK: Pairwise k-mer alignment for quantifying protein motif conservation in disordered regions.PairK:用于量化无序区域中蛋白质基序保守性的成对k-mer比对
bioRxiv. 2024 Jul 24:2024.07.23.604860. doi: 10.1101/2024.07.23.604860.
2
PairK: Pairwise k-mer alignment for quantifying protein motif conservation in disordered regions.PairK:用于量化无序区域中蛋白质基序保守性的成对k-mer比对
Protein Sci. 2025 Jan;34(1):e70004. doi: 10.1002/pro.70004.
3
SLiMSearch: a framework for proteome-wide discovery and annotation of functional modules in intrinsically disordered regions.SLiMSearch:一种用于在无序区域中进行蛋白质组功能模块的全局发现和注释的框架。
Nucleic Acids Res. 2017 Jul 3;45(W1):W464-W469. doi: 10.1093/nar/gkx238.
4
Bioinformatics Approaches for Predicting Disordered Protein Motifs.预测无序蛋白质基序的生物信息学方法
Adv Exp Med Biol. 2015;870:291-318. doi: 10.1007/978-3-319-20164-1_9.
5
Computational Prediction of Disordered Protein Motifs Using SLiMSuite.使用 SLiMSuite 进行无规则蛋白基序的计算预测。
Methods Mol Biol. 2020;2141:37-72. doi: 10.1007/978-1-0716-0524-0_3.
6
Attributes of short linear motifs.短线性基序的属性。
Mol Biosyst. 2012 Jan;8(1):268-81. doi: 10.1039/c1mb05231d. Epub 2011 Sep 12.
7
Computational prediction of short linear motifs from protein sequences.从蛋白质序列中对短线性基序进行计算预测。
Methods Mol Biol. 2015;1268:89-141. doi: 10.1007/978-1-4939-2285-7_6.
8
SLiMAn: An Integrative Web Server for Exploring Short Linear Motif-Mediated Interactions in Interactomes.SLiMAn:一个用于探索互作组中短线性基序介导相互作用的综合网络服务器。
J Proteome Res. 2022 Jul 1;21(7):1654-1663. doi: 10.1021/acs.jproteome.1c00964. Epub 2022 Jun 1.
9
SLiM-Enrich: computational assessment of protein-protein interaction data as a source of domain-motif interactions.SLiM-Enrich:将蛋白质-蛋白质相互作用数据作为结构域-基序相互作用来源的计算评估
PeerJ. 2018 Oct 31;6:e5858. doi: 10.7717/peerj.5858. eCollection 2018.
10
Discovery of short linear motif-mediated interactions through phage display of intrinsically disordered regions of the human proteome.通过人蛋白质组内在无序区域的噬菌体展示发现短线性基序介导的相互作用。
FEBS J. 2017 Feb;284(3):485-498. doi: 10.1111/febs.13995. Epub 2017 Jan 18.

本文引用的文献

1
SHARK enables sensitive detection of evolutionary homologs and functional analogs in unalignable and disordered sequences.SHARK 能够在不可比对和无序序列中灵敏地检测进化同源物和功能类似物。
Proc Natl Acad Sci U S A. 2024 Oct 15;121(42):e2401622121. doi: 10.1073/pnas.2401622121. Epub 2024 Oct 9.
2
ELM-the Eukaryotic Linear Motif resource-2024 update.ELM-the Eukaryotic Linear Motif resource-2024 update. ELM-真核线性基序资源-2024 更新。
Nucleic Acids Res. 2024 Jan 5;52(D1):D442-D455. doi: 10.1093/nar/gkad1058.
3
Evolutionary-scale prediction of atomic-level protein structure with a language model.
用语言模型进行原子级蛋白质结构的进化尺度预测。
Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.
4
Alignment-free estimation of sequence conservation for identifying functional sites using protein sequence embeddings.基于蛋白质序列嵌入的无比对序列保守性估计用于识别功能位点。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac599.
5
Muscle5: High-accuracy alignment ensembles enable unbiased assessments of sequence homology and phylogeny.肌肉 5:高精度比对集合可实现序列同源性和系统发育的无偏评估。
Nat Commun. 2022 Nov 15;13(1):6968. doi: 10.1038/s41467-022-34630-w.
6
OrthoDB v11: annotation of orthologs in the widest sampling of organismal diversity.OrthoDB v11:在最广泛的生物多样性样本中注释直系同源物。
Nucleic Acids Res. 2023 Jan 6;51(D1):D445-D451. doi: 10.1093/nar/gkac998.
7
Molecular determinants of TRAF6 binding specificity suggest that native interaction partners are not optimized for affinity.TRAF6 结合特异性的分子决定因素表明,天然相互作用伙伴并非针对亲和力进行了优化。
Protein Sci. 2022 Nov;31(11):e4429. doi: 10.1002/pro.4429.
8
Evolution of short linear motifs and disordered proteins Topic: yeast as model system to study evolution.短线性基序和无规则蛋白的进化 主题:以酵母为模型系统研究进化。
Curr Opin Genet Dev. 2022 Oct;76:101964. doi: 10.1016/j.gde.2022.101964. Epub 2022 Aug 5.
9
Discovering molecular features of intrinsically disordered regions by using evolution for contrastive learning.利用进化进行对比学习来发现无序区域的分子特征。
PLoS Comput Biol. 2022 Jun 29;18(6):e1010238. doi: 10.1371/journal.pcbi.1010238. eCollection 2022 Jun.
10
Native proline-rich motifs exploit sequence context to target actin-remodeling Ena/VASP protein ENAH.天然脯氨酸丰富基序利用序列上下文靶向肌动蛋白重塑 Ena/VASP 蛋白 ENAH。
Elife. 2022 Jan 25;11:e70680. doi: 10.7554/eLife.70680.