• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

鉴定生物序列中基于分类的判别基序。

Identifying discriminative classification-based motifs in biological sequences.

机构信息

Katholieke Universiteit Leuven, Department of Computer Science, Celestijnenlaan 200A, Leuven, Belgium.

出版信息

Bioinformatics. 2011 May 1;27(9):1231-8. doi: 10.1093/bioinformatics/btr110. Epub 2011 Mar 3.

DOI:10.1093/bioinformatics/btr110
PMID:21372086
Abstract

MOTIVATION

Identification of conserved motifs in biological sequences is crucial to unveil common shared functions. Many tools exist for motif identification, including some that allow degenerate positions with multiple possible nucleotides or amino acids. Most efficient methods available today search conserved motifs in a set of sequences, but do not check for their specificity regarding to a set of negative sequences.

RESULTS

We present a tool to identify degenerate motifs, based on a given classification of amino acids according to their physico-chemical properties. It returns the top K motifs that are most frequent in a positive set of sequences involved in a biological process of interest, and absent from a negative set. Thus, our method discovers discriminative motifs in biological sequences that may be used to identify new sequences involved in the same process. We used this tool to identify candidate effector proteins secreted into plant tissues by the root knot nematode Meloidogyne incognita. Our tool identified a series of motifs specifically present in a positive set of known effectors while totally absent from a negative set of evolutionarily conserved housekeeping proteins. Scanning the proteome of M. incognita, we detected 2579 proteins that contain these specific motifs and can be considered as new putative effectors.

AVAILABILITY AND IMPLEMENTATION

The motif discovery tool and the proteins used in the experiments are available at http://dtai.cs.kuleuven.be/ml/systems/merci.

摘要

动机

在生物序列中识别保守基序对于揭示共同的共享功能至关重要。许多工具可用于基序识别,包括允许具有多个可能核苷酸或氨基酸的简并位置的工具。当今最有效的方法是在一组序列中搜索保守基序,但不检查它们针对一组负序列的特异性。

结果

我们提出了一种基于给定的氨基酸理化性质分类来识别简并基序的工具。它返回在与感兴趣的生物过程相关的阳性序列集中最频繁出现且在阴性序列集中不存在的前 K 个最常见的基序。因此,我们的方法可以在生物序列中发现具有区分度的基序,这些基序可用于识别参与同一过程的新序列。我们使用此工具来鉴定根结线虫 Meloidogyne incognita 分泌到植物组织中的候选效应蛋白。我们的工具鉴定了一系列特定存在于阳性已知效应物集中而完全不存在于进化保守管家蛋白阴性集中的基序。扫描 M. incognita 的蛋白质组,我们检测到 2579 种含有这些特定基序的蛋白质,可被视为新的潜在效应物。

可用性和实现

http://dtai.cs.kuleuven.be/ml/systems/merci 上提供了基序发现工具和实验中使用的蛋白质。

相似文献

1
Identifying discriminative classification-based motifs in biological sequences.鉴定生物序列中基于分类的判别基序。
Bioinformatics. 2011 May 1;27(9):1231-8. doi: 10.1093/bioinformatics/btr110. Epub 2011 Mar 3.
2
Members of the Meloidogyne avirulence protein family contain multiple plant ligand-like motifs.拟滑刃线虫无毒蛋白家族成员包含多个植物配体样基序。
Phytopathology. 2014 Aug;104(8):879-85. doi: 10.1094/PHYTO-11-13-0326-R.
3
A generic motif discovery algorithm for sequential data.一种用于序列数据的通用基序发现算法。
Bioinformatics. 2006 Jan 1;22(1):21-8. doi: 10.1093/bioinformatics/bti745. Epub 2005 Oct 27.
4
Discriminative motif discovery in DNA and protein sequences using the DEME algorithm.使用DEME算法在DNA和蛋白质序列中发现鉴别性基序。
BMC Bioinformatics. 2007 Oct 15;8:385. doi: 10.1186/1471-2105-8-385.
5
Identification and characterization of specific motifs in effector proteins of plant parasites using MOnSTER.使用 MOnSTER 鉴定和描述植物寄生效应子蛋白中的特异性模体。
Commun Biol. 2024 Jul 12;7(1):850. doi: 10.1038/s42003-024-06515-9.
6
Localized motif discovery in gene regulatory sequences.基因调控序列中的局部模体发现。
Bioinformatics. 2010 May 1;26(9):1152-9. doi: 10.1093/bioinformatics/btq106. Epub 2010 Mar 11.
7
Galaxy as a platform for identifying candidate pathogen effectors.银河作为鉴定候选病原体效应子的平台。
Methods Mol Biol. 2014;1127:3-15. doi: 10.1007/978-1-62703-986-4_1.
8
Fast and accurate discovery of degenerate linear motifs in protein sequences.在蛋白质序列中快速准确地发现简并线性基序
PLoS One. 2014 Sep 10;9(9):e106081. doi: 10.1371/journal.pone.0106081. eCollection 2014.
9
Fast model-based protein homology detection without alignment.基于快速模型的无需比对的蛋白质同源性检测。
Bioinformatics. 2007 Jul 15;23(14):1728-36. doi: 10.1093/bioinformatics/btm247. Epub 2007 May 8.
10
Multiple Motif Scanning to identify methyltransferases from the yeast proteome.通过多重基序扫描从酵母蛋白质组中鉴定甲基转移酶。
Mol Cell Proteomics. 2009 Jul;8(7):1516-26. doi: 10.1074/mcp.M900025-MCP200. Epub 2009 Apr 7.

引用本文的文献

1
Prediction of exosomal miRNA-based biomarkers for liquid biopsy.基于外泌体微小RNA的液体活检生物标志物预测
Sci Rep. 2025 Aug 25;15(1):31191. doi: 10.1038/s41598-025-15814-y.
2
EnsembleNPPred: A Robust Approach to Neuropeptide Prediction and Recognition Using Ensemble Machine Learning and Deep Learning Methods.集成神经肽预测:一种使用集成机器学习和深度学习方法进行神经肽预测与识别的稳健方法。
Life (Basel). 2025 Jun 25;15(7):1010. doi: 10.3390/life15071010.
3
In Silico tool for predicting, designing and scanning IL-2 inducing peptides.用于预测、设计和筛选白细胞介素-2诱导肽的计算机模拟工具。
Sci Rep. 2025 Jul 16;15(1):25692. doi: 10.1038/s41598-025-08388-2.
4
iALP: Identification of Allergenic Proteins Based on Large Language Model and Gate Linear Unit.iALP:基于大语言模型和门控线性单元的变应原蛋白鉴定
Interdiscip Sci. 2025 Jul 13. doi: 10.1007/s12539-025-00734-2.
5
High Throughput Meta-analysis of Antimicrobial Peptides for Characterizing Class Specific Therapeutic Candidates: An In Silico Approach.用于鉴定特定类别治疗候选物的抗菌肽高通量荟萃分析:一种计算机模拟方法。
Probiotics Antimicrob Proteins. 2025 May 27. doi: 10.1007/s12602-025-10596-1.
6
Prediction of hemolytic peptides and their hemolytic concentration.溶血肽及其溶血浓度的预测。
Commun Biol. 2025 Feb 4;8(1):176. doi: 10.1038/s42003-025-07615-w.
7
Single-wall carbon nanotube separations aqueous two-phase extraction: new prospects enabled by high-throughput methods.单壁碳纳米管分离 双水相萃取:高通量方法带来的新前景
Chem Commun (Camb). 2025 Jan 28;61(10):2026-2039. doi: 10.1039/d4cc06096b.
8
A hybrid method for discovering interferon-gamma inducing peptides in human and mouse.一种用于在人和鼠中发现干扰素-γ诱导肽的混合方法。
Sci Rep. 2024 Nov 6;14(1):26859. doi: 10.1038/s41598-024-77957-8.
9
PLMACPred prediction of anticancer peptides based on protein language model and wavelet denoising transformation.基于蛋白质语言模型和小波去噪变换的抗癌肽 PLMACPred 预测。
Sci Rep. 2024 Jul 23;14(1):16992. doi: 10.1038/s41598-024-67433-8.
10
Identification and characterization of specific motifs in effector proteins of plant parasites using MOnSTER.使用 MOnSTER 鉴定和描述植物寄生效应子蛋白中的特异性模体。
Commun Biol. 2024 Jul 12;7(1):850. doi: 10.1038/s42003-024-06515-9.