• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PatMatch:一个用于在肽和核苷酸序列中寻找模式的程序。

PatMatch: a program for finding patterns in peptide and nucleotide sequences.

作者信息

Yan Thomas, Yoo Danny, Berardini Tanya Z, Mueller Lukas A, Weems Dan C, Weng Shuai, Cherry J Michael, Rhee Seung Y

机构信息

Department of Plant Biology, Carnegie Institution of Washington, 260 Panama Street, Stanford, CA 94305, USA.

出版信息

Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W262-6. doi: 10.1093/nar/gki368.

DOI:10.1093/nar/gki368
PMID:15980466
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1160129/
Abstract

Here, we present PatMatch, an efficient, web-based pattern-matching program that enables searches for short nucleotide or peptide sequences such as cis-elements in nucleotide sequences or small domains and motifs in protein sequences. The program can be used to find matches to a user-specified sequence pattern that can be described using ambiguous sequence codes and a powerful and flexible pattern syntax based on regular expressions. A recent upgrade has improved performance and now supports both mismatches and wildcards in a single pattern. This enhancement has been achieved by replacing the previous searching algorithm, scan_for_matches [D'Souza et al. (1997), Trends in Genetics, 13, 497-498], with nondeterministic-reverse grep (NR-grep), a general pattern matching tool that allows for approximate string matching [Navarro (2001), Software Practice and Experience, 31, 1265-1312]. We have tailored NR-grep to be used for DNA and protein searches with PatMatch. The stand-alone version of the software can be adapted for use with any sequence dataset and is available for download at The Arabidopsis Information Resource (TAIR) at ftp://ftp.arabidopsis.org/home/tair/Software/Patmatch/. The PatMatch server is available on the web at http://www.arabidopsis.org/cgi-bin/patmatch/nph-patmatch.pl for searching Arabidopsis thaliana sequences.

摘要

在此,我们展示了PatMatch,这是一个基于网络的高效模式匹配程序,可用于搜索短核苷酸或肽序列,如核苷酸序列中的顺式元件或蛋白质序列中的小结构域和基序。该程序可用于查找与用户指定的序列模式的匹配项,该模式可使用模糊序列代码以及基于正则表达式的强大且灵活的模式语法来描述。最近的一次升级提高了性能,现在在单个模式中同时支持错配和通配符。这一改进是通过用非确定性反向grep(NR-grep)替换先前的搜索算法scan_for_matches [D'Souza等人(1997年),《遗传学趋势》,13,497 - 498]实现的,NR-grep是一种允许近似字符串匹配的通用模式匹配工具[Navarro(2001年),《软件实践与经验》,31,1265 - 1312]。我们对NR-grep进行了定制,使其可用于PatMatch的DNA和蛋白质搜索。该软件的独立版本可适用于任何序列数据集,可从拟南芥信息资源库(TAIR)通过ftp://ftp.arabidopsis.org/home/tair/Software/Patmatch/下载。PatMatch服务器可通过网络访问http://www.arabidopsis.org/cgi-bin/patmatch/nph-patmatch.pl,用于搜索拟南芥序列。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ae7/1160129/060e886c5a85/gki368f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ae7/1160129/060e886c5a85/gki368f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ae7/1160129/060e886c5a85/gki368f1.jpg

相似文献

1
PatMatch: a program for finding patterns in peptide and nucleotide sequences.PatMatch:一个用于在肽和核苷酸序列中寻找模式的程序。
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W262-6. doi: 10.1093/nar/gki368.
2
Gene structure prediction from consensus spliced alignment of multiple ESTs matching the same genomic locus.基于与同一基因组位点匹配的多个EST的一致性剪接比对进行基因结构预测。
Bioinformatics. 2004 May 1;20(7):1157-69. doi: 10.1093/bioinformatics/bth058. Epub 2004 Feb 5.
3
MEME: discovering and analyzing DNA and protein sequence motifs.MEME:发现和分析DNA与蛋白质序列基序
Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W369-73. doi: 10.1093/nar/gkl198.
4
GeneSeqer@PlantGDB: Gene structure prediction in plant genomes.GeneSeqer@PlantGDB:植物基因组中的基因结构预测。
Nucleic Acids Res. 2003 Jul 1;31(13):3597-600. doi: 10.1093/nar/gkg533.
5
Patscanui: an intuitive web interface for searching patterns in DNA and protein data.Patscanui:一个直观的网页界面,用于搜索 DNA 和蛋白质数据中的模式。
Nucleic Acids Res. 2018 Jul 2;46(W1):W205-W208. doi: 10.1093/nar/gky321.
6
The 3of5 web application for complex and comprehensive pattern matching in protein sequences.用于蛋白质序列中复杂全面模式匹配的3of5网络应用程序。
BMC Bioinformatics. 2006 Mar 16;7:144. doi: 10.1186/1471-2105-7-144.
7
Using the Arabidopsis Information Resource (TAIR) to find information about Arabidopsis genes.利用拟南芥信息资源库(TAIR)查找有关拟南芥基因的信息。
Curr Protoc Bioinformatics. 2005 Apr;Chapter 1:Unit 1.11. doi: 10.1002/0471250953.bi0111s9.
8
RNAPattMatch: a web server for RNA sequence/structure motif detection based on pattern matching with flexible gaps.RNAPattMatch:一个基于带灵活间隔的模式匹配来检测RNA序列/结构基序的网络服务器。
Nucleic Acids Res. 2015 Jul 1;43(W1):W507-12. doi: 10.1093/nar/gkv435. Epub 2015 May 4.
9
ANTHEPROT: an integrated protein sequence analysis software with client/server capabilities.ANTHEPROT:一款具有客户端/服务器功能的集成蛋白质序列分析软件。
Comput Biol Med. 2001 Jul;31(4):259-67. doi: 10.1016/s0010-4825(01)00008-7.
10
FAN: fingerprint analysis of nucleotide sequences.FAN:核苷酸序列的指纹分析。
Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W620-3. doi: 10.1093/nar/gkh457.

引用本文的文献

1
Novel Leech Antimicrobial Peptides, Hirunipins: Real-Time 3D Monitoring of Antimicrobial and Antibiofilm Mechanisms Using Optical Diffraction Tomography.新型水蛭抗菌肽,hirunipins:使用光学衍射断层扫描技术对抗菌和抗生物膜机制进行实时三维监测。
Adv Sci (Weinh). 2025 Mar;12(10):e2409803. doi: 10.1002/advs.202409803. Epub 2025 Jan 10.
2
Genome-wide conditional degron libraries for functional genomics.用于功能基因组学的全基因组条件性降解子文库。
J Cell Biol. 2025 Feb 3;224(2). doi: 10.1083/jcb.202409007. Epub 2024 Dec 18.
3
Dehydrin Client Proteins Identified Using Phage Display Affinity Selected Libraries Processed With Paired-End Phage Sequencing.

本文引用的文献

1
PatSearch: A program for the detection of patterns and structural motifs in nucleotide sequences.PatSearch:一种用于检测核苷酸序列中模式和结构基序的程序。
Nucleic Acids Res. 2003 Jul 1;31(13):3608-12. doi: 10.1093/nar/gkg548.
2
The Arabidopsis Information Resource (TAIR): a model organism database providing a centralized, curated gateway to Arabidopsis biology, research materials and community.拟南芥信息资源库(TAIR):一个模式生物数据库,为拟南芥生物学、研究材料及相关群体提供集中的、经过整理的信息通道。
Nucleic Acids Res. 2003 Jan 1;31(1):224-8. doi: 10.1093/nar/gkg076.
3
tacg--a grep for DNA.
使用经过双端噬菌体测序处理的噬菌体展示亲和选择文库鉴定脱水素客户蛋白。
Mol Cell Proteomics. 2024 Dec;23(12):100867. doi: 10.1016/j.mcpro.2024.100867. Epub 2024 Oct 21.
4
Overexpression of transcription factor accelerates vegetative development in .转录因子的过表达加速了……中的营养生长。 (注:原文中“in”后面缺少具体内容)
Front Plant Sci. 2024 Mar 20;15:1327152. doi: 10.3389/fpls.2024.1327152. eCollection 2024.
5
CyanoCyc cyanobacterial web portal.蓝藻网络门户(蓝藻循环)
Front Microbiol. 2024 Jan 31;15:1340413. doi: 10.3389/fmicb.2024.1340413. eCollection 2024.
6
Genome-wide identification and expression analysis of the Pisum sativum (L.) APETALA2/ethylene-responsive factor (AP2/ERF) gene family reveals functions in drought and cold stresses.豌豆(Pisum sativum (L.))APETALA2/乙烯响应因子(AP2/ERF)基因家族的全基因组鉴定与表达分析揭示其在干旱和低温胁迫中的功能
Genetica. 2023 Jun;151(3):225-239. doi: 10.1007/s10709-023-00190-0. Epub 2023 Jun 3.
7
Whole-Transcriptome Sequencing Reveals a ceRNA Regulatory Network Associated with the Process of Periodic Albinism under Low Temperature in Baiye No. 1 ().全转录组测序揭示了低温下白叶 1 号周期性白化过程相关的 ceRNA 调控网络。
Int J Mol Sci. 2023 Apr 12;24(8):7162. doi: 10.3390/ijms24087162.
8
Computational Identification of Functional Centers in Complex Proteins: A Step-by-Step Guide With Examples.复杂蛋白质中功能中心的计算识别:带示例的分步指南
Front Bioinform. 2021 Mar 25;1:652286. doi: 10.3389/fbinf.2021.652286. eCollection 2021.
9
Genomic distribution and context dependent functionality of novel WRKY transcription factor binding sites.新型 WRKY 转录因子结合位点的基因组分布和上下文相关功能。
BMC Genomics. 2022 Sep 27;23(1):673. doi: 10.1186/s12864-022-08877-y.
10
Identification and characterization of miRNAs and PHAS loci related to the early development of the embryo and endosperm in Fragaria × ananassa.鉴定和描述与草莓胚胎和胚乳早期发育相关的 miRNA 和 PHAS 基因座。
BMC Genomics. 2022 Sep 8;23(1):638. doi: 10.1186/s12864-022-08864-3.
TACG——一种用于DNA的搜索工具。
BMC Bioinformatics. 2002;3:8. doi: 10.1186/1471-2105-3-8. Epub 2002 Mar 6.
4
The EMOTIF database.EMOTIF数据库。
Nucleic Acids Res. 2001 Jan 1;29(1):202-4. doi: 10.1093/nar/29.1.202.
5
The Arabidopsis Information Resource (TAIR): a comprehensive database and web-based information retrieval, analysis, and visualization system for a model plant.拟南芥信息资源库(TAIR):一个针对模式植物的综合数据库以及基于网络的信息检索、分析和可视化系统。
Nucleic Acids Res. 2001 Jan 1;29(1):102-5. doi: 10.1093/nar/29.1.102.
6
PatSearch: a pattern matcher software that finds functional elements in nucleotide and protein sequences and assesses their statistical significance.PatSearch:一种模式匹配软件,可在核苷酸和蛋白质序列中找到功能元件并评估其统计学意义。
Bioinformatics. 2000 May;16(5):439-50. doi: 10.1093/bioinformatics/16.5.439.
7
Searching for patterns in genomic data.在基因组数据中寻找模式。
Trends Genet. 1997 Dec;13(12):497-8. doi: 10.1016/s0168-9525(97)01347-4.
8
Genetic and physical maps of Saccharomyces cerevisiae.酿酒酵母的遗传图谱和物理图谱。
Nature. 1997 May 29;387(6632 Suppl):67-73.
9
Arabidopsis thaliana CBF1 encodes an AP2 domain-containing transcriptional activator that binds to the C-repeat/DRE, a cis-acting DNA regulatory element that stimulates transcription in response to low temperature and water deficit.拟南芥CBF1编码一种含AP2结构域的转录激活因子,该因子可与C重复序列/脱水应答元件(C-repeat/DRE)结合,C-repeat/DRE是一种顺式作用DNA调控元件,可响应低温和水分亏缺刺激转录。
Proc Natl Acad Sci U S A. 1997 Feb 4;94(3):1035-40. doi: 10.1073/pnas.94.3.1035.