拟南芥未注释分泌肽数据库，植物肽组学资源。

The Arabidopsis unannotated secreted peptide database, a resource for plant peptidomics.

作者信息

Lease Kevin A, Walker John C

机构信息

Division of Biological Sciences, University of Missouri, Columbia, Missouri 65211, USA.

出版信息

Plant Physiol. 2006 Nov;142(3):831-8. doi: 10.1104/pp.106.086041. Epub 2006 Sep 22.

DOI:10.1104/pp.106.086041

PMID:16998087

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1630735/

Abstract

In the era of genomics, if a gene is not annotated, it is not investigated. Due to their small size, genes encoding peptides are often missed in genome annotations. Secreted peptides are important regulators of plant growth, development, and physiology. Identification of additional peptide signals by sequence homology searches has had limited success due to sequence heterogeneity. A bioinformatics approach was taken to find unannotated Arabidopsis (Arabidopsis thaliana) peptides. Arabidopsis chromosome sequences were searched for all open reading frames (ORFs) encoding peptides and small proteins between 25 and 250 amino acids in length. The translated ORFs were then sequentially queried for the presence of an amino-terminal cleavable signal peptide, the absence of transmembrane domains, and the absence of endoplasmic reticulum lumenal retention sequences. Next, the ORFs were filtered against the The Arabidopsis Information Resource 6.0 annotated Arabidopsis genes to remove those ORFs overlapping known genes. The remaining 33,809 ORFs were placed in a relational database to which additional annotation data were deposited. Genome-wide tiling array data were compared with the coordinates of the ORFs, supporting the possibility that many of the ORFs may be expressed. In addition, clustering and sequence similarity analyses revealed that many of the putative peptides are in gene families and/or appear to be present in the rice (Oryza sativa) genome. A subset of the ORFs was evaluated by reverse transcription-PCR and, for one-fifth of those, expression was detected. These results support the idea that the number and diversity of plant peptides is broader than currently assumed. The peptides identified and their annotation data may be viewed or downloaded through a searchable Web interface at peptidome.missouri.edu.

摘要

在基因组学时代，如果一个基因没有注释，就不会对其进行研究。由于其尺寸小，编码肽的基因在基因组注释中常常被遗漏。分泌肽是植物生长、发育和生理的重要调节因子。由于序列异质性，通过序列同源性搜索鉴定额外的肽信号成效有限。我们采用了一种生物信息学方法来寻找未注释的拟南芥（Arabidopsis thaliana）肽。在拟南芥染色体序列中搜索所有编码长度在25至250个氨基酸之间的肽和小蛋白的开放阅读框（ORF）。然后依次查询翻译后的ORF是否存在氨基末端可切割信号肽、是否不存在跨膜结构域以及是否不存在内质网腔滞留序列。接下来，将这些ORF与拟南芥信息资源6.0注释的拟南芥基因进行比对，以去除那些与已知基因重叠的ORF。其余33,809个ORF被放入一个关系数据库，并在其中存入了额外的注释数据。将全基因组平铺阵列数据与ORF的坐标进行比较，支持了许多ORF可能被表达的可能性。此外，聚类和序列相似性分析表明，许多推定的肽属于基因家族和/或似乎存在于水稻（Oryza sativa）基因组中。通过逆转录PCR对一部分ORF进行了评估，其中五分之一检测到了表达。这些结果支持了植物肽的数量和多样性比目前所认为的更为广泛这一观点。所鉴定的肽及其注释数据可通过peptidome.missouri.edu上的可搜索网络界面进行查看或下载。

相似文献

The Arabidopsis unannotated secreted peptide database, a resource for plant peptidomics.拟南芥未注释分泌肽数据库，植物肽组学资源。

Plant Physiol. 2006 Nov;142(3):831-8. doi: 10.1104/pp.106.086041. Epub 2006 Sep 22.

Bioinformatic identification of plant peptides.植物肽的生物信息学鉴定

Methods Mol Biol. 2010;615:375-83. doi: 10.1007/978-1-60761-535-4_26.

Plant-PrAS: a database of physicochemical and structural properties and novel functional regions in plant proteomes.植物PrAS：植物蛋白质组中物理化学和结构特性以及新功能区域的数据库。

Plant Cell Physiol. 2015 Jan;56(1):e11. doi: 10.1093/pcp/pcu176. Epub 2014 Nov 29.

Identification of Arabidopsis thaliana upstream open reading frames encoding peptide sequences that cause ribosomal arrest.鉴定编码导致核糖体停滞的肽序列的拟南芥上游开放阅读框。

Nucleic Acids Res. 2017 Sep 6;45(15):8844-8858. doi: 10.1093/nar/gkx528.

Peptomics, identification of novel cationic Arabidopsis peptides with conserved sequence motifs.肽组学，鉴定具有保守序列基序的新型拟南芥阳离子肽。

In Silico Biol. 2002;2(4):441-51.

Small cysteine-rich peptides resembling antimicrobial peptides have been under-predicted in plants.在植物中，类似于抗菌肽的富含半胱氨酸的小肽一直未得到充分预测。

Plant J. 2007 Jul;51(2):262-80. doi: 10.1111/j.1365-313X.2007.03136.x. Epub 2007 Jun 12.

Identification of novel Arabidopsis thaliana upstream open reading frames that control expression of the main coding sequences in a peptide sequence-dependent manner.鉴定以肽序列依赖性方式控制主要编码序列表达的新型拟南芥上游开放阅读框。

Nucleic Acids Res. 2015 Feb 18;43(3):1562-76. doi: 10.1093/nar/gkv018. Epub 2015 Jan 23.

Mining the genome of Arabidopsis thaliana as a basis for the identification of novel bioactive peptides involved in oxidative stress tolerance.以拟南芥基因组挖掘为基础，鉴定参与氧化胁迫耐受的新型生物活性肽。

J Exp Bot. 2013 Dec;64(17):5297-307. doi: 10.1093/jxb/ert295. Epub 2013 Sep 16.

Identification and analysis of Arabidopsis expressed sequence tags characteristic of non-coding RNAs.拟南芥非编码RNA特征性表达序列标签的鉴定与分析。

Plant Physiol. 2001 Nov;127(3):765-76.

Stress-induced changes in the Arabidopsis thaliana transcriptome analyzed using whole-genome tiling arrays.利用全基因组平铺阵列分析拟南芥转录组中应激诱导的变化。

Plant J. 2009 Jun;58(6):1068-82. doi: 10.1111/j.1365-313X.2009.03835.x. Epub 2009 Feb 13.

引用本文的文献

Identification of peptidome-based biomarkers of cassava mosaic disease resistance in different cassava varieties.不同木薯品种中基于肽组的木薯花叶病抗性生物标志物的鉴定

Sci Rep. 2025 Apr 12;15(1):12653. doi: 10.1038/s41598-025-97452-y.

Determining the Role of OsAGP6P in Anther Development Within the Arabinogalactan Peptide Family of Rice ().确定水稻阿拉伯半乳聚糖肽家族中OsAGP6P在花药发育中的作用（）。

Int J Mol Sci. 2025 Mar 14;26(6):2616. doi: 10.3390/ijms26062616.

Peptide hormones in plants.植物中的肽激素。

Mol Hortic. 2025 Jan 23;5(1):7. doi: 10.1186/s43897-024-00134-y.

Critical radicle length window governing loss of dehydration tolerance in germinated Perilla seeds: insights from physiological and transcriptomic analyses.临界胚根长度窗口控制萌发紫苏种子脱水耐性的丧失：来自生理和转录组分析的见解。

BMC Plant Biol. 2024 Nov 15;24(1):1078. doi: 10.1186/s12870-024-05801-2.

LncRNA-encoded peptides in cancer.lncRNA 编码肽在癌症中的作用。

J Hematol Oncol. 2024 Aug 12;17(1):66. doi: 10.1186/s13045-024-01591-0.

Phytosulfokine alpha enhances regeneration of transformed and untransformed protoplasts of .植物磺肽素α增强了（此处原文不完整，未明确具体对象）转化和未转化原生质体的再生能力。

Front Plant Sci. 2024 Mar 27;15:1379618. doi: 10.3389/fpls.2024.1379618. eCollection 2024.

Sulfated peptides and their receptors: Key regulators of plant development and stress adaptation.硫酸化肽及其受体：植物发育和应激适应的关键调节剂。

Plant Commun. 2024 Jun 10;5(6):100918. doi: 10.1016/j.xplc.2024.100918. Epub 2024 Apr 10.

Editorial: Neuropeptide actions in arthropod biology.社论：神经肽在节肢动物生物学中的作用

Front Endocrinol (Lausanne). 2024 Feb 28;15:1387176. doi: 10.3389/fendo.2024.1387176. eCollection 2024.

Rapid Identification of Peptide-Receptor-Coreceptor Complexes in Protoplasts.快速鉴定原生质体中的肽-受体-共受体复合物。

Methods Mol Biol. 2024;2731:241-251. doi: 10.1007/978-1-0716-3511-7_18.

Improved super-resolution ribosome profiling reveals prevalent translation of upstream ORFs and small ORFs in Arabidopsis.改进的核糖体超分辨图谱分析揭示了拟南芥中上游开放阅读框和小开放阅读框的普遍翻译。

Plant Cell. 2024 Feb 26;36(3):510-539. doi: 10.1093/plcell/koad290.

本文引用的文献

A plant peptide encoded by CLV3 identified by in situ MALDI-TOF MS analysis.通过原位基质辅助激光解吸电离飞行时间质谱分析鉴定出的由CLV3编码的一种植物肽。

Science. 2006 Aug 11;313(5788):845-8. doi: 10.1126/science.1128439.

Dodeca-CLE peptides as suppressors of plant stem cell differentiation.十二聚体CLE肽作为植物干细胞分化的抑制剂。

Science. 2006 Aug 11;313(5788):842-5. doi: 10.1126/science.1128436.

An endogenous peptide signal in Arabidopsis activates components of the innate immune response.拟南芥中的一种内源性肽信号激活了先天免疫反应的组成部分。

Proc Natl Acad Sci U S A. 2006 Jun 27;103(26):10098-103. doi: 10.1073/pnas.0603727103. Epub 2006 Jun 19.

The cell surface leucine-rich repeat receptor for AtPep1, an endogenous peptide elicitor in Arabidopsis, is functional in transgenic tobacco cells.拟南芥中的一种内源性肽激发子AtPep1的细胞表面富含亮氨酸重复序列受体在转基因烟草细胞中具有功能。

Proc Natl Acad Sci U S A. 2006 Jun 27;103(26):10104-9. doi: 10.1073/pnas.0603729103. Epub 2006 Jun 19.

Gain-of-function phenotypes of many CLAVATA3/ESR genes, including four new family members, correlate with tandem variations in the conserved CLAVATA3/ESR domain.许多CLAVATA3/ESR基因（包括四个新的家族成员）的功能获得性表型与保守的CLAVATA3/ESR结构域中的串联变异相关。

Plant Physiol. 2006 Apr;140(4):1331-44. doi: 10.1104/pp.105.075515. Epub 2006 Feb 17.

Features of Arabidopsis genes and genome discovered using full-length cDNAs.利用全长cDNA发现的拟南芥基因和基因组特征。

Plant Mol Biol. 2006 Jan;60(1):69-85. doi: 10.1007/s11103-005-2564-9.

Plant MPSS databases: signature-based transcriptional resources for analyses of mRNA and small RNA.植物MPSS数据库：用于mRNA和小RNA分析的基于特征的转录资源。

Nucleic Acids Res. 2006 Jan 1;34(Database issue):D731-5. doi: 10.1093/nar/gkj077.

Cell wall proteins: a new insight through proteomics.细胞壁蛋白：蛋白质组学带来的新见解

Trends Plant Sci. 2006 Jan;11(1):33-9. doi: 10.1016/j.tplants.2005.11.006. Epub 2005 Dec 13.

Pathogen elicitor-induced changes in the maize extracellular matrix proteome.病原体激发子诱导的玉米细胞外基质蛋白质组变化。

Proteomics. 2005 Dec;5(18):4894-904. doi: 10.1002/pmic.200500047.

A proteomic approach to apoplastic proteins involved in cell wall regeneration in protoplasts of Arabidopsis suspension-cultured cells.一种针对拟南芥悬浮培养细胞原生质体中参与细胞壁再生的质外体蛋白的蛋白质组学方法。

Plant Cell Physiol. 2005 Jun;46(6):843-57. doi: 10.1093/pcp/pci089. Epub 2005 Mar 15.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。