Ma Yuling, Yan Chenchao, Li Huimin, Wu Wentao, Liu Yaxue, Wang Yuqian, Chen Qin, Ma Haoli
State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A&F University, Yangling, China; National Base for the Talents on Life-Science and Technology, Innovation Experimental College, Northwest A&F University, Yangling, China.
State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A&F University, Yangling, China.
Front Plant Sci. 2017 Jan 26;8:66. doi: 10.3389/fpls.2017.00066. eCollection 2017.
Arabinogalactan proteins (AGPs) are a family of extracellular glycoproteins implicated in plant growth and development. With the rapid growth in the number of sequenced plant genomes, AGP family members can now be predicted to facilitate functional investigation. Building on previous advances in AGP identification, this study proposes an integrated strategy for systematically screening "classical" and "chimeric" AGP family members. A Python script named Finding-AGP was written to find AGP-like sequences and filter AGP candidates under given thresholds. Primary screening of classical AGPs, Lys-rich classical AGPs, AGP-extensin hybrids, and non-classical AGPs required the presence of a signal peptide, whereas BLAST searches were used mainly for fasciclin-like, phytocyanin-like, and xylogen-like AGPs. The glycomodule index and partial PAST (Pro, Ala, Ser, and Thr) percentage were then used to identify AGP candidates. The integrated strategy identified AGP gene families in 47 plant species. The main results are as follows: (i) AGPs are abundant in angiosperms, and many "ancient" AGPs with Ser-Pro repeats are found in […]; (ii) classical AGPs, AG-peptides, and Lys-rich classical AGPs first emerged in […], […], and […], respectively; (iii) nine newly identified subfamilies of chimeric AGPs are introduced, analogous to the fasciclin-like, phytocyanin-like, and xylogen-like AGPs; (iv) the length and amino acid composition of Lys-rich domains vary widely, suggesting an insertion/deletion model during evolution. These findings provide not only a powerful means of identifying AGP gene families but also probable explanations for the roles of AGPs in maintaining the plant cell wall and transducing extracellular signals into the cytoplasm.
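The filtering step described in the abstract can be illustrated with a minimal Python sketch. It is not the Finding-AGP script itself, and the abstract does not define the glycomodule index or the "partial" PAST percentage. The sketch therefore makes several assumptions: glycomodules are taken to be the dipeptides Ala-Pro, Pro-Ala, Ser-Pro, and Thr-Pro; "partial" is read as the highest PAST percentage in any fixed-length window; and the function names, window size, and thresholds are all hypothetical. Signal peptide prediction is assumed to be done by an external tool and passed in as a boolean.

```python
# Illustrative sketch only; NOT the Finding-AGP implementation.
# All names, the window size and the thresholds are hypothetical.

PAST = set("PAST")                       # Pro, Ala, Ser, Thr
GLYCOMODULES = ("AP", "PA", "SP", "TP")  # assumed dipeptide set


def past_percentage(seq):
    """Percentage of Pro, Ala, Ser and Thr residues in the whole sequence."""
    if not seq:
        return 0.0
    return 100.0 * sum(aa in PAST for aa in seq) / len(seq)


def max_window_past(seq, window=30):
    """Highest PAST percentage in any window of `window` residues.

    This is one possible reading of a "partial" PAST percentage (assumption).
    Sequences no longer than the window are scored as a whole.
    """
    if len(seq) <= window:
        return past_percentage(seq)
    count = sum(aa in PAST for aa in seq[:window])
    best = count
    # Slide the window one residue at a time, updating the running count.
    for i in range(window, len(seq)):
        count += (seq[i] in PAST) - (seq[i - window] in PAST)
        best = max(best, count)
    return 100.0 * best / window


def glycomodule_count(seq):
    """Number of (overlapping) occurrences of the assumed glycomodule dipeptides."""
    return sum(seq[i:i + 2] in GLYCOMODULES for i in range(len(seq) - 1))


def is_candidate(seq, has_signal_peptide, min_past=50.0, min_modules=3):
    """Apply the hypothetical thresholds; a signal peptide is mandatory."""
    return (
        has_signal_peptide
        and max_window_past(seq) >= min_past
        and glycomodule_count(seq) >= min_modules
    )
```

In practice, a script of this kind would read protein sequences from a FASTA file, take `has_signal_peptide` from a signal peptide predictor, and tune the thresholds separately for each AGP subfamily.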