ARC Centre of Excellence in Plant Energy Biology, The University of Western Australia, Crawley, WA, 6009, Australia.
Precision Medicine Theme, South Australian Health and Medical Research Institute, Adelaide, SA, 5000, Australia.
Plant J. 2020 Nov;104(3):812-827. doi: 10.1111/tpj.14961. Epub 2020 Sep 16.
Agriculture faces increasing demand for yield, higher plant-derived protein content and diversity while facing pressure to achieve sustainability. Although the genomes of many of the important crops have been sequenced, the subcellular locations of most of the encoded proteins remain unknown or are only predicted. Protein subcellular location is crucial in determining protein function and accumulation patterns in plants, and is critical for targeted improvements in yield and resilience. Integrating location data from over 800 studies for 12 major crop species into the cropPAL2020 data collection showed that while >80% of proteins in most species are not localised by experimental data, combining species data or integrating predictions can help bridge gaps at similar accuracy. The collation and integration of over 61 505 experimental localisations and more than 6 million predictions showed that the relative sizes of the protein catalogues located in different subcellular compartments are comparable between crops and Arabidopsis. A comprehensive cross-species comparison showed that between 50% and 80% of the subcellulomes are conserved across species and that conservation only depends to some degree on the phylogenetic relationship of the species. Protein subcellular locations in major biosynthesis pathways are more often conserved than in metabolic pathways. Underlying this conservation is a clear potential for subcellular diversity in protein location between species by means of gene duplication and alternative splicing. Our cropPAL data set and search platform (https://crop-pal.org) provide a comprehensive subcellular proteomics resource to drive compartmentation-based approaches for improving yield, protein composition and resilience in future crop varieties.
农业面临着提高产量、提高植物源性蛋白质含量和多样性的需求,同时还面临着实现可持续性的压力。尽管许多重要作物的基因组已经被测序,但大多数编码蛋白的亚细胞位置仍然未知或仅被预测。蛋白质的亚细胞定位对确定蛋白质在植物中的功能和积累模式至关重要,对有针对性地提高产量和恢复力也至关重要。将 12 种主要作物的 800 多项研究的定位数据整合到 cropPAL2020 数据集中,结果表明,尽管大多数物种中超过 80%的蛋白质没有通过实验数据定位,但整合物种数据或整合预测可以帮助以类似的准确性弥合差距。对超过 61505 个实验定位和超过 600 万个预测的整理和整合表明,不同亚细胞区室中定位的蛋白质目录的相对大小在作物和拟南芥之间是可比的。全面的跨物种比较表明,50%至 80%的亚细胞组在物种间是保守的,而且这种保守性仅在一定程度上取决于物种的系统发育关系。主要生物合成途径中的蛋白质亚细胞位置比代谢途径更常保守。这种保守性的背后是物种间蛋白质定位的亚细胞多样性的明显潜力,这是通过基因复制和选择性剪接实现的。我们的 cropPAL 数据集和搜索平台(https://crop-pal.org)提供了一个全面的亚细胞蛋白质组学资源,以推动基于区室化的方法,在未来的作物品种中提高产量、蛋白质组成和恢复力。