Department of Bioinformatics, Kish International Campus, University of Tehran, Kish, Iran.
Laboratory of Complex Biological Systems and Bioinformatics (CBB), Department of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran.
Sci Rep. 2023 Sep 25;13(1):16029. doi: 10.1038/s41598-023-43042-9.
There are significant environmental and health concerns associated with the current inefficient plastic recycling process. This study presents the first integrated reference catalog of plastic-contaminated environments, obtained using an in silico workflow, which could play a significant role in discovering new plastizymes. We combined 66 whole-metagenome datasets of plastic-contaminated environment samples, drawn from four previously collected metagenome datasets, with our new sample. From these we constructed an integrated catalog of genes, proteins, taxa, and plastic-degrading enzymes (PDEC) for plastic-contaminated environments. These catalogs contain 53,300,583 non-redundant genes and proteins, 691 metagenome-assembled genomes, and 136,654 plastizymes. Based on KEGG and eggNOG annotations, 42% of recognized genes lack annotations, indicating that their functions remain elusive and warrant further investigation. The PDEC also highlights hydrolases, peroxidases, and cutinases as the prevailing plastizymes. Finally, after multiple validation procedures, we focused on pinpointing the enzymes most similar to the introduced plastizymes in both sequence (the linear composition of constituent units) and three-dimensional structure (the spatial conformation of the molecule). The resulting catalog is expected to improve the resolution of future multi-omics studies, providing new insights into plastic-pollution-related research.