Uttarotai Toungporn, Mukjang Nilita, Chaisoung Natcha, Pathom-Aree Wasu, Pekkoh Jeeraporn, Pumas Chayakorn, Sattayawat Pachara
Department of Biology, Faculty of Science, Chiang Mai University, Chiang Mai 50200, Thailand.
Department of Entomology and Plant Pathology, Faculty of Agriculture, Chiang Mai University, Chiang Mai 50200, Thailand.
Biology (Basel). 2022 Aug 17;11(8):1226. doi: 10.3390/biology11081226.
Synthetic biology is a principle that aims to create new biological systems with particular functions or to redesign the existing ones through bioengineering. Therefore, this principle is often utilized as a tool to put the knowledge learned to practical use in actual fields. However, there is still a great deal of information remaining to be found, and this limits the possible utilization of synthetic biology, particularly on the topic that is the focus of the present work-heavy metal bio-removal. In this work, we aim to construct a comprehensive library of putative proteins that might support heavy metal bio-removal. Hypothetical proteins were discovered from and genomes and extensively annotated. The protein structures of these putative proteins were also modeled through Alphafold2. Although a portion of this workflow has previously been demonstrated to annotate hypothetical proteins from whole genome sequences, the adaptation of such steps is yet to be done for library construction purposes. We also demonstrated further downstream steps that allow a more accurate function prediction of the hypothetical proteins by subjecting the models generated to structure-based annotation. In conclusion, a total of 72 newly discovered putative proteins were annotated with ready-to-use predicted structures available for further investigation.
合成生物学是一种旨在通过生物工程创造具有特定功能的新生物系统或重新设计现有生物系统的原理。因此,这一原理常被用作一种工具,将所学知识应用于实际领域。然而,仍有大量信息有待发现,这限制了合成生物学的可能应用,特别是在当前工作的重点——重金属生物去除这一主题上。在这项工作中,我们旨在构建一个可能支持重金属生物去除的假定蛋白质综合文库。从[具体物种1]和[具体物种2]基因组中发现了假定蛋白质并进行了广泛注释。这些假定蛋白质的蛋白质结构也通过Alphafold2进行了建模。虽然此前已证明该工作流程的一部分可用于注释全基因组序列中的假定蛋白质,但尚未为文库构建目的对这些步骤进行调整。我们还展示了进一步的下游步骤,通过对生成的模型进行基于结构的注释,能够对假定蛋白质进行更准确的功能预测。总之,共有72个新发现的假定蛋白质被注释,并具有可供进一步研究的即用型预测结构。