Janairo Jose Isagani B
Department of Biology, De La Salle University 2401 Taft Avenue Manila 0922 Philippines
RSC Adv. 2023 Jul 12;13(31):21146-21152. doi: 10.1039/d3ra04269c.
Metal-binding peptides play a central role in bionanotechnology, wherein they are responsible for directing growth and influencing the resulting properties of inorganic nanomaterials. One of the key advantages of using peptides to create nanomaterials is their versatility, wherein subtle changes in the sequence can have a dramatic effect on the structure and properties of the nanomaterial. However, precisely knowing which position and which amino acid should be modified within a given sequence to enhance a specific property can be a daunting challenge owing to combinatorial complexity. In this study, classification based on association rules was performed using 860 gold-binding peptides. Using a minimum support threshold of 0.035 and confidence of 0.9, 30 rules with confidence and lift values greater than 0.9 and 1, respectively, were extracted that can differentiate high-binding from low-binding peptides. The test performance of these rules for categorizing the peptides was found to be satisfactory, as characterized by accuracy = 0.942, = 0.941, MCC = 0.884. What stands out from the extracted rules are the importance of tryptophan and arginine residues in differentiating peptides with high binding affinity from those with low affinity. In addition, the association rules revealed that positions 2 and 4 within a decapeptide are frequently involved in the rules, thus suggesting their importance in influencing peptide binding affinity to AuNPs. Collectively, this study identified sequence rules that may be used to design peptides with high binding affinity.
金属结合肽在生物纳米技术中发挥着核心作用,在该技术领域中,它们负责引导生长并影响无机纳米材料的最终性质。使用肽来制备纳米材料的关键优势之一在于其多功能性,即序列中的细微变化可能会对纳米材料的结构和性质产生显著影响。然而,由于组合复杂性,精确确定在给定序列中应修饰哪个位置和哪个氨基酸以增强特定性质可能是一项艰巨的挑战。在本研究中,使用860种金结合肽进行了基于关联规则的分类。使用0.035的最小支持阈值和0.9的置信度,提取了30条规则,其置信度和提升值分别大于0.9和1,这些规则可以区分高结合肽和低结合肽。结果发现,这些规则对肽进行分类的测试性能令人满意,其特征为准确率 = 0.942,召回率 = 0.941,马修斯相关系数 = 0.884。从提取的规则中可以看出,色氨酸和精氨酸残基在区分高结合亲和力肽和低结合亲和力肽方面的重要性。此外,关联规则显示,十肽中的第2位和第4位经常出现在规则中,因此表明它们在影响肽与金纳米颗粒的结合亲和力方面的重要性。总的来说,本研究确定了可用于设计具有高结合亲和力肽的序列规则。