Yang Yankun, Liu Guoqiang, Liu Meng, Bai Zhonghu, Liu Xiuxia, Dai Xiaofeng, Guo Wenwen
The Key Laboratory of Carbohydrate Chemistry and Biotechnology, School of Biotechnology, Jiangnan University, Ministry of Education, 1800 Lihu Avenue, 214122 Wuxi, PR China.
National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Avenue, 214122 Wuxi, PR China.
Food Technol Biotechnol. 2018 Mar;56(1):101-109. doi: 10.17113/ftb.56.01.18.5445.
It is widely accepted that features such as pI, length, molecular mass and amino acid (AA) sequence have a significant influence on protein solubility. Here, we mainly focused on AA composition and explored those that most affected the soluble expression level of human serum albumin (HSA) domain antibody (dAb). The soluble expression and sequence of 65 dAb variants were analysed using clustering and linear modelling. Certain AAs significantly affected the soluble expression level of dAb, with the specific AA combinations being (S, R, N, D, Q), (G, R, C, N, S) and (R, S, G); these combinations respectively affected the dAb expression level in the broth supernatant, the level in the pellet lysate and total soluble dAb. Among the 20 AAs, R displayed a negative influence on the soluble expression level, whereas G and S showed positive effects. A linear model was built to predict the soluble expression level from the sequence; this model had a prediction accuracy of 80%. In summary, increasing the content of polar AAs, especially G and S, and decreasing the content of R, was helpful to improve the soluble expression level of HSA dAb.
人们普遍认为,诸如等电点、长度、分子量和氨基酸(AA)序列等特征对蛋白质溶解度有重大影响。在此,我们主要关注氨基酸组成,并探索那些对人血清白蛋白(HSA)结构域抗体(dAb)的可溶性表达水平影响最大的因素。使用聚类和线性建模分析了65种dAb变体的可溶性表达和序列。某些氨基酸对dAb的可溶性表达水平有显著影响,具体的氨基酸组合为(丝氨酸、精氨酸、天冬酰胺、天冬氨酸、谷氨酰胺)、(甘氨酸、精氨酸、半胱氨酸、天冬酰胺、丝氨酸)和(精氨酸、丝氨酸、甘氨酸);这些组合分别影响肉汤上清液中dAb的表达水平、沉淀裂解物中的水平和总可溶性dAb。在20种氨基酸中,精氨酸对可溶性表达水平有负面影响,而甘氨酸和丝氨酸则显示出正面影响。建立了一个线性模型,根据序列预测可溶性表达水平;该模型的预测准确率为80%。总之,增加极性氨基酸的含量,尤其是甘氨酸和丝氨酸,并降低精氨酸的含量,有助于提高HSA dAb的可溶性表达水平。