Thulin C D, Taylor J A, Walsh K A
Department of Biochemistry, University of Washington, Seattle 98195, USA.
Protein Sci. 1996 Jun;5(6):1157-64. doi: 10.1002/pro.5560050618.
Filaggrin is the product of posttranslational processing of the large, epidermal protein profilaggrin, which consists of 10 or more tandem filaggrin domains plus an amino and a carboxyl domain. According to fragmentary cDNA sequences, the filaggrin domains in the human protein vary at 40% of the amino acid positions; hence, mature filaggrin is a population of homologous but heterogeneous proteins, even within one individual. Available gene sequences give only a limited picture of the heterogeneity of human filaggrin protein because no complete human profilaggrin gene has been sequenced. Questions about the extent of heterogeneity of filaggrin within and between individuals have not been answered, nor have questions concerning the limited proteolytic cleavage of human profilaggrin that generates filaggrin in vivo. In order to address these questions and to provide an analysis of the primary structure of human filaggrins, we employed various methods of mass spectrometry. The intact protein and a tryptic digest of the mixture of human filaggrins were examined by matrix-assisted laser desorption time-of-flight mass spectrometry. Tryptic digests of human filaggrin from single individuals were also separated and analyzed by liquid chromatography/mass spectrometry (LC/MS) (using electrospray mass spectrometry), and specific peptides were identified by tandem mass spectrometry (MS/MS). A robust data analysis program, Sherpa, was developed to facilitate the interpretation of both LC/MS and MS/MS. These experiments show that human filaggrin includes heterogeneity not yet seen in cDNA sequences, but that much structure is highly conserved. Interestingly, we found that the heterogeneity is conserved among individuals. An approximation of the regions linking filaggrins in human profilaggrin is developed. These investigations provide a unique test of the limits of tryptic mapping of complex mixtures using mass spectrometry.
丝聚合蛋白是大型表皮蛋白前丝聚合蛋白经翻译后加工的产物,前丝聚合蛋白由10个或更多个串联的丝聚合蛋白结构域加上一个氨基结构域和一个羧基结构域组成。根据片段性的cDNA序列,人类蛋白质中的丝聚合蛋白结构域在40%的氨基酸位置上存在差异;因此,即使在同一个体内,成熟的丝聚合蛋白也是一组同源但异质的蛋白质。现有的基因序列只能有限地反映人类丝聚合蛋白的异质性,因为尚未对完整的人类前丝聚合蛋白基因进行测序。关于个体内部和个体之间丝聚合蛋白异质性的程度问题尚未得到解答,关于人类前丝聚合蛋白在体内产生丝聚合蛋白的有限蛋白水解切割问题也未得到解答。为了解决这些问题并对人类丝聚合蛋白的一级结构进行分析,我们采用了各种质谱方法。通过基质辅助激光解吸飞行时间质谱对完整的蛋白质和人类丝聚合蛋白混合物的胰蛋白酶消化产物进行了检测。来自单个个体的人类丝聚合蛋白的胰蛋白酶消化产物也通过液相色谱/质谱联用(LC/MS)(使用电喷雾质谱)进行了分离和分析,并通过串联质谱(MS/MS)鉴定了特定的肽段。开发了一个强大的数据分析程序Sherpa,以促进对LC/MS和MS/MS数据的解释。这些实验表明,人类丝聚合蛋白存在cDNA序列中尚未发现的异质性,但许多结构高度保守。有趣的是,我们发现这种异质性在个体之间是保守的。对人类前丝聚合蛋白中连接丝聚合蛋白结构域的区域进行了近似分析。这些研究为使用质谱对复杂混合物进行胰蛋白酶图谱分析的极限提供了独特的检验。