Lohmann R, Schneider G, Wrede P
Gesellschaft zur Förderung angewandter Informatik (GFal), Berlin, Germany.
Biopolymers. 1996 Jan;38(1):13-29. doi: 10.1002/(SICI)1097-0282(199601)38:1%3C13::AID-BIP2%3E3.0.CO;2-Z.
An artificial neural network has been developed for the recognition and prediction of transmembrane regions in the amino acid sequences of human integral membrane proteins. It provides an additional prediction method besides the common hydrophobicity analysis by statistical means. Membrane/nonmembrane transition regions are predicted with 92% accuracy in both training and independent test data. The method used for the development of the neural filter is the algorithm of structure evolution. It subjects both the architecture and parameters of the system to a systematical optimization process and carries out local search in the respective structure and parameter spaces. The training technique of incomplete induction as part of the structure evolution provides for a comparatively general solution of the problem that is described by input-output relations only. Seven physiochemical side-chain properties were used to encode the amino acid sequences. It was found that geometric parameters like side-chain volume, bulkiness, or surface area are of minor importance. The properties polarity, refractivity, and hydrophobicity, however, turned out to support feature extraction. It is concluded that membrane transition regions in proteins are encoded in sequences as a characteristic feature based on the respective side-chain properties. The method of structure evolution is described in detail for this particular application and suggestions for further development of amino acid sequence filters are made.
已开发出一种人工神经网络,用于识别和预测人类整合膜蛋白氨基酸序列中的跨膜区域。除了通过统计手段进行常见的疏水性分析外,它还提供了一种额外的预测方法。在训练数据和独立测试数据中,膜/非膜过渡区域的预测准确率均为92%。用于开发神经滤波器的方法是结构进化算法。它对系统的架构和参数进行系统优化过程,并在各自的结构和参数空间中进行局部搜索。作为结构进化一部分的不完全归纳训练技术为仅由输入-输出关系描述的问题提供了一个相对通用的解决方案。使用了七种物理化学侧链性质对氨基酸序列进行编码。发现诸如侧链体积、庞大性或表面积等几何参数的重要性较小。然而,极性、折射性和疏水性等性质被证明有助于特征提取。得出的结论是,蛋白质中的膜过渡区域基于各自的侧链性质作为特征编码在序列中。针对这一特定应用详细描述了结构进化方法,并对氨基酸序列滤波器的进一步发展提出了建议。