Yang Zheng Rong
Department of Computer Science, University of Exeter, Exeter EX4 4QK, UK.
IEEE Trans Biomed Eng. 2006 Oct;53(10):2119-23. doi: 10.1109/TBME.2006.881779.
This paper discusses how to predict hepatitis C virus protease cleavage sites in proteins using generalized linear indicator regression models. The mutual information is used for model-size optimization. Two simulation strategies are adopted, i.e., building a model based on published peptides and building a model based on the published peptides plus newly collected sequences. It is found that the latter outperforms the former significantly. The simulation also shows that the generalized linear indicator regression model far outperforms the multilayer perceptron model.
本文讨论了如何使用广义线性指标回归模型预测蛋白质中的丙型肝炎病毒蛋白酶切割位点。互信息用于模型大小的优化。采用了两种模拟策略,即基于已发表的肽段构建模型和基于已发表的肽段加上新收集的序列构建模型。结果发现,后者的性能明显优于前者。模拟还表明,广义线性指标回归模型的性能远优于多层感知器模型。