Datta Somnath
University of Louisville.
Stat Appl Genet Mol Biol. 2008;7(2):Article7. doi: 10.2202/1544-6115.1345. Epub 2008 Feb 19.
We present our approach to classifying the processed proteomic data that were made available to the participants of the classification competition. Although classification of the spectra was the goal of the competition we feel that proteomic applications to cancer biomarker studies make certain additional demands. For example, one such requirement should be identification of certain features which collectively could differentiate the two groups of samples. Also ideally, the size of the feature set should be small. To that end we propose a linear discriminant classifier based on nine m/z intensity values. Construction and performance of this classifier are discussed.
我们展示了我们对已处理蛋白质组学数据进行分类的方法,这些数据已提供给分类竞赛的参与者。尽管光谱分类是竞赛的目标,但我们认为蛋白质组学在癌症生物标志物研究中的应用有一些额外的要求。例如,这样的一个要求应该是识别某些特征,这些特征共同可以区分两组样本。同样理想的是,特征集的大小应该很小。为此,我们提出了一种基于九个质荷比强度值的线性判别分类器。讨论了该分类器的构建和性能。