Sun Ying, Du Wei, Zhou Chunguang, Zhou You, Cao Zhongbo, Tian Yuan, Wang Yan
IEEE Trans Nanobioscience. 2015 Mar;14(2):167-74. doi: 10.1109/TNB.2015.2395143. Epub 2015 Feb 6.
Human saliva is rich in proteins, which have been used for disease detection such as oral diseases and systematic diseases. In this paper, we present a computational method for predicting secretory proteins in human saliva based on two sets of human proteins from published literatures and public databases. One set contains known proteins which can be secreted into saliva, and the other contains the proteins that are deemed to be not extracellular secretion. The protein features with discerning power between two sets were firstly gathered. Then a classifier was trained based on the identified features to predict whether a protein was saliva-secretory one or not. The average values of the sensitivity, specificity, precision, accuracy, and Matthews correlation coefficient value by 10-fold cross validation repeated 100 times were 80.67%, 90.56%, 90.09%, 85.53%, and 0.7168, respectively. These results indicated that our selected features are informative. We applied the classifier for prediction saliva-secretory proteins out of all human proteins, if a known biomarker was likely to enter into saliva, and the potential salivary biomarkers for head and neck squamous cell carcinoma. We also compared the top 1000 proteins predicted by computational methods in different kind of fluids. This work provided a useful tool for effectively identifying the salivary biomarkers for various human diseases and facilitate the development of salivary diagnosis.
人类唾液富含蛋白质,这些蛋白质已被用于疾病检测,如口腔疾病和全身性疾病。在本文中,我们基于已发表文献和公共数据库中的两组人类蛋白质,提出了一种预测人类唾液中分泌蛋白的计算方法。一组包含可分泌到唾液中的已知蛋白质,另一组包含被认为不是细胞外分泌的蛋白质。首先收集两组之间具有鉴别力的蛋白质特征。然后基于所识别的特征训练分类器,以预测一种蛋白质是否为唾液分泌蛋白。通过100次重复的10折交叉验证得到的灵敏度、特异性、精确度、准确度和马修斯相关系数值的平均值分别为80.67%、90.56%、90.09%、85.53%和0.7168。这些结果表明我们选择的特征具有信息性。我们将该分类器应用于预测所有人类蛋白质中的唾液分泌蛋白、已知生物标志物是否可能进入唾液以及头颈部鳞状细胞癌的潜在唾液生物标志物。我们还比较了通过计算方法在不同类型液体中预测的前1000种蛋白质。这项工作为有效识别各种人类疾病的唾液生物标志物提供了一个有用的工具,并促进了唾液诊断的发展。