• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于识别唾液中癌症蛋白质标志物的计算方法。

Computational methods for recognition of cancer protein markers in saliva.

机构信息

Tianjin Key Laboratory of Intelligence Computing and Novel Software Technology, School of Computer Science and Engineering, Tianjin University of Technology, Tianjin 300384, China.

Information Technology Research Base of Civil Aviation Administration of China, Civil Aviation University of China, Tianjin 300300, China.

出版信息

Math Biosci Eng. 2020 Feb 25;17(3):2453-2469. doi: 10.3934/mbe.2020134.

DOI:10.3934/mbe.2020134
PMID:32233548
Abstract

In recent years, many studies have supported that cancer tissues can make disease-specific changes in some salivary proteins through some mediators in the pathogenesis of systemic diseases. These salivary proteins have the potential to become cancer-specific biomarkers in the early diagnosis stage. How to effectively identify these potential markers is one of the challenging issues. In this paper, we propose novel machine learning methods for recognition cancer biomarkers in saliva by two stages. In the first stage, salivary secreted proteins are recognized which are considered as candidate biomarkers of cancers. We picked up 557 salivary secretory proteins from 20379 human proteins by public databases and published literatures. Then, we present a training set construction strategy to solve the imbalance problem in order to make the classification methods get better accuracy. From all human protein set, the proteins belonging to the same families as salivary secretory proteins are removed. After that, we use SVC-KM method to cluster the remaining proteins, and select negative samples from each cluster in proportion. Next, the features of proteins are calculated by tools. We collect 24 protein properties such as sequence, structure and physicochemical properties, a total of 1087 features. An innovative procedure based on the local samples is proposed for selecting the appropriate features, in order to further improve the performance of SVM classifier. Experimental results show that the average sensitivity, specificity and accuracy of salivary secretory protein recognition using selected 32 features in training set are 97.09%, 98.10%, 97.61%, respectively. The use of these methods can improve the accuracy of recognition by solving the problems of unbalanced sample size and uneven distribution in training set. In the second stage, we apply the best model to dig out the salivary secreted proteins from 58 reported cancer markers, and get a total of 42 proteins which are considered to be used for salivary diagnosis. We analyze the gene expression data of three types of cancer, and predict that 33 genes will appear in saliva after they are translated into proteins. This study provides an important computational tool to help biologists and researchers reduce the number of candidate proteins and the cost of research. So as to further accelerate the discovery of cancer biomarkers in saliva and promote the development of saliva diagnosis.

摘要

近年来,许多研究支持通过系统疾病发病机制中的一些介质,癌症组织可以使一些唾液蛋白发生疾病特异性改变。这些唾液蛋白有可能成为早期诊断阶段的癌症特异性生物标志物。如何有效地识别这些潜在的标志物是具有挑战性的问题之一。在本文中,我们提出了通过两个阶段识别唾液生物标志物的新的机器学习方法。在第一阶段,识别唾液分泌蛋白,这些蛋白被认为是癌症的候选生物标志物。我们从 20379 个人类蛋白中通过公共数据库和已发表的文献中提取了 557 种唾液分泌蛋白。然后,我们提出了一种训练集构建策略来解决不平衡问题,以提高分类方法的准确性。从所有人类蛋白组中,去除与唾液分泌蛋白属于同一家族的蛋白。之后,我们使用 SVC-KM 方法对剩余蛋白进行聚类,并按比例从每个聚类中选择负样本。接下来,使用工具计算蛋白的特征。我们收集了 24 种蛋白特性,如序列、结构和理化特性,共计 1087 种特征。为了进一步提高 SVM 分类器的性能,提出了一种基于局部样本的新的特征选择方法。实验结果表明,在训练集中使用所选的 32 个特征进行唾液分泌蛋白识别的平均灵敏度、特异性和准确性分别为 97.09%、98.10%和 97.61%。这些方法的使用可以通过解决训练集样本大小不平衡和分布不均匀的问题来提高识别的准确性。在第二阶段,我们将最佳模型应用于从 58 个已报道的癌症标志物中挖掘出的唾液分泌蛋白,并得到了 42 种被认为可用于唾液诊断的蛋白。我们分析了三种癌症的基因表达数据,并预测这 33 个基因在翻译成蛋白质后将出现在唾液中。这项研究提供了一个重要的计算工具,帮助生物学家和研究人员减少候选蛋白的数量和研究成本。从而进一步加速唾液生物标志物的发现并促进唾液诊断的发展。

相似文献

1
Computational methods for recognition of cancer protein markers in saliva.用于识别唾液中癌症蛋白质标志物的计算方法。
Math Biosci Eng. 2020 Feb 25;17(3):2453-2469. doi: 10.3934/mbe.2020134.
2
A computational method for prediction of saliva-secretory proteins and its application to identification of head and neck cancer biomarkers for salivary diagnosis.一种预测唾液分泌蛋白的计算方法及其在鉴定用于唾液诊断的头颈癌生物标志物中的应用。
IEEE Trans Nanobioscience. 2015 Mar;14(2):167-74. doi: 10.1109/TNB.2015.2395143. Epub 2015 Feb 6.
3
CapsNet-SSP: multilane capsule network for predicting human saliva-secretory proteins.CapsNet-SSP:用于预测人类唾液分泌蛋白的多车道胶囊网络。
BMC Bioinformatics. 2020 Jun 9;21(1):237. doi: 10.1186/s12859-020-03579-2.
4
Role of Salivary Biomarkers in Oral Cancer Detection.唾液生物标志物在口腔癌检测中的作用。
Adv Clin Chem. 2018;86:23-70. doi: 10.1016/bs.acc.2018.05.002. Epub 2018 Jul 23.
5
Computational prediction of human salivary proteins from blood circulation and application to diagnostic biomarker identification.基于血液循环对人唾液蛋白进行计算预测及其在诊断生物标志物识别中的应用。
PLoS One. 2013 Nov 12;8(11):e80211. doi: 10.1371/journal.pone.0080211. eCollection 2013.
6
Salivary biomarkers in the diagnosis of breast cancer: A review.唾液生物标志物在乳腺癌诊断中的应用:综述。
Crit Rev Oncol Hematol. 2017 Feb;110:62-73. doi: 10.1016/j.critrevonc.2016.12.009. Epub 2016 Dec 21.
7
Recent advances in salivary cancer diagnostics enabled by biosensors and bioelectronics.生物传感器和生物电子学在唾液癌诊断方面的最新进展。
Biosens Bioelectron. 2016 Jul 15;81:181-197. doi: 10.1016/j.bios.2016.02.040. Epub 2016 Feb 18.
8
Mixture classification model based on clinical markers for breast cancer prognosis.基于临床标志物的乳腺癌预后混合分类模型。
Artif Intell Med. 2010 Feb-Mar;48(2-3):129-37. doi: 10.1016/j.artmed.2009.07.008. Epub 2009 Dec 14.
9
Predicting novel salivary biomarkers for the detection of pancreatic cancer using biological feature-based classification.利用基于生物学特征的分类方法预测用于检测胰腺癌的新型唾液生物标志物。
Pathol Res Pract. 2017 Apr;213(4):394-399. doi: 10.1016/j.prp.2016.09.017. Epub 2016 Sep 22.
10
Bioassay of saliva proteins: The best alternative for conventional methods in non-invasive diagnosis of cancer.唾液蛋白生物测定:癌症无创诊断中替代传统方法的最佳选择。
Int J Biol Macromol. 2019 Mar 1;124:1246-1255. doi: 10.1016/j.ijbiomac.2018.11.277. Epub 2018 Dec 1.

引用本文的文献

1
Association of salivary proteins with dental caries in children with mixed dentition: a systematic review.混合牙列期儿童唾液蛋白与龋齿的关联:一项系统评价
Eur Arch Paediatr Dent. 2025 Jan 18. doi: 10.1007/s40368-024-00994-4.