• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用支持向量机和进化信息预测与单点蛋白质突变相关的人类遗传疾病的发生。

Predicting the insurgence of human genetic diseases associated to single point protein mutations with support vector machines and evolutionary information.

作者信息

Capriotti E, Calabrese R, Casadio R

机构信息

Laboratory of Biocomputing, CIRB/Department of Biology, University of Bologna via Irnerio 42, 40126 Bologna, Italy.

出版信息

Bioinformatics. 2006 Nov 15;22(22):2729-34. doi: 10.1093/bioinformatics/btl423. Epub 2006 Aug 7.

DOI:10.1093/bioinformatics/btl423
PMID:16895930
Abstract

MOTIVATION

Human single nucleotide polymorphisms (SNPs) are the most frequent type of genetic variation in human population. One of the most important goals of SNP projects is to understand which human genotype variations are related to Mendelian and complex diseases. Great interest is focused on non-synonymous coding SNPs (nsSNPs) that are responsible of protein single point mutation. nsSNPs can be neutral or disease associated. It is known that the mutation of only one residue in a protein sequence can be related to a number of pathological conditions of dramatic social impact such as Alzheimer's, Parkinson's and Creutzfeldt-Jakob's diseases. The quality and completeness of presently available SNPs databases allows the application of machine learning techniques to predict the insurgence of human diseases due to single point protein mutation starting from the protein sequence.

RESULTS

In this paper, we develop a method based on support vector machines (SVMs) that starting from the protein sequence information can predict whether a new phenotype derived from a nsSNP can be related to a genetic disease in humans. Using a dataset of 21 185 single point mutations, 61% of which are disease-related, out of 3587 proteins, we show that our predictor can reach more than 74% accuracy in the specific task of predicting whether a single point mutation can be disease related or not. Our method, although based on less information, outperforms other web-available predictors implementing different approaches.

AVAILABILITY

A beta version of the web tool is available at http://gpcr.biocomp.unibo.it/cgi/predictors/PhD-SNP/PhD-SNP.cgi

摘要

动机

人类单核苷酸多态性(SNP)是人类群体中最常见的遗传变异类型。SNP项目的最重要目标之一是了解哪些人类基因型变异与孟德尔疾病和复杂疾病相关。人们对导致蛋白质单点突变的非同义编码SNP(nsSNP)极为关注。nsSNP可能是中性的,也可能与疾病相关。已知蛋白质序列中仅一个残基的突变就可能与许多具有重大社会影响的病理状况有关,如阿尔茨海默病、帕金森病和克雅氏病。当前可用的SNP数据库的质量和完整性使得能够应用机器学习技术,从蛋白质序列出发预测由于单点蛋白质突变而导致的人类疾病的发生。

结果

在本文中,我们开发了一种基于支持向量机(SVM)的方法,该方法从蛋白质序列信息出发,可以预测源自nsSNP的新表型是否可能与人类遗传疾病相关。我们使用了一个包含21185个单点突变的数据集,这些突变来自3587种蛋白质,其中61%与疾病相关,结果表明我们的预测器在预测单点突变是否与疾病相关的特定任务中,准确率可以超过74%。我们的方法虽然基于较少的信息,但优于其他采用不同方法的在线可用预测器。

可用性

网络工具的测试版可在http://gpcr.biocomp.unibo.it/cgi/predictors/PhD-SNP/PhD-SNP.cgi获取。

相似文献

1
Predicting the insurgence of human genetic diseases associated to single point protein mutations with support vector machines and evolutionary information.利用支持向量机和进化信息预测与单点蛋白质突变相关的人类遗传疾病的发生。
Bioinformatics. 2006 Nov 15;22(22):2729-34. doi: 10.1093/bioinformatics/btl423. Epub 2006 Aug 7.
2
Use of estimated evolutionary strength at the codon level improves the prediction of disease-related protein mutations in humans.使用密码子水平的估计进化强度可改善对人类疾病相关蛋白质突变的预测。
Hum Mutat. 2008 Jan;29(1):198-204. doi: 10.1002/humu.20628.
3
Prediction of the phenotypic effects of non-synonymous single nucleotide polymorphisms using structural and evolutionary information.利用结构和进化信息预测非同义单核苷酸多态性的表型效应。
Bioinformatics. 2005 May 15;21(10):2185-90. doi: 10.1093/bioinformatics/bti365. Epub 2005 Mar 3.
4
Predicting protein stability changes from sequences using support vector machines.使用支持向量机从序列预测蛋白质稳定性变化。
Bioinformatics. 2005 Sep 1;21 Suppl 2:ii54-8. doi: 10.1093/bioinformatics/bti1109.
5
Knowledge-based computational mutagenesis for predicting the disease potential of human non-synonymous single nucleotide polymorphisms.基于知识的计算突变分析预测人类非同义单核苷酸多态性的疾病潜能。
J Theor Biol. 2010 Oct 21;266(4):560-8. doi: 10.1016/j.jtbi.2010.07.026. Epub 2010 Jul 23.
6
Statistical geometry based prediction of nonsynonymous SNP functional effects using random forest and neuro-fuzzy classifiers.基于统计几何学,使用随机森林和神经模糊分类器预测非同义单核苷酸多态性的功能效应
Proteins. 2008 Jun;71(4):1930-9. doi: 10.1002/prot.21838.
7
LS-SNP: large-scale annotation of coding non-synonymous SNPs based on multiple information sources.LS-SNP:基于多信息源的编码非同义单核苷酸多态性的大规模注释
Bioinformatics. 2005 Jun 15;21(12):2814-20. doi: 10.1093/bioinformatics/bti442. Epub 2005 Apr 12.
8
Prediction of function changes associated with single-point protein mutations using support vector machines (SVMs).使用支持向量机(SVM)预测与单点蛋白质突变相关的功能变化。
Hum Mutat. 2009 Aug;30(8):1161-6. doi: 10.1002/humu.21039.
9
CCHMM_PROF: a HMM-based coiled-coil predictor with evolutionary information.CCHMM_PROF:一种基于隐马尔可夫模型的带有进化信息的卷曲螺旋预测器。
Bioinformatics. 2009 Nov 1;25(21):2757-63. doi: 10.1093/bioinformatics/btp539. Epub 2009 Sep 10.
10
Computational prediction of the effects of non-synonymous single nucleotide polymorphisms in human DNA repair genes.人类DNA修复基因中非同义单核苷酸多态性影响的计算预测。
Neuroscience. 2007 Apr 14;145(4):1273-9. doi: 10.1016/j.neuroscience.2006.09.004. Epub 2006 Oct 19.

引用本文的文献

1
Language Modelling Techniques for Analysing the Impact of Human Genetic Variation.用于分析人类基因变异影响的语言建模技术
Bioinform Biol Insights. 2025 Sep 2;19:11779322251358314. doi: 10.1177/11779322251358314. eCollection 2025.
2
Computational prediction of high-risk non-synonymous SNPs in human ApoE and their structural impact on amyloid-β interaction in Alzheimer's disease pathogenesis.人类载脂蛋白E中高风险非同义单核苷酸多态性的计算预测及其在阿尔茨海默病发病机制中对淀粉样β蛋白相互作用的结构影响
PLoS One. 2025 Sep 2;20(9):e0331339. doi: 10.1371/journal.pone.0331339. eCollection 2025.
3
Whole exon screening of SLC2A4 gene and the association of rs5435 with type 2 diabetes in a Bangladeshi case-control study.
在一项孟加拉国病例对照研究中对SLC2A4基因进行全外显子筛查以及rs5435与2型糖尿病的关联研究
J Genet Eng Biotechnol. 2025 Sep;23(3):100534. doi: 10.1016/j.jgeb.2025.100534. Epub 2025 Jul 10.
4
"In silico analysis of human TLR3 missense single nucleotide polymorphisms and their potential association with cancer".人Toll样受体3错义单核苷酸多态性的计算机模拟分析及其与癌症的潜在关联
Sci Rep. 2025 Aug 22;15(1):30837. doi: 10.1038/s41598-025-05599-5.
5
Computational prediction of the pathogenic variants of arachidonate 5-lipoxygenase activating protein using Molecular Dynamics simulation.利用分子动力学模拟对花生四烯酸5-脂氧合酶激活蛋白的致病变异进行计算预测。
PLoS One. 2025 Jul 30;20(7):e0329126. doi: 10.1371/journal.pone.0329126. eCollection 2025.
6
Computational Study of Pathogenic Variants in Phenylalanine-4-hydroxylase (PAH): Insights into Structure, Dynamics, and BH Binding.苯丙氨酸-4-羟化酶(PAH)致病变体的计算研究:对结构、动力学和BH结合的见解
ACS Omega. 2025 Jul 2;10(27):29209-29223. doi: 10.1021/acsomega.5c01993. eCollection 2025 Jul 15.
7
Decoding Myosin-3 mutational hotspots: Linking deleterious variants to Duchenne muscular dystrophy severity and psychiatric comorbidities.解读肌球蛋白-3突变热点:将有害变异与杜氏肌营养不良症的严重程度及精神共病联系起来。
PLoS One. 2025 Jul 15;20(7):e0328503. doi: 10.1371/journal.pone.0328503. eCollection 2025.
8
Prognostic Modeling of Deleterious IDUA Mutations L238Q and P385R in Hurler Syndrome Through Molecular Dynamics Simulations.通过分子动力学模拟对黏多糖贮积症I型(Hurler综合征)中有害的艾杜糖醛酸酶(IDUA)突变L238Q和P385R进行预后建模。
Pharmaceuticals (Basel). 2025 Jun 19;18(6):922. doi: 10.3390/ph18060922.
9
Multi-omics insights into biomarkers of breast cancer associated diabetes: a computational approach.乳腺癌相关糖尿病生物标志物的多组学见解:一种计算方法。
Front Med (Lausanne). 2025 Jun 6;12:1572500. doi: 10.3389/fmed.2025.1572500. eCollection 2025.
10
The rs140668532 SNP in GSK-3β gene as a potential biomarker for Alzheimer's disease: Insights from computational modeling.GSK-3β基因中的rs140668532单核苷酸多态性作为阿尔茨海默病的潜在生物标志物:来自计算模型的见解。
Toxicol Rep. 2025 May 28;14:102060. doi: 10.1016/j.toxrep.2025.102060. eCollection 2025 Jun.