• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于蛋白质相互作用网络和混合特性预测有害非同义 SNPs。

Prediction of deleterious non-synonymous SNPs based on protein interaction network and hybrid properties.

机构信息

Key Laboratory of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, People's Republic of China.

出版信息

PLoS One. 2010 Jul 30;5(7):e11900. doi: 10.1371/journal.pone.0011900.

DOI:10.1371/journal.pone.0011900
PMID:20689580
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2912763/
Abstract

Non-synonymous SNPs (nsSNPs), also known as Single Amino acid Polymorphisms (SAPs) account for the majority of human inherited diseases. It is important to distinguish the deleterious SAPs from neutral ones. Most traditional computational methods to classify SAPs are based on sequential or structural features. However, these features cannot fully explain the association between a SAP and the observed pathophysiological phenotype. We believe the better rationale for deleterious SAP prediction should be: If a SAP lies in the protein with important functions and it can change the protein sequence and structure severely, it is more likely related to disease. So we established a method to predict deleterious SAPs based on both protein interaction network and traditional hybrid properties. Each SAP is represented by 472 features that include sequential features, structural features and network features. Maximum Relevance Minimum Redundancy (mRMR) method and Incremental Feature Selection (IFS) were applied to obtain the optimal feature set and the prediction model was Nearest Neighbor Algorithm (NNA). In jackknife cross-validation, 83.27% of SAPs were correctly predicted when the optimized 263 features were used. The optimized predictor with 263 features was also tested in an independent dataset and the accuracy was still 80.00%. In contrast, SIFT, a widely used predictor of deleterious SAPs based on sequential features, has a prediction accuracy of 71.05% on the same dataset. In our study, network features were found to be most important for accurate prediction and can significantly improve the prediction performance. Our results suggest that the protein interaction context could provide important clues to help better illustrate SAP's functional association. This research will facilitate the post genome-wide association studies.

摘要

非同义 SNP(nsSNP),又称单氨基酸多态性(SAP),占人类遗传性疾病的绝大多数。区分有害 SAP 与中性 SAP 非常重要。大多数传统的 SAP 分类计算方法都是基于序列或结构特征。然而,这些特征并不能完全解释 SAP 与观察到的病理生理表型之间的关联。我们认为,有害 SAP 预测的更好原理应该是:如果 SAP 位于具有重要功能的蛋白质中,并且能够严重改变蛋白质序列和结构,那么它与疾病的相关性就更高。因此,我们建立了一种基于蛋白质相互作用网络和传统混合特性的有害 SAP 预测方法。每个 SAP 由 472 个特征表示,包括序列特征、结构特征和网络特征。采用最大相关最小冗余(mRMR)方法和增量特征选择(IFS)来获得最佳特征集,并采用最近邻算法(NNA)构建预测模型。在 Jackknife 交叉验证中,当使用优化的 263 个特征时,83.27%的 SAP 被正确预测。优化后的 263 个特征的预测器也在独立数据集上进行了测试,准确率仍然为 80.00%。相比之下,基于序列特征的广泛使用的有害 SAP 预测器 SIFT 在同一数据集上的预测准确率为 71.05%。在我们的研究中,网络特征被发现对准确预测最重要,并且可以显著提高预测性能。我们的结果表明,蛋白质相互作用背景可以提供重要线索,有助于更好地说明 SAP 的功能关联。这项研究将促进全基因组关联研究之后的工作。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b04d/2912763/d4891a7e8884/pone.0011900.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b04d/2912763/c93ab00fbc74/pone.0011900.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b04d/2912763/ad9ce9644bf3/pone.0011900.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b04d/2912763/d4891a7e8884/pone.0011900.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b04d/2912763/c93ab00fbc74/pone.0011900.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b04d/2912763/ad9ce9644bf3/pone.0011900.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b04d/2912763/d4891a7e8884/pone.0011900.g003.jpg

相似文献

1
Prediction of deleterious non-synonymous SNPs based on protein interaction network and hybrid properties.基于蛋白质相互作用网络和混合特性预测有害非同义 SNPs。
PLoS One. 2010 Jul 30;5(7):e11900. doi: 10.1371/journal.pone.0011900.
2
SySAP: a system-level predictor of deleterious single amino acid polymorphisms.SySAP:一种有害单氨基酸变异的系统水平预测因子。
Protein Cell. 2012 Jan;3(1):38-43. doi: 10.1007/s13238-011-1130-2. Epub 2011 Dec 19.
3
Identification of deleterious non-synonymous single nucleotide polymorphisms using sequence-derived information.利用序列衍生信息鉴定有害非同义单核苷酸多态性
BMC Bioinformatics. 2008 Jun 27;9:297. doi: 10.1186/1471-2105-9-297.
4
Predicting deleterious non-synonymous single nucleotide polymorphisms in signal peptides based on hybrid sequence attributes.基于混合序列属性预测信号肽中的有害非同义单核苷酸多态性。
Comput Biol Chem. 2012 Feb;36:31-5. doi: 10.1016/j.compbiolchem.2011.12.001. Epub 2011 Dec 30.
5
Discriminating between deleterious and neutral non-frameshifting indels based on protein interaction networks and hybrid properties.基于蛋白质相互作用网络和混合特性区分有害和中性非移码插入缺失。
Mol Genet Genomics. 2015 Feb;290(1):343-52. doi: 10.1007/s00438-014-0922-5. Epub 2014 Sep 24.
6
Predicting disease-associated substitution of a single amino acid by analyzing residue interactions.通过分析残基相互作用预测单个氨基酸的疾病相关取代。
BMC Bioinformatics. 2011 Jan 12;12:14. doi: 10.1186/1471-2105-12-14.
7
Prediction of tyrosine sulfation with mRMR feature selection and analysis.酪氨酸硫酸化的预测与 mRMR 特征选择和分析。
J Proteome Res. 2010 Dec 3;9(12):6490-7. doi: 10.1021/pr1007152. Epub 2010 Nov 11.
8
Statistical geometry based prediction of nonsynonymous SNP functional effects using random forest and neuro-fuzzy classifiers.基于统计几何学,使用随机森林和神经模糊分类器预测非同义单核苷酸多态性的功能效应
Proteins. 2008 Jun;71(4):1930-9. doi: 10.1002/prot.21838.
9
Prediction of interactiveness of proteins and nucleic acids based on feature selections.基于特征选择的蛋白质和核酸相互作用预测。
Mol Divers. 2010 Nov;14(4):627-33. doi: 10.1007/s11030-009-9198-9. Epub 2009 Oct 9.
10
Combination use of protein-protein interaction network topological features improves the predictive scores of deleterious non-synonymous single-nucleotide polymorphisms.蛋白质-蛋白质相互作用网络拓扑特征的联合使用提高了有害非同义单核苷酸多态性的预测分数。
Amino Acids. 2014 Aug;46(8):2025-35. doi: 10.1007/s00726-014-1760-9. Epub 2014 May 22.

引用本文的文献

1
Rapid discrimination between deleterious and benign missense mutations in the CAGI 6 experiment.在 CAGI 6 实验中快速区分有害和良性错义突变。
Hum Genomics. 2024 Aug 27;18(1):89. doi: 10.1186/s40246-024-00655-z.
2
IDRMutPred: predicting disease-associated germline nonsynonymous single nucleotide variants (nsSNVs) in intrinsically disordered regions.IDRMutPred:预测内在无序区域中与疾病相关的种系非同义单核苷酸变异(nsSNVs)。
Bioinformatics. 2020 Dec 22;36(20):4977-4983. doi: 10.1093/bioinformatics/btaa618.
3
Genomic insights on heterogeneous resistance to vancomycin and teicoplanin in Methicillin-resistant Staphylococcus aureus: A first report from South India.

本文引用的文献

1
Analysis and prediction of the metabolic stability of proteins based on their sequential features, subcellular locations and interaction networks.基于序列特征、亚细胞定位和相互作用网络分析和预测蛋白质的代谢稳定性。
PLoS One. 2010 Jun 4;5(6):e10972. doi: 10.1371/journal.pone.0010972.
2
Prediction of pharmacological and xenobiotic responses to drugs based on time course gene expression profiles.基于时间进程基因表达谱预测药物的药理和外源物质反应。
PLoS One. 2009 Dec 2;4(12):e8126. doi: 10.1371/journal.pone.0008126.
3
Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm.
基因层面揭示耐甲氧西林金黄色葡萄球菌对万古霉素和替考拉宁的异质性耐药机制:来自印度南部的首次报告。
PLoS One. 2019 Dec 30;14(12):e0227009. doi: 10.1371/journal.pone.0227009. eCollection 2019.
4
Identifying Microbiota Signature and Functional Rules Associated With Bacterial Subtypes in Human Intestine.识别与人类肠道细菌亚型相关的微生物群特征和功能规则。
Front Genet. 2019 Nov 15;10:1146. doi: 10.3389/fgene.2019.01146. eCollection 2019.
5
Network-Based Method for Identifying Co- Regeneration Genes in Bone, Dentin, Nerve and Vessel Tissues.基于网络的骨、牙本质、神经和血管组织中共再生基因识别方法。
Genes (Basel). 2017 Oct 2;8(10):252. doi: 10.3390/genes8100252.
6
Leucine to proline substitution by SNP at position 197 in Caspase-9 gene expression leads to neuroblastoma: a bioinformatics analysis.半胱天冬酶-9基因表达中第197位单核苷酸多态性导致的亮氨酸到脯氨酸的替换引发神经母细胞瘤:一项生物信息学分析。
3 Biotech. 2013 Jun;3(3):225-234. doi: 10.1007/s13205-012-0088-y. Epub 2012 Sep 18.
7
Exploring the transcriptome of non-model oleaginous microalga Dunaliella tertiolecta through high-throughput sequencing and high performance computing.通过高通量测序和高性能计算探索非模式产油微藻杜氏盐藻的转录组。
BMC Bioinformatics. 2017 Feb 22;18(1):122. doi: 10.1186/s12859-017-1551-x.
8
HAPPI-2: a Comprehensive and High-quality Map of Human Annotated and Predicted Protein Interactions.HAPPI-2:人类注释和预测蛋白质相互作用的全面且高质量图谱。
BMC Genomics. 2017 Feb 17;18(1):182. doi: 10.1186/s12864-017-3512-1.
9
Computational assessment of feature combinations for pathogenic variant prediction.用于致病变异预测的特征组合的计算评估。
Mol Genet Genomic Med. 2016 Mar 14;4(4):431-46. doi: 10.1002/mgg3.214. eCollection 2016 Jul.
10
mRMR-ABC: A Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling.mRMR-ABC:一种利用微阵列基因表达谱进行癌症分类的混合基因选择算法。
Biomed Res Int. 2015;2015:604910. doi: 10.1155/2015/604910. Epub 2015 Apr 15.
使用SIFT算法预测编码非同义变体对蛋白质功能的影响。
Nat Protoc. 2009;4(7):1073-81. doi: 10.1038/nprot.2009.86. Epub 2009 Jun 25.
4
Pathways to cancer therapy.癌症治疗途径。
Nat Rev Drug Discov. 2008 Nov;7(11):875-6. doi: 10.1038/nrd2748.
5
STRING 8--a global view on proteins and their functional interactions in 630 organisms.STRING 8——关于630种生物中蛋白质及其功能相互作用的全局视图。
Nucleic Acids Res. 2009 Jan;37(Database issue):D412-6. doi: 10.1093/nar/gkn760. Epub 2008 Oct 21.
6
The IMGT/HLA database.国际免疫遗传学信息系统/HLA数据库。
Nucleic Acids Res. 2009 Jan;37(Database issue):D1013-7. doi: 10.1093/nar/gkn662. Epub 2008 Oct 5.
7
Core signaling pathways in human pancreatic cancers revealed by global genomic analyses.通过全基因组分析揭示的人类胰腺癌核心信号通路。
Science. 2008 Sep 26;321(5897):1801-6. doi: 10.1126/science.1164368. Epub 2008 Sep 4.
8
The prediction of interferon treatment effects based on time series microarray gene expression profiles.基于时间序列微阵列基因表达谱预测干扰素治疗效果。
J Transl Med. 2008 Aug 9;6:44. doi: 10.1186/1479-5876-6-44.
9
Identification of deleterious non-synonymous single nucleotide polymorphisms using sequence-derived information.利用序列衍生信息鉴定有害非同义单核苷酸多态性
BMC Bioinformatics. 2008 Jun 27;9:297. doi: 10.1186/1471-2105-9-297.
10
SNAP: predict effect of non-synonymous polymorphisms on function.SNAP:预测非同义多态性对功能的影响。
Nucleic Acids Res. 2007;35(11):3823-35. doi: 10.1093/nar/gkm238. Epub 2007 May 25.