• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用分类器集成利用局部和全局序列信息进行准确的错义突变预测。

TargetMM: Accurate Missense Mutation Prediction by Utilizing Local and Global Sequence Information with Classifier Ensemble.

机构信息

School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094,China.

College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023,China.

出版信息

Comb Chem High Throughput Screen. 2022;25(1):38-52. doi: 10.2174/1386207323666201204140438.

DOI:10.2174/1386207323666201204140438
PMID:33280588
Abstract

AIM AND OBJECTIVE

Missense mutation (MM) may lead to various human diseases by disabling proteins. Accurate prediction of MM is important and challenging for both protein function annotation and drug design. Although several computational methods yielded acceptable success rates, there is still room for further enhancing the prediction performance of MM.

MATERIALS AND METHODS

In the present study, we designed a new feature extracting method, which considers the impact degree of residues in the microenvironment range to the mutation site. Stringent cross-validation and independent test on benchmark datasets were performed to evaluate the efficacy of the proposed feature extracting method. Furthermore, three heterogeneous prediction models were trained and then ensembled for the final prediction. By combining the feature representation method and classifier ensemble technique, we reported a novel MM predictor called TargetMM for identifying the pathogenic mutations from the neutral ones.

RESULTS

Comparison outcomes based on statistical evaluation demonstrate that TargetMM outperforms the prior advanced methods on the independent test data. The source codes and benchmark datasets of TargetMM are freely available at https://github.com/sera616/TargetMM.git for academic use.

摘要

目的和目标

错义突变 (MM) 可能通过使蛋白质失活导致各种人类疾病。准确预测 MM 对于蛋白质功能注释和药物设计都非常重要且具有挑战性。尽管有几种计算方法取得了可接受的成功率,但仍有进一步提高 MM 预测性能的空间。

材料和方法

在本研究中,我们设计了一种新的特征提取方法,该方法考虑了微环境范围内残基对突变位点的影响程度。在基准数据集上进行了严格的交叉验证和独立测试,以评估所提出的特征提取方法的功效。此外,还训练了三个异构预测模型,然后进行集成以进行最终预测。通过结合特征表示方法和分类器集成技术,我们报告了一种名为 TargetMM 的新型 MM 预测器,用于从中性突变中识别致病性突变。

结果

基于统计评估的比较结果表明,TargetMM 在独立测试数据上优于先前的先进方法。TargetMM 的源代码和基准数据集可在 https://github.com/sera616/TargetMM.git 上免费获取,供学术使用。

相似文献

1
TargetMM: Accurate Missense Mutation Prediction by Utilizing Local and Global Sequence Information with Classifier Ensemble.利用分类器集成利用局部和全局序列信息进行准确的错义突变预测。
Comb Chem High Throughput Screen. 2022;25(1):38-52. doi: 10.2174/1386207323666201204140438.
2
A novel missense-mutation-related feature extraction scheme for 'driver' mutation identification.一种新型的错义突变相关特征提取方案,用于“驱动”突变识别。
Bioinformatics. 2012 Nov 15;28(22):2948-55. doi: 10.1093/bioinformatics/bts558. Epub 2012 Oct 7.
3
ECAmyloid: An amyloid predictor based on ensemble learning and comprehensive sequence-derived features.ECAmyloid:一种基于集成学习和综合序列衍生特征的淀粉样蛋白预测器。
Comput Biol Chem. 2023 Jun;104:107853. doi: 10.1016/j.compbiolchem.2023.107853. Epub 2023 Mar 23.
4
StackDPP: a stacking ensemble based DNA-binding protein prediction model.StackDPP:一种基于堆叠集成的 DNA 结合蛋白预测模型。
BMC Bioinformatics. 2024 Mar 14;25(1):111. doi: 10.1186/s12859-024-05714-9.
5
MutTMPredictor: Robust and accurate cascade XGBoost classifier for prediction of mutations in transmembrane proteins.MutTMPredictor:用于预测跨膜蛋白突变的强大且准确的级联XGBoost分类器。
Comput Struct Biotechnol J. 2021 Nov 19;19:6400-6416. doi: 10.1016/j.csbj.2021.11.024. eCollection 2021.
6
SPPPred: Sequence-Based Protein-Peptide Binding Residue Prediction Using Genetic Programming and Ensemble Learning.SPPPred:基于遗传编程和集成学习的序列基蛋白-肽结合残基预测。
IEEE/ACM Trans Comput Biol Bioinform. 2023 May-Jun;20(3):2029-2040. doi: 10.1109/TCBB.2022.3230540. Epub 2023 Jun 5.
7
Predicting deleterious missense genetic variants via integrative supervised nonnegative matrix tri-factorization.通过集成监督非负矩阵三因子分解预测有害错义遗传变异。
Sci Rep. 2021 Dec 9;11(1):23747. doi: 10.1038/s41598-021-03230-x.
8
Minimalist ensemble algorithms for genome-wide protein localization prediction.基因组范围内蛋白质定位预测的简约集成算法。
BMC Bioinformatics. 2012 Jul 3;13:157. doi: 10.1186/1471-2105-13-157.
9
iAtbP-Hyb-EnC: Prediction of antitubercular peptides via heterogeneous feature representation and genetic algorithm based ensemble learning model.iAtbP-Hyb-EnC:基于异质特征表示和遗传算法的集合学习模型对抗结核肽的预测。
Comput Biol Med. 2021 Oct;137:104778. doi: 10.1016/j.compbiomed.2021.104778. Epub 2021 Aug 25.
10
MIonSite: Ligand-specific prediction of metal ion-binding sites via enhanced AdaBoost algorithm with protein sequence information.MIonSite:利用AdaBoost 算法增强并结合蛋白质序列信息进行配体特异性金属离子结合位点预测。
Anal Biochem. 2019 Feb 1;566:75-88. doi: 10.1016/j.ab.2018.11.009. Epub 2018 Nov 9.

引用本文的文献

1
Improved in Silico Identification of Protein-Protein Interactions Using Deep Learning Approach.使用深度学习方法改进蛋白质-蛋白质相互作用的计算机模拟识别
IET Syst Biol. 2025 Jan-Dec;19(1):e70008. doi: 10.1049/syb2.70008.
2
MFP-MFL: Leveraging Graph Attention and Multi-Feature Integration for Superior Multifunctional Bioactive Peptide Prediction.MFP-MFL:利用图注意力和多特征整合实现卓越的多功能生物活性肽预测
Int J Mol Sci. 2025 Feb 4;26(3):1317. doi: 10.3390/ijms26031317.
3
DeepBP: Ensemble deep learning strategy for bioactive peptide prediction.
DeepBP:用于生物活性肽预测的集成深度学习策略。
BMC Bioinformatics. 2024 Nov 11;25(1):352. doi: 10.1186/s12859-024-05974-5.
4
PLMACPred prediction of anticancer peptides based on protein language model and wavelet denoising transformation.基于蛋白质语言模型和小波去噪变换的抗癌肽 PLMACPred 预测。
Sci Rep. 2024 Jul 23;14(1):16992. doi: 10.1038/s41598-024-67433-8.
5
Review and Comparative Analysis of Methods and Advancements in Predicting Protein Complex Structure.蛋白质复合物结构预测方法及进展的回顾与比较分析。
Interdiscip Sci. 2024 Jun;16(2):261-288. doi: 10.1007/s12539-024-00626-x. Epub 2024 Jul 2.
6
DPI_CDF: druggable protein identifier using cascade deep forest.DPI_CDF:基于级联深度森林的可成药性蛋白识别方法。
BMC Bioinformatics. 2024 Apr 5;25(1):145. doi: 10.1186/s12859-024-05744-3.
7
iMRSAPred: Improved Prediction of Anti-MRSA Peptides Using Physicochemical and Pairwise Contact-Energy Properties of Amino Acids.iMRSAPred:利用氨基酸的物理化学性质和成对接触能特性改进抗耐甲氧西林金黄色葡萄球菌肽的预测
ACS Omega. 2024 Jan 3;9(2):2874-2883. doi: 10.1021/acsomega.3c08303. eCollection 2024 Jan 16.
8
An initial prediction and fine-tuning model based on improving GCN for 3D human motion prediction.一种基于改进图卷积网络(GCN)的用于3D人体运动预测的初始预测和微调模型。
Front Comput Neurosci. 2023 Apr 5;17:1145209. doi: 10.3389/fncom.2023.1145209. eCollection 2023.
9
Accurate few-shot object counting with Hough matching feature enhancement.基于霍夫匹配特征增强的精确少样本目标计数
Front Comput Neurosci. 2023 Mar 30;17:1145219. doi: 10.3389/fncom.2023.1145219. eCollection 2023.
10
IGPred-HDnet: Prediction of Immunoglobulin Proteins Using Graphical Features and the Hierarchal Deep Learning-Based Approach.IGPred-HDnet:基于图特征和层次深度学习的免疫球蛋白蛋白预测方法。
Comput Intell Neurosci. 2023 Jan 25;2023:2465414. doi: 10.1155/2023/2465414. eCollection 2023.