• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Sparse group selection and analysis of function-related residue for protein-state recognition.稀疏分组选择和功能相关残基分析在蛋白质状态识别中的应用。
J Comput Chem. 2022 Jul 30;43(20):1342-1354. doi: 10.1002/jcc.26937. Epub 2022 Jun 3.
2
SGL-SVM: A novel method for tumor classification via support vector machine with sparse group Lasso.SGL-SVM:一种通过带稀疏组套索的支持向量机进行肿瘤分类的新方法。
J Theor Biol. 2020 Feb 7;486:110098. doi: 10.1016/j.jtbi.2019.110098. Epub 2019 Nov 28.
3
An interpretable machine learning method for homo-trimeric protein interface residue-residue interaction prediction.一种用于同源三聚体蛋白质界面残基-残基相互作用预测的可解释机器学习方法。
Biophys Chem. 2021 Nov;278:106666. doi: 10.1016/j.bpc.2021.106666. Epub 2021 Aug 13.
4
Resting-State Functional Network Scale Effects and Statistical Significance-Based Feature Selection in Machine Learning Classification.基于静息态功能网络尺度效应和统计显著性的机器学习分类特征选择。
Comput Math Methods Med. 2019 Nov 4;2019:9108108. doi: 10.1155/2019/9108108. eCollection 2019.
5
Hypernetwork Construction and Feature Fusion Analysis Based on Sparse Group Lasso Method on fMRI Dataset.基于功能磁共振成像数据集的稀疏组套索方法的超网络构建与特征融合分析
Front Neurosci. 2020 Feb 12;14:60. doi: 10.3389/fnins.2020.00060. eCollection 2020.
6
A Survey on Sparse Learning Models for Feature Selection.基于稀疏学习模型的特征选择研究综述
IEEE Trans Cybern. 2022 Mar;52(3):1642-1660. doi: 10.1109/TCYB.2020.2982445. Epub 2022 Mar 11.
7
A Feature and Algorithm Selection Method for Improving the Prediction of Protein Structural Class.一种用于改进蛋白质结构类预测的特征与算法选择方法
Comb Chem High Throughput Screen. 2017;20(7):612-621. doi: 10.2174/1386207320666170314103147.
8
Recognition of protein allosteric states and residues: Machine learning approaches.蛋白质变构态和残基的识别:机器学习方法。
J Comput Chem. 2018 Jul 30;39(20):1481-1490. doi: 10.1002/jcc.25218. Epub 2018 Mar 31.
9
A universal deep learning approach for modeling the flow of patients under different severities.一种通用的深度学习方法,用于对不同严重程度的患者进行建模。
Comput Methods Programs Biomed. 2018 Feb;154:191-203. doi: 10.1016/j.cmpb.2017.11.003. Epub 2017 Nov 7.
10
Comparison of radiomics machine-learning classifiers and feature selection for differentiation of sacral chordoma and sacral giant cell tumour based on 3D computed tomography features.基于 3D 计算机断层扫描特征的放射组学机器学习分类器和特征选择在区分骶骨脊索瘤和骶骨巨细胞瘤中的比较。
Eur Radiol. 2019 Apr;29(4):1841-1847. doi: 10.1007/s00330-018-5730-6. Epub 2018 Oct 2.

引用本文的文献

1
Application of Anomaly Detection to Identify Important Features of Protein Dynamics.异常检测在识别蛋白质动力学重要特征中的应用。
ACS Omega. 2025 May 29;10(22):22789-22801. doi: 10.1021/acsomega.4c11546. eCollection 2025 Jun 10.

本文引用的文献

1
t-Distributed Stochastic Neighbor Embedding Method with the Least Information Loss for Macromolecular Simulations.用于大分子模拟的信息损失最小的 t 分布随机邻居嵌入方法。
J Chem Theory Comput. 2018 Nov 13;14(11):5499-5510. doi: 10.1021/acs.jctc.8b00652. Epub 2018 Oct 9.
2
Recognition of protein allosteric states and residues: Machine learning approaches.蛋白质变构态和残基的识别:机器学习方法。
J Comput Chem. 2018 Jul 30;39(20):1481-1490. doi: 10.1002/jcc.25218. Epub 2018 Mar 31.
3
2D-IR Spectroscopy of an AHA Labeled Photoswitchable PDZ2 Domain.一种AHA标记的光开关PDZ2结构域的二维红外光谱
J Phys Chem A. 2017 Dec 14;121(49):9435-9445. doi: 10.1021/acs.jpca.7b09675. Epub 2017 Dec 4.
4
Structure-based prediction of protein allostery.基于结构的蛋白质变构预测。
Curr Opin Struct Biol. 2018 Jun;50:1-8. doi: 10.1016/j.sbi.2017.10.002. Epub 2017 Nov 5.
5
Combining protein sequence, structure, and dynamics: A novel approach for functional evolution analysis of PAS domain superfamily.结合蛋白质序列、结构和动力学:PAS 结构域超家族功能进化分析的新方法。
Protein Sci. 2018 Feb;27(2):421-430. doi: 10.1002/pro.3329. Epub 2017 Nov 2.
6
Grouped Gene Selection of Cancer via Adaptive Sparse Group Lasso Based on Conditional Mutual Information.基于条件互信息的自适应稀疏组套索的癌症基因分组选择。
IEEE/ACM Trans Comput Biol Bioinform. 2018 Nov-Dec;15(6):2028-2038. doi: 10.1109/TCBB.2017.2761871. Epub 2017 Oct 11.
7
Allosteric Communication Networks in Proteins Revealed through Pocket Crosstalk Analysis.通过口袋串扰分析揭示蛋白质中的变构通信网络
ACS Cent Sci. 2017 Sep 27;3(9):949-960. doi: 10.1021/acscentsci.7b00211. Epub 2017 Aug 10.
8
The role of protein dynamics in the evolution of new enzyme function.蛋白质动力学在新酶功能进化中的作用。
Nat Chem Biol. 2016 Nov;12(11):944-950. doi: 10.1038/nchembio.2175. Epub 2016 Sep 12.
9
Probing Allosteric Inhibition Mechanisms of the Hsp70 Chaperone Proteins Using Molecular Dynamics Simulations and Analysis of the Residue Interaction Networks.利用分子动力学模拟和残基相互作用网络分析探究热休克蛋白70伴侣蛋白的变构抑制机制
J Chem Inf Model. 2016 Aug 22;56(8):1490-517. doi: 10.1021/acs.jcim.5b00755. Epub 2016 Aug 1.
10
Rigid Residue Scan Simulations Systematically Reveal Residue Entropic Roles in Protein Allostery.刚性残基扫描模拟系统地揭示了蛋白质变构中残基的熵作用。
PLoS Comput Biol. 2016 Apr 26;12(4):e1004893. doi: 10.1371/journal.pcbi.1004893. eCollection 2016 Apr.

稀疏分组选择和功能相关残基分析在蛋白质状态识别中的应用。

Sparse group selection and analysis of function-related residue for protein-state recognition.

机构信息

Department of Management Science and Engineering, Tongji University, Shanghai, China.

Department of Industrial, Manufacturing and Systems Engineering, University of Texas at Arlington, Arlington, Texas, USA.

出版信息

J Comput Chem. 2022 Jul 30;43(20):1342-1354. doi: 10.1002/jcc.26937. Epub 2022 Jun 3.

DOI:10.1002/jcc.26937
PMID:35656889
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9248267/
Abstract

Machine learning methods have helped to advance wide range of scientific and technological field in recent years, including computational chemistry. As the chemical systems could become complex with high dimension, feature selection could be critical but challenging to develop reliable machine learning based prediction models, especially for proteins as bio-macromolecules. In this study, we applied sparse group lasso (SGL) method as a general feature selection method to develop classification model for an allosteric protein in different functional states. This results into a much improved model with comparable accuracy (Acc) and only 28 selected features comparing to 289 selected features from a previous study. The Acc achieves 91.50% with 1936 selected feature, which is far higher than that of baseline methods. In addition, grouping protein amino acids into secondary structures provides additional interpretability of the selected features. The selected features are verified as associated with key allosteric residues through comparison with both experimental and computational works about the model protein, and demonstrate the effectiveness and necessity of applying rigorous feature selection and evaluation methods on complex chemical systems.

摘要

近年来,机器学习方法在包括计算化学在内的广泛科学技术领域取得了进展。由于化学系统可能变得复杂,维度高,特征选择对于开发可靠的基于机器学习的预测模型至关重要,但具有挑战性,特别是对于生物大分子蛋白质而言。在这项研究中,我们应用稀疏组套索(SGL)方法作为一般特征选择方法,为不同功能状态的别构蛋白开发分类模型。与之前的研究中从 289 个特征中选择相比,这得到了一个改进很多的模型,准确性(Acc)相当,只有 28 个特征。Acc 达到 91.50%,选择了 1936 个特征,远高于基线方法。此外,将蛋白质氨基酸分组为二级结构为所选特征提供了额外的可解释性。通过与模型蛋白质的实验和计算工作进行比较,对所选特征进行了验证,这些特征与关键别构残基相关,并证明了在复杂化学系统中应用严格的特征选择和评估方法的有效性和必要性。