• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蛋白质突变指标:用于溶质载体家族6突变致病性预测的机器学习方法

ProteoMutaMetrics: machine learning approaches for solute carrier family 6 mutation pathogenicity prediction.

作者信息

Huang Jiahui, Osthushenrich Tanja, MacNamara Aidan, Mälarstig Anders, Brocchetti Silvia, Bradberry Samuel, Scarabottolo Lia, Ferrada Evandro, Sosnin Sergey, Digles Daniela, Superti-Furga Giulio, Ecker Gerhard F

机构信息

University of Vienna, Department of Pharmaceutical Sciences Vienna Austria

Bayer AG, Division Pharmaceuticals, Biomedical Data Science II Wuppertal Germany.

出版信息

RSC Adv. 2024 Apr 22;14(19):13083-13094. doi: 10.1039/d4ra00748d.

DOI:10.1039/d4ra00748d
PMID:38655474
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11034476/
Abstract

The solute carrier transporter family 6 (SLC6) is of key interest for their critical role in the transport of small amino acids or amino acid-like molecules. Their dysfunction is strongly associated with human diseases such as including schizophrenia, depression, and Parkinson's disease. Linking single point mutations to disease may support insights into the structure-function relationship of these transporters. This work aimed to develop a computational model for predicting the potential pathogenic effect of single point mutations in the SLC6 family. Missense mutation data was retrieved from UniProt, LitVar, and ClinVar, covering multiple protein-coding transcripts. As encoding approach, amino acid descriptors were used to calculate the average sequence properties for both original and mutated sequences. In addition to the full-sequence calculation, the sequences were cut into twelve domains. The domains are defined according to the transmembrane domains of the SLC6 transporters to analyse the regions' contributions to the pathogenicity prediction. Subsequently, several classification models, namely Support Vector Machine (SVM), Logistic Regression (LR), Random Forest (RF), and Extreme Gradient Boosting (XGBoost) with the hyperparameters optimized through grid search were built. For estimation of model performance, repeated stratified k-fold cross-validation was used. The accuracy values of the generated models are in the range of 0.72 to 0.80. Analysis of feature importance indicates that mutations in distinct regions of SLC6 transporters are associated with an increased risk for pathogenicity. When applying the model on an independent validation set, the performance in accuracy dropped to averagely 0.6 with high precision but low sensitivity scores.

摘要

溶质载体转运蛋白家族6(SLC6)因其在小氨基酸或类氨基酸分子转运中的关键作用而备受关注。它们的功能障碍与精神分裂症、抑郁症和帕金森病等人类疾病密切相关。将单点突变与疾病联系起来可能有助于深入了解这些转运蛋白的结构-功能关系。这项工作旨在开发一种计算模型,用于预测SLC6家族单点突变的潜在致病效应。错义突变数据从UniProt、LitVar和ClinVar中检索,涵盖多个蛋白质编码转录本。作为编码方法,使用氨基酸描述符来计算原始序列和突变序列的平均序列特性。除了全序列计算外,还将序列切割成十二个结构域。这些结构域根据SLC6转运蛋白的跨膜结构域定义,以分析各区域对致病性预测的贡献。随后,构建了几种分类模型,即支持向量机(SVM)、逻辑回归(LR)、随机森林(RF)和通过网格搜索优化超参数的极端梯度提升(XGBoost)。为了评估模型性能,使用了重复分层k折交叉验证。生成模型的准确率值在0.72至0.80范围内。特征重要性分析表明,SLC6转运蛋白不同区域的突变与致病性风险增加有关。当将该模型应用于独立验证集时,准确率性能平均降至0.6,精度高但灵敏度得分低。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0d3/11034476/d94be5f4211b/d4ra00748d-f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0d3/11034476/334f3b12790e/d4ra00748d-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0d3/11034476/2570b0b5609f/d4ra00748d-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0d3/11034476/0368d0bae831/d4ra00748d-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0d3/11034476/e8127175db31/d4ra00748d-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0d3/11034476/3f474bd7200f/d4ra00748d-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0d3/11034476/96e8c61f14d5/d4ra00748d-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0d3/11034476/d94be5f4211b/d4ra00748d-f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0d3/11034476/334f3b12790e/d4ra00748d-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0d3/11034476/2570b0b5609f/d4ra00748d-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0d3/11034476/0368d0bae831/d4ra00748d-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0d3/11034476/e8127175db31/d4ra00748d-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0d3/11034476/3f474bd7200f/d4ra00748d-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0d3/11034476/96e8c61f14d5/d4ra00748d-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0d3/11034476/d94be5f4211b/d4ra00748d-f7.jpg

相似文献

1
ProteoMutaMetrics: machine learning approaches for solute carrier family 6 mutation pathogenicity prediction.蛋白质突变指标:用于溶质载体家族6突变致病性预测的机器学习方法
RSC Adv. 2024 Apr 22;14(19):13083-13094. doi: 10.1039/d4ra00748d.
2
Applications of Machine Learning to Diagnosis of Parkinson's Disease.机器学习在帕金森病诊断中的应用。
Brain Sci. 2023 Nov 3;13(11):1546. doi: 10.3390/brainsci13111546.
3
Use of Multiprognostic Index Domain Scores, Clinical Data, and Machine Learning to Improve 12-Month Mortality Risk Prediction in Older Hospitalized Patients: Prospective Cohort Study.使用多预后指标领域评分、临床数据和机器学习提高老年住院患者 12 个月死亡率风险预测:前瞻性队列研究。
J Med Internet Res. 2021 Jun 21;23(6):e26139. doi: 10.2196/26139.
4
Predictive model and risk analysis for peripheral vascular disease in type 2 diabetes mellitus patients using machine learning and shapley additive explanation.基于机器学习和 Shapley 加法解释的 2 型糖尿病患者外周血管疾病预测模型和风险分析。
Front Endocrinol (Lausanne). 2024 Feb 28;15:1320335. doi: 10.3389/fendo.2024.1320335. eCollection 2024.
5
A Risk Prediction Model for Physical Restraints Among Older Chinese Adults in Long-term Care Facilities: Machine Learning Study.长期护理机构中老年人身体约束的风险预测模型:机器学习研究。
J Med Internet Res. 2023 Apr 6;25:e43815. doi: 10.2196/43815.
6
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
7
Application of machine learning model in predicting the likelihood of blood transfusion after hip fracture surgery.机器学习模型在预测髋部骨折手术后输血可能性中的应用。
Aging Clin Exp Res. 2023 Nov;35(11):2643-2656. doi: 10.1007/s40520-023-02550-4. Epub 2023 Sep 21.
8
Deep Learning and Machine Learning with Grid Search to Predict Later Occurrence of Breast Cancer Metastasis Using Clinical Data.利用临床数据,通过深度学习和带网格搜索的机器学习预测乳腺癌转移的后期发生情况。
J Clin Med. 2022 Sep 29;11(19):5772. doi: 10.3390/jcm11195772.
9
Machine learning-based prediction of hospital prolonged length of stay admission at emergency department: a Gradient Boosting algorithm analysis.基于机器学习的急诊科住院时间延长预测:梯度提升算法分析
Front Artif Intell. 2023 Jul 28;6:1179226. doi: 10.3389/frai.2023.1179226. eCollection 2023.
10
Establishment and validation of a heart failure risk prediction model for elderly patients after coronary rotational atherectomy based on machine learning.基于机器学习的老年患者冠状动脉旋磨术后心力衰竭风险预测模型的建立与验证
PeerJ. 2024 Jan 31;12:e16867. doi: 10.7717/peerj.16867. eCollection 2024.

引用本文的文献

1
AISMPred: A Machine Learning Approach for Predicting Anti-Inflammatory Small Molecules.AISMPred:一种预测抗炎小分子的机器学习方法。
Pharmaceuticals (Basel). 2024 Dec 15;17(12):1693. doi: 10.3390/ph17121693.
2
Dinucleotide composition representation -based deep learning to predict scoliosis-associated Fibrillin-1 genotypes.基于二核苷酸组成表示的深度学习预测脊柱侧弯相关的原纤蛋白-1基因型。
Front Genet. 2024 Oct 22;15:1492226. doi: 10.3389/fgene.2024.1492226. eCollection 2024.

本文引用的文献

1
Experimental and Computational Analysis of Newly Identified Pathogenic Mutations in the Creatine Transporter SLC6A8.新型肌酸转运蛋白 SLC6A8 致病突变的实验与计算分析。
J Mol Biol. 2024 Jan 15;436(2):168383. doi: 10.1016/j.jmb.2023.168383. Epub 2023 Dec 7.
2
Accurate proteome-wide missense variant effect prediction with AlphaMissense.使用 AlphaMissense 进行精确的全蛋白质错义变异效应预测。
Science. 2023 Sep 22;381(6664):eadg7492. doi: 10.1126/science.adg7492.
3
Ensembl 2023.Ensembl 2023.
Nucleic Acids Res. 2023 Jan 6;51(D1):D933-D941. doi: 10.1093/nar/gkac958.
4
Recent Advances in Machine Learning Variant Effect Prediction Tools for Protein Engineering.蛋白质工程机器学习变体效应预测工具的最新进展
Ind Eng Chem Res. 2022 May 18;61(19):6235-6245. doi: 10.1021/acs.iecr.1c04943. Epub 2022 Apr 6.
5
Accelerating SLC Transporter Research: Streamlining Knowledge and Validated Tools.加速溶质载体转运体研究:简化知识与经过验证的工具。
Clin Pharmacol Ther. 2022 Sep;112(3):439-442. doi: 10.1002/cpt.2639. Epub 2022 Aug 8.
6
Systematic in silico discovery of novel solute carrier-like proteins from proteomes.系统计算发现蛋白质组中的新型溶质载体样蛋白。
PLoS One. 2022 Jul 28;17(7):e0271062. doi: 10.1371/journal.pone.0271062. eCollection 2022.
7
Principles of Alternating Access in LeuT-fold Transporters: Commonalities and Divergences.LeuT 折叠转运蛋白的交替访问原则:共性与差异。
J Mol Biol. 2022 Oct 15;434(19):167746. doi: 10.1016/j.jmb.2022.167746. Epub 2022 Jul 16.
8
Crystal structures of LeuT reveal conformational dynamics in the outward-facing states.LeuT 的晶体结构揭示了向外开放状态下的构象动态。
J Biol Chem. 2021 Jan-Jun;296:100609. doi: 10.1016/j.jbc.2021.100609. Epub 2021 Apr 1.
9
Proteochemometrics - recent developments in bioactivity and selectivity modeling.药物化学计量学——生物活性和选择性建模的最新进展。
Drug Discov Today Technol. 2019 Dec;32-33:89-98. doi: 10.1016/j.ddtec.2020.08.003. Epub 2020 Sep 20.
10
UniProt: the universal protein knowledgebase in 2021.UniProt:2021 年的通用蛋白质知识库。
Nucleic Acids Res. 2021 Jan 8;49(D1):D480-D489. doi: 10.1093/nar/gkaa1100.