Suppr超能文献

整合数据增强和基于BERT的深度学习以预测源自黑升麻的α-葡萄糖苷酶抑制剂。

Integrating data augmentation and BERT-based deep learning for predicting alpha-glucosidase inhibitors derived from Black Cohosh.

作者信息

Torabi Mohammadreza, Mojtabavi Somayeh, Mahdavi Mohammad, Negahdari Babak, Faramarzi Mohammad Ali, Mazloumi Mohammadali, Ghasemi Fahimeh

机构信息

Department of Medical Biotechnology, School of Advanced Technologies in Medicine, Tehran University of Medical Sciences, Tehran, Iran.

Department of Pharmaceutical Biotechnology, Faculty of Pharmacy & Biotechnology Research Center, Tehran University of Medical Sciences, Tehran, P.O. Box 14155-6451, 1417614411, Iran.

出版信息

Sci Rep. 2025 Aug 27;15(1):31536. doi: 10.1038/s41598-025-14699-1.

Abstract

Diabetes remains one of the critical health issues worldwide, and its prevalence is gaining motion due to prevailing factors such as obesity and a sedentary lifestyle. Traditional herbal medications and natural products, particularly enzyme inhibitors, such as alpha-glucosidase, serve as promising alternatives. This study attempted to identify potent alpha-glucosidase inhibitors by including data augmentation in deep-learning modeling. To achieve the aim, various data augmentation techniques were generated from diverse SMILES strings and augmented deep learning model performances through improved data variability. Fine-tuning of pre-trained models from the Hugging Face repository was performed, and among all, it was shown that the performance of PC10M-450k was the best recall. Further applications consider the model identified as PC10M-450 K. With this model, it was identified actaeaepoxide 3-O-xyloside from Black Cohosh was a potential inhibitor. Further molecular docking and MD simulations presented this compound to interact stably with the enzyme and possess a high inhibition probability when compared to acarbose. The results of insilico drug discovery displayed that actaeaepoxide 3-O-xyloside is pointed out to be a potential candidate for diabetes therapy. In conclusion, the role of augmentation techniques and pre-trained models was also emphasized in the presented investigation to accelerate drug discovery toward more effective therapeutic solutions.

摘要

糖尿病仍然是全球关键的健康问题之一,由于肥胖和久坐不动的生活方式等普遍因素,其患病率正在上升。传统草药和天然产物,特别是酶抑制剂,如α-葡萄糖苷酶,是很有前景的替代物。本研究试图通过在深度学习建模中纳入数据增强来识别有效的α-葡萄糖苷酶抑制剂。为实现这一目标,从不同的SMILES字符串生成了各种数据增强技术,并通过改善数据变异性来增强深度学习模型的性能。对来自Hugging Face库的预训练模型进行了微调,其中,PC10M - 450k的性能显示出最佳召回率。进一步的应用考虑将该模型识别为PC10M - 450K。利用该模型,鉴定出黑升麻中的actaeaepoxide 3 - O -木糖苷是一种潜在抑制剂。进一步的分子对接和分子动力学模拟表明,与阿卡波糖相比,该化合物能与酶稳定相互作用,且具有较高的抑制概率。计算机辅助药物发现的结果表明,actaeaepoxide 3 - O -木糖苷有望成为糖尿病治疗的潜在候选药物。总之,本研究还强调了增强技术和预训练模型在加速药物发现以寻求更有效治疗方案方面的作用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d739/12391535/65f345132c0a/41598_2025_14699_Fig1_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验