Suppr超能文献

基于指纹、序列和临床症状的新型潜在小分子-微小RNA-癌症关联预测模型

Novel Potential Small Molecule-MiRNA-Cancer Associations Prediction Model Based on Fingerprint, Sequence, and Clinical Symptoms.

作者信息

Li Jinlong, Peng Dongdong, Xie Yun, Dai Zong, Zou Xiaoyong, Li Zhanchao

机构信息

School of Chemistry and Chemical Engineering, Guangdong Pharmaceutical University, Guangzhou 510006, People's Republic of China.

School of Biomedical Engineering, Sun Yat-Sen University, Guangzhou 510275, People's Republic of China.

出版信息

J Chem Inf Model. 2021 May 24;61(5):2208-2219. doi: 10.1021/acs.jcim.0c01458. Epub 2021 Apr 26.

Abstract

As an important biomarker in organisms, miRNA is closely related to various small molecules and diseases. Research on small molecule-miRNA-cancer associations is helpful for the development of cancer treatment drugs and the discovery of pathogenesis. It is very urgent to develop theoretical methods for identifying potential small molecular-miRNA-cancer associations, because experimental approaches are usually time-consuming, laborious, and expensive. To overcome this problem, we developed a new computational method, in which features derived from structure, sequence, and symptoms were utilized to characterize small molecule, miRNA, and cancer, respectively. A feature vector was construct to characterize small molecule-miRNA-cancer association by concatenating these features, and a random forest algorithm was utilized to construct a model for recognizing potential association. Based on the 5-fold cross-validation and benchmark data set, the model achieved an accuracy of 93.20 ± 0.52%, a precision of 93.22 ± 0.51%, a recall of 93.20 ± 0.53%, and an F1-measure of 93.20 ± 0.52%. The areas under the receiver operating characteristic curve and precision recall curve were 0.9873 and 0.9870. The real prediction ability and application performance of the developed method have also been further evaluated and verified through an independent data set test and case study. Some potential small molecules and miRNAs related to cancer have been identified and are worthy of further experimental research. It is anticipated that our model could be regarded as a useful high-throughput virtual screening tool for drug research and development. All source codes can be downloaded from https://github.com/LeeKamlong/Multi-class-SMMCA.

摘要

作为生物体中的一种重要生物标志物,微小RNA(miRNA)与各种小分子及疾病密切相关。对小分子 - miRNA - 癌症关联的研究有助于癌症治疗药物的开发和发病机制的发现。开发识别潜在小分子 - miRNA - 癌症关联的理论方法非常迫切,因为实验方法通常耗时、费力且昂贵。为克服这一问题,我们开发了一种新的计算方法,其中分别利用从结构、序列和症状衍生的特征来表征小分子、miRNA和癌症。通过连接这些特征构建一个特征向量来表征小分子 - miRNA - 癌症关联,并利用随机森林算法构建一个识别潜在关联的模型。基于5折交叉验证和基准数据集,该模型的准确率为93.20 ± 0.52%,精确率为93.22 ± 0.51%,召回率为93.20 ± 0.53%,F1值为93.20 ± 0.52%。受试者工作特征曲线和精确召回曲线下的面积分别为0.9873和0.9870。通过独立数据集测试和案例研究进一步评估和验证了所开发方法的实际预测能力和应用性能。已识别出一些与癌症相关的潜在小分子和miRNA,值得进一步进行实验研究。预计我们的模型可被视为药物研发中一种有用的高通量虚拟筛选工具。所有源代码可从https://github.com/LeeKamlong/Multi-class-SMMCA下载。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验