Auto-HMM-LMF: feature selection based method for prediction of drug response via autoencoder and hidden Markov model.

Authors

Emdadi Akram, Eslahchi Changiz

Affiliations

Department of Computer and Data Sciences, Faculty of Mathematical Sciences, Shahid Beheshti University, Tehran, Iran.

School of Biological Sciences, Institute for Research in Fundamental Sciences (IPM), 193955746, Tehran, Iran.

Publication information

BMC Bioinformatics. 2021 Jan 28;22(1):33. doi: 10.1186/s12859-021-03974-3.

DOI:10.1186/s12859-021-03974-3

PMID:33509079

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC7844991/

Abstract

BACKGROUND

Predicting the response of cancer cell lines to specific drugs is an essential problem in personalized medicine. Since drug response is closely associated with genomic information in cancer cells, several large panels, each covering hundreds of human cancer cell lines, have been assembled with genomic and pharmacogenomic data. Although several methods have been developed to predict drug response, achieving accurate predictions remains challenging. This study proposes a novel feature selection-based method, named Auto-HMM-LMF, to predict cell line-drug associations accurately. Because the feature space for drug response prediction is very high-dimensional, Auto-HMM-LMF focuses on feature selection, aiming to identify a subset of inputs that contributes significantly to the prediction.

RESULTS

This research introduces a novel method for feature selection from mutation data based on signature assignments and hidden Markov models. We also use autoencoder models for feature selection from gene expression and copy number variation data. After feature selection, a logistic matrix factorization model is applied to predict drug response values. In a comparison with the ensemble feature selection method (EFS), one of the most powerful feature selection methods, the predictive model built on the features selected in this paper performed much better for drug response prediction. Two datasets, the Genomics of Drug Sensitivity in Cancer (GDSC) and the Cancer Cell Line Encyclopedia (CCLE), are used to demonstrate the efficiency of the proposed method across unseen patient cell lines. Evaluation of the proposed model showed that Auto-HMM-LMF can improve on the accuracy of state-of-the-art algorithms and can find useful features for the logistic matrix factorization method.
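The final prediction step can be illustrated with a minimal sketch of generic logistic matrix factorization. It is not the authors' implementation (their code is in the repository linked in the conclusions) and it omits the feature selection stages. It assumes a binary cell line by drug matrix in which 1 means sensitive, models each entry as sigmoid(u_i · v_j), and fits the latent factors by plain gradient descent with L2 regularization. The function names, hyperparameters, and toy matrix are all hypothetical.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def fit_lmf(Y, mask, rank=2, lr=0.1, reg=0.01, epochs=2000, seed=0):
    """Fit latent factors U (cell lines x rank) and V (drugs x rank) so that
    sigmoid(U @ V.T) approximates the observed entries of the binary matrix Y.
    mask is 1 where Y is observed and 0 where it is held out."""
    rng = np.random.default_rng(seed)
    n_cells, n_drugs = Y.shape
    U = 0.1 * rng.standard_normal((n_cells, rank))
    V = 0.1 * rng.standard_normal((n_drugs, rank))
    for _ in range(epochs):
        P = sigmoid(U @ V.T)
        # Gradient of the negative log-likelihood with respect to the logits,
        # restricted to observed entries.
        G = mask * (P - Y)
        grad_U = G @ V + reg * U
        grad_V = G.T @ U + reg * V
        U -= lr * grad_U
        V -= lr * grad_V
    return U, V

def predict_proba(U, V):
    """Predicted probability of sensitivity for every cell line-drug pair."""
    return sigmoid(U @ V.T)

# Toy data (invented for illustration): two groups of cell lines,
# each sensitive to a different pair of drugs.
Y = np.array([[1, 1, 0, 0],
              [1, 1, 0, 0],
              [0, 0, 1, 1],
              [0, 0, 1, 1]], dtype=float)
mask = np.ones_like(Y)
mask[0, 1] = 0.0  # hide one entry to mimic an unmeasured pair

U, V = fit_lmf(Y, mask)
P = predict_proba(U, V)
print(np.round(P, 2))
```

In this sketch the hidden entry is scored only through the latent factors learned from the observed entries, which is the general mechanism that lets a matrix factorization model fill in unmeasured cell line-drug pairs.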

CONCLUSIONS

We describe an application of Auto-HMM-LMF to exploring new candidate drugs for head and neck cancer, which shows that the proposed method is useful in drug repositioning and personalized medicine. The source code of the Auto-HMM-LMF method is available at https://github.com/emdadi/Auto-HMM-LMF.
