Ravindran U, Gunavathi C
School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, India.
School of Computer Science and Engineering, Vellore Institute of Technology, Vellore, India.
Prog Biophys Mol Biol. 2023 Jan;177:1-13. doi: 10.1016/j.pbiomolbio.2022.08.004. Epub 2022 Aug 19.
Gene Expression Data is the biological data to extract meaningful hidden information from the gene dataset. This gene information is used for disease diagnosis especially in cancer treatment based on the variations in gene expression levels. DNA microarray is an efficient method for gene expression classification and prediction of cancer disease for specific types of cancer. Due to the abundance of computing power, deep learning (DL) has become a widespread technique in the healthcare sector. The gene expression dataset has a limited number of samples but a large number of features. Data augmentation is needed for gene expression datasets to overcome the dimensionality problem in gene data. It is a technique to generating the synthetic samples to increase the diversity of data. Deep learning methods are designed to learn and extract the features that come from the raw input data in the form of multidimensional arrays. This paper reviews the existing research in deep learning techniques like Feed Forward Neural Network (FFN), Convolutional Neural Network (CNN), Autoencoder (AE) and Recurrent Neural Network (RNN) for the classification and prediction of cancer disease and its types through gene expression data analysis.
基因表达数据是从基因数据集中提取有意义的隐藏信息的生物学数据。这种基因信息尤其用于基于基因表达水平变化的疾病诊断,特别是在癌症治疗中。DNA微阵列是一种用于特定类型癌症的基因表达分类和癌症疾病预测的有效方法。由于计算能力的丰富,深度学习(DL)已成为医疗保健领域广泛应用的技术。基因表达数据集样本数量有限但特征数量众多。基因表达数据集需要进行数据增强以克服基因数据中的维度问题。它是一种生成合成样本以增加数据多样性的技术。深度学习方法旨在学习和提取以多维数组形式来自原始输入数据的特征。本文综述了现有关于深度学习技术的研究,如前馈神经网络(FFN)、卷积神经网络(CNN)、自动编码器(AE)和循环神经网络(RNN),用于通过基因表达数据分析对癌症疾病及其类型进行分类和预测。