Bahreini Fatemeh, Soltanian Ali Reza
Department of Molecular Medicine and Genetics, School of Medicine, Hamadan University of Medical Sciences, Hamadan, Iran.
Modeling of Noncommunicable Diseases Research Center, School of Public Health, Hamadan University of Medical Sciences, Hamadan, Iran.
Cell J. 2019 Jan;20(4):569-575. doi: 10.22074/cellj.2019.5688. Epub 2018 Aug 1.
We sought to apply Shannon's entropy to determine colorectal cancer genes in a microarray dataset.
In the retrospective study, 36 samples were analysed, 18 colorectal carcinoma and 18 paired normal tissue samples. After identification of the gene fold-changes, we used the entropy theory to identify an effective gene set. These genes were subsequently categorised into homogenous clusters.
We assessed 36 tissue samples. The entropy theory was used to select a set of 29 genes from 3128 genes that had fold-changes greater than one, which provided the most information on colorectal cancer. This study shows that all genes fall into a cluster, except for the R08183 gene.
This study has identified several genes associated with colon cancer using the entropy method, which were not detected by custom methods. Therefore, we suggest that the entropy theory should be used to identify genes associated with cancers in a microarray dataset.
我们试图应用香农熵来确定微阵列数据集中的结直肠癌基因。
在这项回顾性研究中,分析了36个样本,其中18个为结直肠癌组织样本,18个为配对的正常组织样本。在确定基因的倍数变化后,我们使用熵理论来确定一组有效的基因集。这些基因随后被分类为同质簇。
我们评估了36个组织样本。熵理论用于从3128个倍数变化大于1的基因中选择一组29个基因,这些基因提供了关于结直肠癌的最多信息。这项研究表明,除R08183基因外,所有基因都属于一个簇。
本研究使用熵方法鉴定了几个与结肠癌相关的基因,这些基因是传统方法未检测到的。因此,我们建议应使用熵理论来鉴定微阵列数据集中与癌症相关的基因。