Suppr超能文献

利用数据挖掘和遗传算法进行癌症基因搜索。

Cancer gene search with data-mining and genetic algorithms.

作者信息

Shah Shital, Kusiak Andrew

机构信息

Intelligent Systems Laboratory, MIE, 2139 Seamans Center, The University of Iowa, Iowa City, IA 52242-1527, USA.

出版信息

Comput Biol Med. 2007 Feb;37(2):251-61. doi: 10.1016/j.compbiomed.2006.01.007. Epub 2006 Apr 17.

Abstract

Cancer leads to approximately 25% of all mortalities, making it the second leading cause of death in the United States. Early and accurate detection of cancer is critical to the well being of patients. Analysis of gene expression data leads to cancer identification and classification, which will facilitate proper treatment selection and drug development. Gene expression data sets for ovarian, prostate, and lung cancer were analyzed in this research. An integrated gene-search algorithm for genetic expression data analysis was proposed. This integrated algorithm involves a genetic algorithm and correlation-based heuristics for data preprocessing (on partitioned data sets) and data mining (decision tree and support vector machines algorithms) for making predictions. Knowledge derived by the proposed algorithm has high classification accuracy with the ability to identify the most significant genes. Bagging and stacking algorithms were applied to further enhance the classification accuracy. The results were compared with that reported in the literature. Mapping of genotype information to the phenotype parameters will ultimately reduce the cost and complexity of cancer detection and classification.

摘要

癌症导致了约25%的死亡,使其成为美国第二大死因。癌症的早期准确检测对患者的健康至关重要。基因表达数据分析有助于癌症的识别和分类,这将有助于选择合适的治疗方法和药物开发。本研究分析了卵巢癌、前列腺癌和肺癌的基因表达数据集。提出了一种用于基因表达数据分析的集成基因搜索算法。这种集成算法包括用于数据预处理(对划分后的数据集)的遗传算法和基于相关性的启发式算法,以及用于进行预测的数据挖掘(决策树和支持向量机算法)。所提出的算法得出的知识具有很高的分类准确率,能够识别出最重要的基因。应用装袋法和堆叠法算法进一步提高分类准确率。将结果与文献报道的结果进行了比较。将基因型信息映射到表型参数最终将降低癌症检测和分类的成本和复杂性。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验