Department of Systems and Computer Science, Howard University, Washington, DC 20059, USA.
Comput Math Methods Med. 2012;2012:135780. doi: 10.1155/2012/135780. Epub 2012 Nov 11.
Identifying molecular biomarkers has become one of the important tasks for scientists to assess the different phenotypic states of cells or organisms correlated to the genotypes of diseases from large-scale biological data. In this paper, we proposed a text-mining-based method to discover biomarkers from PubMed. First, we construct a database based on a dictionary, and then we used a finite state machine to identify the biomarkers. Our method of text mining provides a highly reliable approach to discover the biomarkers in the PubMed database.
从大规模的生物数据中鉴定与疾病基因型相关的细胞或生物体的不同表型状态的分子生物标志物已成为科学家的重要任务之一。在本文中,我们提出了一种基于文本挖掘的方法,从 PubMed 中发现生物标志物。首先,我们基于字典构建了一个数据库,然后使用有限状态机来识别生物标志物。我们的文本挖掘方法为在 PubMed 数据库中发现生物标志物提供了一种高度可靠的方法。