Chiang Jung-Hsien, Shin Jyh-Wei, Liu Heng-Hui, Chin Chong-Liang
Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan, Taiwan.
BMC Bioinformatics. 2006 Aug 29;7:392. doi: 10.1186/1471-2105-7-392.
Abundant information about gene products is stored in online searchable databases such as annotation or literature. To efficiently obtain and digest such information, there is a pressing need for automated information-summarization and functional-similarity clustering of genes.
We have developed a novel method for semantic measurement of annotation and integrated it with a biomedical literature summarization system to establish a platform, GeneLibrarian, to provide users well-organized information about any specific group of genes (e.g. one cluster of genes from a microarray chip) they might be interested in. The GeneLibrarian generates a summarized viewgraph of candidate genes for a user based on his/her preference and delivers the desired background information effectively to the user. The summarization technique involves optimizing the text mining algorithm and Gene Ontology-based clustering method to enable the discovery of gene relations.
GeneLibrarian is a Java-based web application that automates the process of retrieving critical information from the literature and expanding the number of potential genes for further analysis. This study concentrates on providing well organized information to users and we believe that will be useful in their researches. GeneLibrarian is available on http://gen.csie.ncku.edu.tw/GeneLibrarian/.
关于基因产物的大量信息存储在诸如注释或文献等可在线搜索的数据库中。为了有效地获取和消化这些信息,迫切需要对基因进行自动信息汇总和功能相似性聚类。
我们开发了一种用于注释语义测量的新方法,并将其与生物医学文献汇总系统集成,以建立一个名为GeneLibrarian的平台,为用户提供有关他们可能感兴趣的任何特定基因组(例如来自微阵列芯片的一组基因)的条理清晰的信息。GeneLibrarian根据用户偏好为其生成候选基因的汇总视图,并有效地向用户提供所需的背景信息。汇总技术涉及优化文本挖掘算法和基于基因本体的聚类方法,以发现基因关系。
GeneLibrarian是一个基于Java的Web应用程序,它自动执行从文献中检索关键信息并扩展潜在基因数量以进行进一步分析的过程。本研究专注于为用户提供条理清晰的信息,我们相信这将对他们的研究有用。可通过http://gen.csie.ncku.edu.tw/GeneLibrarian/访问GeneLibrarian。