Bajdik Chris D, Kuo Byron, Rusaw Shawn, Jones Steven, Brooks-Wilson Angela
Cancer Control Research Program, BC Cancer Agency, 600 West 10th Avenue, Vancouver BC, V5Z 4E6, Canada.
BMC Bioinformatics. 2005 Mar 29;6:78. doi: 10.1186/1471-2105-6-78.
Online Mendelian Inheritance in Man (OMIM) is a computerized database of information about genes and heritable traits in human populations, based on information reported in the scientific literature. Our objective was to establish an automated text-mining system for OMIM that identifies genetically related cancers and cancer-related genes. We developed the computer program CGMIM to search OMIM for entries that are related to one or more cancer types, and we performed manual searches of OMIM to verify the program's results.
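The abstract does not describe how CGMIM matches entries to cancer types, so the following is only a minimal sketch of one plausible keyword-based approach, not the authors' implementation. The term lists, the function name `cancer_types_in`, and the two toy entries are all hypothetical and are not taken from CGMIM or from OMIM.

```python
import re

# Hypothetical term lists (assumptions for illustration, not CGMIM's actual lists).
SITE_TERMS = {
    "esophagus": ["esophageal", "esophagus"],
    "stomach": ["gastric", "stomach"],
}
DISEASE_WORDS = ["cancer", "carcinoma", "tumor", "neoplasm"]


def cancer_types_in(text):
    """Return the cancer types whose site term shares a sentence with a disease word."""
    found = set()
    # Split on sentence-ending punctuation followed by whitespace.
    for sentence in re.split(r"(?<=[.!?])\s+", text.lower()):
        # Prefix match so that plurals such as "cancers" also count.
        if not any(re.search(rf"\b{re.escape(w)}", sentence) for w in DISEASE_WORDS):
            continue
        for site, terms in SITE_TERMS.items():
            if any(re.search(rf"\b{re.escape(t)}\b", sentence) for t in terms):
                found.add(site)
    return sorted(found)


# Invented toy entries; these are not real OMIM text.
toy_entries = {
    "GENE_A": "Mutations were reported in gastric carcinoma and in esophageal cancer.",
    "GENE_B": "This gene encodes a muscle protein expressed in the stomach.",
}

results = {gene: cancer_types_in(text) for gene, text in toy_entries.items()}
for gene, sites in results.items():
    print(gene, sites)
```

Requiring a disease word in the same sentence is one simple way to avoid counting entries that mention an organ without mentioning cancer, which is also the kind of case a manual check of the results would catch.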
In the OMIM database on September 30, 2004, CGMIM identified 1943 genes related to cancer. BRCA2 (OMIM *600185), BRAF (OMIM *164757) and CDKN2A (OMIM *600160) were each related to 14 types of cancer. There were 45 genes related to cancer of the esophagus, 121 genes related to cancer of the stomach, and 21 genes related to both. Analysis of CGMIM results indicates that fewer than three gene entries in OMIM would be expected to mention both if the two associations were independent. The more than seven-fold discrepancy between the expected and observed counts suggests that cancers of the esophagus and stomach are more genetically related than the current literature suggests.
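The "fewer than three" and "more than seven-fold" figures can be reproduced from the counts reported above. The sketch below assumes the expected overlap was calculated under independence, using the 1943 cancer-related genes as the denominator; the abstract does not state the exact calculation, so that is an assumption, and the function name `expected_overlap` is illustrative.

```python
def expected_overlap(n_a, n_b, n_total):
    """Expected number of genes linked to both cancers if the two links are independent."""
    return n_a * n_b / n_total


# Counts reported in the abstract.
total_genes = 1943    # genes related to any cancer
esophagus = 45        # genes related to cancer of the esophagus
stomach = 121         # genes related to cancer of the stomach
observed_both = 21    # genes related to both

# Assumption: independence, with all cancer-related genes as the denominator.
expected_both = expected_overlap(esophagus, stomach, total_genes)
fold = observed_both / expected_both

print(f"expected overlap: {expected_both:.2f}")  # about 2.80
print(f"observed / expected: {fold:.2f}")        # about 7.49
```

Under this assumption the expected overlap is about 2.8 genes, against 21 observed, which is consistent with the seven-fold discrepancy reported.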
CGMIM identifies genetically related cancers and cancer-related genes. Knowledge of cancers with shared genetic etiology is anticipated to lead, in several ways, to further etiologic hypotheses and to advances regarding environmental agents. CGMIM results are posted monthly, and the source code can be obtained free of charge from the BC Cancer Research Centre website: http://www.bccrc.ca/ccr/CGMIM