Romano Paolo, Manniello Assunta, Aresu Ottavia, Armento Massimiliano, Cesaro Michela, Parodi Barbara
Bioinformatics, Cell Bank, National Cancer Research Institute and IEIIT, National Research Council, Genova, Italy.
Nucleic Acids Res. 2009 Jan;37(Database issue):D925-32. doi: 10.1093/nar/gkn730. Epub 2008 Oct 15.
The Cell Line Data Base (CLDB) is a well-known reference information source on human and animal cell lines, with information on more than 6000 cell lines. Main biological features are coded according to controlled vocabularies derived from international lists and taxonomies. HyperCLDB (http://bioinformatics.istge.it/hypercldb/) is a hypertext version of CLDB that improves data accessibility by also making the information retrievable by web spiders. Access to HyperCLDB is provided through indexes of biological characteristics, and many internal links support navigation within the hypertext. HyperCLDB also includes links to external resources. Recently, interest has arisen in a reference nomenclature for cell lines, and CLDB has been seen as an authoritative system. Furthermore, to overcome the problem of cell line misidentification, molecular authentication methods such as fingerprinting, single-locus short tandem repeat (STR) profiling and single nucleotide polymorphism validation have been proposed. Because these data are scattered across different sources, a reference portal on authentication of human cell lines is needed. We present here the architecture and contents of CLDB, its recent enhancements and perspectives. We also present a new related database, the Cell Line Integrated Molecular Authentication (CLIMA) database (http://bioinformatics.istge.it/clima/), which allows authentication data to be linked to actual cell lines.