Liang Feng, Matrubutham Udayakumar, Parvizi Babak, Yen Jessica, Duan Daniel, Mirchandani Jyotika, Hashima Sandra, Nguyen Uyen, Ubil Eric, Loewenheim Jake, Yu Xin, Sipes Sara, Williams Wendy, Wang Ling, Bennett Robert, Carrino John
Research and Development, Invitrogen Corporation, Carlsbad, CA 92008, USA.
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D595-9. doi: 10.1093/nar/gkh118.
The ORFDB (http://orf.invitrogen.com/) represents an ongoing effort at Invitrogen Corporation to integrate relevant scientific data with an evolving collection of human and mouse Open Reading Frame (ORF) clones (Ultimate ORF Clones). The ORFDB serves as a central data warehouse enabling researchers to search the ORF collection through its web portal ORFBrowser, allowing researchers to find the Ultimate ORF clones by blast, keyword, GenBank accession, gene symbol, clone ID, Unigene ID, LocusLink ID or through functional relationships by browsing the collection via the Gene Ontology (GO) Browser. As of October 2003, the ORFDB contains 6200 human and 2870 mouse Ultimate ORF clones. All Ultimate ORF clones have been fully sequenced with high quality, and are matched to public reference protein sequences. In addition, the cloned ORFs have been extensively annotated across six categories: Gene, ORF, Clone Format, Protein, SNP and Genomic links, with the information assembled in a format termed the ORFCard. The ORFCard represents an information repository that documents the sequence quality, alignment with respect to public protein sequences, and the latest publicly available information associated with each human and mouse gene represented in the collection.
ORF数据库(http://orf.invitrogen.com/)是英杰公司正在进行的一项工作,旨在将相关科学数据与不断发展的人类和小鼠开放阅读框(ORF)克隆(终极ORF克隆)集合整合起来。ORF数据库作为一个中央数据仓库,使研究人员能够通过其门户网站ORFBrowser搜索ORF克隆集合,从而让研究人员能够通过BLAST、关键词、GenBank登录号、基因符号、克隆ID、Unigene ID、LocusLink ID或通过基因本体(GO)浏览器浏览集合来查找终极ORF克隆,进而根据功能关系找到它们。截至2003年10月,ORF数据库包含6200个人类和2870个小鼠终极ORF克隆。所有终极ORF克隆均已高质量地完成全序列测定,并与公共参考蛋白质序列进行了匹配。此外,克隆的ORF已在六个类别中进行了广泛注释:基因、ORF、克隆格式、蛋白质、单核苷酸多态性(SNP)和基因组链接,相关信息以一种称为ORF卡的格式汇编而成。ORF卡代表了一个信息库,记录了序列质量、与公共蛋白质序列的比对情况以及与该集合中所代表的每个人类和小鼠基因相关的最新公开可用信息。