Skrabanek L, Campagne F
Institute for Computational Biomedicine and Department of Physiology and Biophysics, Mount Sinai School of Medicine, Box 1218, 1 Gustave L. Levy Place, New York, NY 10029, USA.
Nucleic Acids Res. 2001 Nov 1;29(21):E102-2. doi: 10.1093/nar/29.21.e102.
We describe TissueInfo, a knowledge-based method for the high-throughput identification of tissue expression profiles and tissue specificity. TissueInfo defines a set of tissue information calculations that can be computed for large numbers of genes, expressed sequence tags (ESTs) or proteins. Tissue information records that result from the TissueInfo calculations are used to generate tables suitable for data mining and for the selection of genes according to a given expression profile or specificity. When benchmarked against a test set of 116 proteins and literature information, TissueInfo was found to be accurate for 69% of identified tissue specificities and for 80% of expression profiles. The accuracy of the identifications can be increased if query sequences for which little information is available from dbEST are ignored. Thus, with 80% coverage, TissueInfo achieves an accuracy of 76% for specificity and 89% for expression. For the same set of proteins, the curated tissue specificity offered in SWISS-PROT was accurate in 78% of cases. TissueInfo can be useful for the selection of clones for custom microarrays, selection of training sets for ab initio identification of tissue information, gene discovery and genome-wide predictions. Further information about the program can be found at http://icb.mssm.edu/tissueinfo.
我们描述了TissueInfo,一种基于知识的用于高通量识别组织表达谱和组织特异性的方法。TissueInfo定义了一组可针对大量基因、表达序列标签(EST)或蛋白质进行计算的组织信息计算方法。由TissueInfo计算得出的组织信息记录用于生成适合数据挖掘以及根据给定表达谱或特异性选择基因的表格。当与116种蛋白质的测试集和文献信息进行基准测试时,发现TissueInfo对69%已识别的组织特异性和80%的表达谱是准确的。如果忽略那些在dbEST中可获取信息很少的查询序列,识别的准确性可以提高。因此,在覆盖率为80%的情况下,TissueInfo对于特异性的准确率达到76%,对于表达的准确率达到89%。对于同一组蛋白质,SWISS-PROT中整理的组织特异性在78%的情况下是准确的。TissueInfo可用于为定制微阵列选择克隆、为从头识别组织信息选择训练集、基因发现和全基因组预测。有关该程序的更多信息可在http://icb.mssm.edu/tissueinfo上找到。