RaPiDS: an algorithm for rapid expression profile database search.

Authors

Horton Paul B, Kiseleva Larisa, Fujibuchi Wataru

Affiliation

Computational Biology Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi, Koto-ku, Tokyo 135-0064, Japan.

Publication

Genome Inform. 2006;17(2):67-76.

PMID:17503380
Abstract

In this paper we present a fast algorithm and implementation for computing the Spearman rank correlation (SRC) between a query expression profile and each expression profile in a database of profiles. The algorithm is linear in the size of the profile database with a very small constant factor. It is designed to efficiently handle multiple profile platforms and missing values. We show that our specialized algorithm and C++ implementation can achieve an approximately 100-fold speed-up over a reasonable baseline implementation using Perl hash tables. RaPiDS is designed for general similarity search rather than classification. However, to assess the usefulness of SRC as a similarity measure, we investigate how well the program performs as a classifier of normal human cell types based on gene expression. Specifically, we use the k nearest neighbor classifier with a t statistic derived from SRC as the similarity measure for profile pairs. We estimate the accuracy using a jackknife test on microarray data with manually checked cell type annotation. Preliminary results suggest the measure is useful (64% accuracy on 1,685 profiles vs. the majority class classifier's 17.5%) for profiles measured under similar conditions (same laboratory and chip platform), but that it requires improvement when comparing profiles from different experimental series.
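The abstract does not spell out the algorithm, so the following is only a minimal Python sketch of the general idea, not the RaPiDS method or its C++ implementation. It computes the Spearman rank correlation between a query and each database profile over the genes measured in both, converts it to a t statistic, and takes a k nearest neighbor vote. Several things here are assumptions made for illustration: the dict-based profile representation, the function names (`average_ranks`, `spearman`, `t_statistic`, `search`, `knn_label`), the minimum of three shared genes, and the use of the textbook transform t = rho * sqrt((n - 2) / (1 - rho^2)), since the abstract only says the statistic is "derived from SRC".

```python
import math
from collections import Counter


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman(query, profile):
    """Spearman rank correlation over genes measured in both profiles.

    Profiles are dicts mapping gene id -> value (an assumed representation).
    An absent key or a None value means the gene was not measured.
    Returns (rho, n), where n is the number of shared genes.
    """
    shared = [g for g in query
              if query[g] is not None and profile.get(g) is not None]
    n = len(shared)
    if n < 3:  # assumed cutoff: too few shared genes to score
        return float("nan"), n
    rx = average_ranks([query[g] for g in shared])
    ry = average_ranks([profile[g] for g in shared])
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:  # a constant profile has no defined rank order
        return float("nan"), n
    return cov / math.sqrt(var_x * var_y), n


def t_statistic(rho, n):
    """Textbook t transform of a correlation coefficient (assumed form)."""
    if math.isnan(rho):
        return float("nan")
    if abs(rho) >= 1.0:
        return math.copysign(float("inf"), rho)
    return rho * math.sqrt((n - 2) / (1.0 - rho * rho))


def search(query, database):
    """Score every database profile against the query, best match first."""
    hits = []
    for name, profile in database.items():
        rho, n = spearman(query, profile)
        t = t_statistic(rho, n)
        if not math.isnan(t):
            hits.append((t, name))
    hits.sort(reverse=True)
    return hits


def knn_label(query, database, labels, k=3):
    """Majority label among the k highest-scoring database profiles."""
    top = search(query, database)[:k]
    votes = Counter(labels[name] for _, name in top)
    return votes.most_common(1)[0][0]
```

Because each profile is re-ranked on the genes it shares with the query, the sketch tolerates missing values and profiles from platforms with different gene sets, at the cost of a sort per comparison. A jackknife estimate like the one described above could be obtained by calling `knn_label` on each profile in turn with that profile removed from `database`.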

Cited by

1
Retrieving relevant time-course experiments: a study on Arabidopsis microarrays.
IET Syst Biol. 2016 Jun;10(3):87-93. doi: 10.1049/iet-syb.2015.0042.
2
Bayesian approach to transforming public gene expression repositories into disease diagnosis databases.
Proc Natl Acad Sci U S A. 2010 Apr 13;107(15):6823-8. doi: 10.1073/pnas.0912043107. Epub 2010 Apr 1.
3
Improving gene expression similarity measurement using pathway-based analytic dimension.
BMC Genomics. 2009 Dec 3;10 Suppl 3(Suppl 3):S15. doi: 10.1186/1471-2164-10-S3-S15.