IndexToolkit：一个用于为高通量蛋白质组学索引蛋白质数据库的开源工具箱。

IndexToolkit: an open source toolbox to index protein databases for high-throughput proteomics.

作者信息

Li Dequan, Gao Wen, Ling Charles X, Wang Xiaobiao, Sun Ruixiang, He Simin

机构信息

Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100080, China.

出版信息

Bioinformatics. 2006 Oct 15;22(20):2572-3. doi: 10.1093/bioinformatics/btl410. Epub 2006 Aug 31.

DOI:10.1093/bioinformatics/btl410

PMID:16945944

Abstract

UNLABELLED

A software package, IndexToolkit, aimed at overcoming the disadvantage of FASTA-format databases for frequent searching, is developed to utilize an indexing strategy to substantially accelerate sequence queries. IndexToolkit includes user-friendly tools and an Application Programming Interface (API) to facilitate indexing, storage and retrieval of protein sequence databases. As open source, it provides a sequence-retrieval developing framework, which is easily extensible for high-speed-request proteomic applications, such as database searching or modification discovering. We applied IndexToolkit to database searching engine pFind to demonstrate its effect. Experimental studies show that IndexToolkit is able to support significantly faster searches of protein database.

AVAILABILITY

The IndexToolkit is free to use under the open source GNU GPL license. The source code and the compiled binary can be freely accessed through the website http://pfind.jdl.ac.cn/IndexToolkit. In this website, the more detailed information including screenshots and documentations for users and developers is also available.

摘要

未标注

开发了一个名为IndexToolkit的软件包，旨在克服FASTA格式数据库在频繁搜索方面的缺点，它利用索引策略大幅加速序列查询。IndexToolkit包括用户友好的工具和应用程序编程接口（API），以方便蛋白质序列数据库的索引、存储和检索。作为开源软件，它提供了一个序列检索开发框架，易于扩展以用于高速请求的蛋白质组学应用，如数据库搜索或修饰发现。我们将IndexToolkit应用于数据库搜索引擎pFind以证明其效果。实验研究表明，IndexToolkit能够显著加快蛋白质数据库的搜索速度。

可用性

IndexToolkit在开源的GNU GPL许可下可免费使用。源代码和编译后的二进制文件可通过网站http://pfind.jdl.ac.cn/IndexToolkit免费获取。在该网站上，还提供了更详细的信息，包括用户和开发者的屏幕截图及文档。

相似文献

IndexToolkit: an open source toolbox to index protein databases for high-throughput proteomics.IndexToolkit：一个用于为高通量蛋白质组学索引蛋白质数据库的开源工具箱。

Bioinformatics. 2006 Oct 15;22(20):2572-3. doi: 10.1093/bioinformatics/btl410. Epub 2006 Aug 31.

DBToolkit: processing protein databases for peptide-centric proteomics.DBToolkit：用于以肽为中心的蛋白质组学的蛋白质数据库处理

Bioinformatics. 2005 Sep 1;21(17):3584-5. doi: 10.1093/bioinformatics/bti588. Epub 2005 Jul 19.

pFind 2.0: a software package for peptide and protein identification via tandem mass spectrometry.pFind 2.0：一款通过串联质谱进行肽段和蛋白质鉴定的软件包。

Rapid Commun Mass Spectrom. 2007;21(18):2985-91. doi: 10.1002/rcm.3173.

TOPP--the OpenMS proteomics pipeline.TOPP——开放式质谱蛋白质组学流程

Bioinformatics. 2007 Jan 15;23(2):e191-7. doi: 10.1093/bioinformatics/btl299.

PROTEIOS: an open source proteomics initiative.蛋白质组计划：一项开源蛋白质组学计划。

Bioinformatics. 2005 May 1;21(9):2085-7. doi: 10.1093/bioinformatics/bti291. Epub 2005 Feb 3.

pFind: a novel database-searching software system for automated peptide and protein identification via tandem mass spectrometry.pFind：一种用于通过串联质谱法自动鉴定肽和蛋白质的新型数据库搜索软件系统。

Bioinformatics. 2005 Jul 1;21(13):3049-50. doi: 10.1093/bioinformatics/bti439. Epub 2005 Apr 7.

ProteomeCommons.org JAF: reference information and tools for proteomics.ProteomeCommons.org JAF：蛋白质组学的参考信息与工具。

Bioinformatics. 2006 Mar 1;22(5):632-3. doi: 10.1093/bioinformatics/btk015. Epub 2006 Jan 24.

The proteios software environment: an extensible multiuser platform for management and analysis of proteomics data.Proteios软件环境：一个用于蛋白质组学数据管理与分析的可扩展多用户平台。

J Proteome Res. 2009 Jun;8(6):3037-43. doi: 10.1021/pr900189c.

Proteomics FASTA archive and reference resource.蛋白质组学FASTA存档与参考资源。

Proteomics. 2008 May;8(9):1756-7. doi: 10.1002/pmic.200701194.

BicAT: a biclustering analysis toolbox.BicAT：一个双聚类分析工具箱。

Bioinformatics. 2006 May 15;22(10):1282-3. doi: 10.1093/bioinformatics/btl099. Epub 2006 Mar 21.

引用本文的文献

A survey of computational methods and error rate estimation procedures for peptide and protein identification in shotgun proteomics.用于在鸟枪法蛋白质组学中鉴定肽和蛋白质的计算方法和错误率估计程序的调查。

J Proteomics. 2010 Oct 10;73(11):2092-123. doi: 10.1016/j.jprot.2010.08.009. Epub 2010 Sep 8.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

IndexToolkit：一个用于为高通量蛋白质组学索引蛋白质数据库的开源工具箱。

IndexToolkit: an open source toolbox to index protein databases for high-throughput proteomics.

作者信息

机构信息

出版信息

UNLABELLED

AVAILABILITY

未标注

可用性

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献