
Accelerating comparative genomics using parallel computing.

Authors

Janaki Chintalapati, Joshi Rajendra R

Affiliation

Bioinformatics Team, Scientific and Engineering Computing Group, Centre for Development of Advanced Computing, Pune University Campus, Ganeshkhind, Pune-411007, India.

Publication

In Silico Biol. 2003;3(4):429-40.

PMID: 12954086

Abstract

In the past decade, the number of completely sequenced genomes has increased, driven by the race among multibillion-dollar genome-sequencing projects. The enormous volume of biological sequence data flooding into the sequence databases necessitates efficient tools for comparative genome sequence analysis. The information deduced by such analysis has various applications, including structural and functional annotation of novel genes and proteins, finding gene order in the genome, gene fusion studies, and constructing metabolic pathways. Such studies are also invaluable to the pharmaceutical industry, for example in in silico drug target identification and new drug discovery. Of the various sequence analysis tools available for mining such information, the FASTA and Smith-Waterman algorithms are widely used. However, analyzing large datasets of genome sequences with these codes appears impractical on uniprocessor machines, so there is a need to improve the performance of these popular tools on parallel cluster computers. The performance of the Smith-Waterman (SSEARCH) and FASTA programs was studied on PARAM 10000, a parallel cluster of workstations designed and developed in-house. The FASTA and SSEARCH programs, which are available from the University of Virginia, were ported to PARAM and optimized. In an era of high-performance computing in which the paradigm is shifting from conventional supercomputers to cost-effective general-purpose clusters of workstations and PCs, this study is highly relevant. Good performance of sequence analysis tools on a cluster of workstations was demonstrated, which is important for accelerating the identification of novel genes and drug targets by screening large databases.
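The abstract does not say how the database search was divided across cluster nodes. One common approach, shown here only as a minimal sketch, is to partition the sequence database into chunks, score each chunk against the query independently, and merge the hits. Everything in the block is an assumption made for illustration: the function names, the scoring values (match 2, mismatch -1, linear gap -2), and the use of Python's `ThreadPoolExecutor` as a stand-in for cluster workers. It is not the SSEARCH or PARAM 10000 implementation.

```python
from concurrent.futures import ThreadPoolExecutor


def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score of a and b (illustrative linear gap penalty)."""
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A local alignment may restart anywhere, so scores never drop below 0.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


def score_chunk(query, chunk):
    """Score the query against every (name, sequence) pair in one chunk."""
    return [(name, smith_waterman_score(query, seq)) for name, seq in chunk]


def parallel_search(query, database, workers=2, executor_cls=ThreadPoolExecutor):
    """Split the database into `workers` chunks, score them concurrently, merge hits."""
    items = list(database.items())
    chunks = [items[k::workers] for k in range(workers)]  # round-robin partition
    with executor_cls(max_workers=workers) as pool:
        partial = pool.map(score_chunk, [query] * workers, chunks)
        hits = [hit for part in partial for hit in part]
    # Highest score first; ties broken by sequence name for a stable order.
    return sorted(hits, key=lambda h: (-h[1], h[0]))


if __name__ == "__main__":
    db = {"s1": "ACGT", "s2": "TTTT", "s3": "ACGTACGT"}  # hypothetical toy database
    print(parallel_search("ACGT", db))
```

Because each database sequence is scored independently, the work divides cleanly among workers and only the final merge is serial. In CPython, threads do not speed up CPU-bound scoring; passing `executor_cls=ProcessPoolExecutor` (from the same module), or using a message-passing library on a real cluster, would be the next step.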


Similar articles

1. Accelerating comparative genomics using parallel computing.
   In Silico Biol. 2003;3(4):429-40.
2. T-iDT: tool for identification of drug target in bacteria and validation by Mycobacterium tuberculosis.
   In Silico Biol. 2006;6(6):485-93.
3. In silico analysis of Burkholderia pseudomallei genome sequence for potential drug targets.
   In Silico Biol. 2006;6(4):341-6.
4. Windows .NET Network Distributed Basic Local Alignment Search Toolkit (W.ND-BLAST).
   BMC Bioinformatics. 2005 Apr 8;6:93. doi: 10.1186/1471-2105-6-93.
5. Comparative and evolutionary genomics of globin genes in fish.
   Methods Enzymol. 2008;436:511-38. doi: 10.1016/S0076-6879(08)36029-7.
6. High performance GRID based implementation for genomics and protein analysis.
   Stud Health Technol Inform. 2006;120:374-80.
7. GenColors: annotation and comparative genomics of prokaryotes made easy.
   Methods Mol Biol. 2007;395:75-96.
8. Annotation, comparison and databases for hundreds of bacterial genomes.
   Res Microbiol. 2007 Dec;158(10):724-36. doi: 10.1016/j.resmic.2007.09.009. Epub 2007 Oct 6.
9. PLATCOM: a Platform for Computational Comparative Genomics.
   Bioinformatics. 2005 May 15;21(10):2514-6. doi: 10.1093/bioinformatics/bti350. Epub 2005 Feb 24.
10. The UCSC Genome Browser Database: update 2006.
    Nucleic Acids Res. 2006 Jan 1;34(Database issue):D590-8. doi: 10.1093/nar/gkj144.

Cited by

1. Comparative genomics allowed the identification of drug targets against human fungal pathogens.
   BMC Genomics. 2011 Jan 27;12:75. doi: 10.1186/1471-2164-12-75.
2. A high productivity/low maintenance approach to high-performance computation for biomedicine: four case studies.
   J Am Med Inform Assoc. 2005 Jan-Feb;12(1):90-8. doi: 10.1197/jamia.M1571. Epub 2004 Oct 18.