WebStruct and VisualStruct: Web interfaces and visualization for Structure software implemented in a cluster environment.

Authors

Jayashree B, Rajgopal S, Hoisington D, Prasanth V P, Chandra S

Affiliation

Bioinformatics Unit, ICRISAT, Patancheru 502324, Andhra Pradesh, India.

Publication information

J Integr Bioinform. 2008 Sep 24;5(1):89. doi: 10.2390/biecoll-jib-2008-89.

Abstract

Structure is a widely used software tool for investigating population genetic structure with multi-locus genotyping data. The software uses an iterative algorithm to group individuals into K clusters, which may represent K genetically distinct subpopulations. The serial implementation of this program is processor-intensive even with small datasets. We describe an implementation of the program within a parallel framework. Speedup was achieved by running different replicates and values of K on each node of the cluster. A web-based, user-oriented GUI has been implemented in PHP, through which the user can specify input parameters for the program. The number of processors to be used can be specified in the background command. A web-based visualization tool, "VisualStruct", written in PHP (with embedded HTML and JavaScript), allows for the graphical display of population clusters output from Structure, where each individual may be visualized as a line segment with K colors defining its possible genomic composition with respect to the K genetic subpopulations. The advantage over available programs is the larger number of individuals that can be visualized. Analyses of real datasets indicate a speedup of up to fourfold when comparing execution on a cluster of eight processors with execution on a single desktop. The software package is freely available to interested users upon request.
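The parallelization strategy in the abstract, running different replicates and values of K on separate nodes, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not say how jobs are scheduled, so the round-robin assignment, the function names (`build_jobs`, `assign_round_robin`, `ideal_speedup`), and the example ranges of K and replicate counts are all assumptions chosen for illustration.

```python
from itertools import product


def build_jobs(k_values, replicates):
    """Return every (K, replicate) pair as an independent job.

    Each pair is assumed to be one standalone Structure run, so the
    jobs can be executed in any order on any node.
    """
    return [(k, r) for k, r in product(k_values, range(1, replicates + 1))]


def assign_round_robin(jobs, n_workers):
    """Deal jobs out to workers in turn (hypothetical scheduling scheme).

    Returns a list with one job list per worker.
    """
    buckets = [[] for _ in range(n_workers)]
    for i, job in enumerate(jobs):
        buckets[i % n_workers].append(job)
    return buckets


def ideal_speedup(n_jobs, n_workers):
    """Upper bound on speedup, assuming every job takes equal time.

    Serial time is n_jobs units; parallel time is the number of
    rounds needed, ceil(n_jobs / n_workers).
    """
    rounds = -(-n_jobs // n_workers)  # ceiling division
    return n_jobs / rounds


if __name__ == "__main__":
    # Illustrative values only: K = 1..5 with 3 replicates each, 8 workers.
    jobs = build_jobs(k_values=range(1, 6), replicates=3)
    plan = assign_round_robin(jobs, n_workers=8)
    for worker, assigned in enumerate(plan):
        print(f"worker {worker}: {assigned}")
    print("ideal speedup:", ideal_speedup(len(jobs), 8))
```

Under the equal-runtime assumption, the ideal figure is only an upper bound. Runs at different K generally take different amounts of time, and scheduling overhead adds to the total, which is consistent with an observed speedup (up to four on eight processors) below the processor count.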
