Laboratório de Evolução, Universidade Federal do Pará, Alameda Leandro Ribeiro, Aldeia, Bragança, Pará, Brazil.
BMC Bioinformatics. 2024 Feb 15;25(1):73. doi: 10.1186/s12859-024-05697-7.
The STRUCTURE software has gained popularity as a tool for population structure and genetic analysis. Nevertheless, formatting data to meet STRUCTURE's specific requirements can be daunting and susceptible to errors, especially when handling multilocus data. This article highlights the creation of a graphical user interface (GUI) application tailored to streamline the process of converting multiple sequence alignments into a single, cohesive file that is compatible with the STRUCTURE software.
The application has been developed utilizing Tkinter for the GUI and Biopython for handling FASTA files. This program processes the files, pinpoints variable sites, and converts the sequences into a binary format. Subsequently, the sequences are concatenated and presented within the graphical interface's text area, enabling users to review and confirm the results. Furthermore, the program stores the concatenated results in a file, delivering a ready-to-use input for the STRUCTURE software.
This application offers an efficient and dependable solution for transforming multiple aligned FASTA files into a concatenated binary format file, which is compatible with the STRUCTURE software. With its user-friendly graphical interface and error-reduction approach, this tool proves invaluable for researchers engaged in population structure and genetic analysis.
STRUCTURE 软件作为一种用于群体结构和遗传分析的工具已经广受欢迎。然而,为了满足 STRUCTURE 的特定要求格式化数据可能会让人望而生畏,并且容易出错,尤其是在处理多基因座数据时。本文介绍了创建一个图形用户界面 (GUI) 应用程序的过程,该应用程序旨在简化将多个序列比对转换为与 STRUCTURE 软件兼容的单个连贯文件的过程。
该应用程序使用 Tkinter 开发了 GUI,并使用 Biopython 处理 FASTA 文件。该程序处理文件,确定可变位点,并将序列转换为二进制格式。然后,序列被连接并在图形界面的文本区域中呈现,使用户能够查看和确认结果。此外,该程序将连接的结果存储在一个文件中,为 STRUCTURE 软件提供了一个即用型的输入。
该应用程序为将多个对齐的 FASTA 文件转换为与 STRUCTURE 软件兼容的连接二进制格式文件提供了一种高效可靠的解决方案。该工具具有用户友好的图形界面和减少错误的方法,对于从事群体结构和遗传分析的研究人员来说非常有价值。