Department of Biomedical Science and Engineering, Konkuk University, Seoul 05029, Republic of Korea.
Mol Biol Evol. 2024 Mar 1;41(3). doi: 10.1093/molbev/msae040.
Advancements in next-generation sequencing (NGS) technologies have led to a substantial increase in the availability of population genetic variant data, thus prompting the development of various population analysis tools to enhance our understanding of population structure and evolution. The tools that are currently used to analyze population genetic variant data generally require different environments, parameters, and formats of the input data, which can act as a barrier preventing the wide-spread usage of such tools by general researchers who may not be familiar with bioinformatics. To address this problem, we have developed an automated and comprehensive pipeline called PAPipe to perform nine widely used population genetic analyses using population NGS data. PAPipe seamlessly interconnects and serializes multiple steps, such as read trimming and mapping, genetic variant calling, data filtering, and format converting, along with nine population genetic analyses such as principal component analysis, phylogenetic analysis, population tree analysis, population structure analysis, linkage disequilibrium decay analysis, selective sweep analysis, population admixture analysis, sequentially Markovian coalescent analysis, and fixation index analysis. PAPipe also provides an easy-to-use web interface that allows for the parameters to be set and the analysis results to be browsed in intuitive manner. PAPipe can be used to generate extensive results that provide insights that can help enhance user convenience and data usability. PAPipe is freely available at https://github.com/jkimlab/PAPipe.
下一代测序(NGS)技术的进步使得群体遗传变异数据的可用性大大增加,从而促使开发了各种群体分析工具来增强我们对群体结构和进化的理解。目前用于分析群体遗传变异数据的工具通常需要不同的环境、参数和输入数据格式,这可能成为一般研究人员广泛使用这些工具的障碍,因为他们可能不熟悉生物信息学。为了解决这个问题,我们开发了一个名为 PAPipe 的自动化和全面的管道,用于使用群体 NGS 数据执行九种广泛使用的群体遗传分析。PAPipe 无缝地连接和序列化多个步骤,如读取修剪和映射、遗传变异调用、数据过滤和格式转换,以及九种群体遗传分析,如主成分分析、系统发育分析、群体树分析、群体结构分析、连锁不平衡衰减分析、选择清扫分析、群体混合分析、连续马尔可夫凝聚分析和固定指数分析。PAPipe 还提供了一个易于使用的 Web 界面,允许以直观的方式设置参数和浏览分析结果。PAPipe 可用于生成广泛的结果,提供有助于增强用户便利性和数据可用性的见解。PAPipe 可在 https://github.com/jkimlab/PAPipe 上免费获得。