School of Science, Monash University Malaysia, 47500, Bandar Sunway, Selangor Darul Ehsan, Malaysia.
Monash University Malaysia Genomics Facility, 47500, Bandar Sunway, Selangor Darul Ehsan, Malaysia.
BMC Bioinformatics. 2021 Dec 18;22(1):604. doi: 10.1186/s12859-021-04506-9.
In population genomics, polymorphisms that are highly differentiated between geographically separated populations are often suggestive of Darwinian positive selection. Genomic scans have highlighted several such regions in African and non-African populations, but only a handful of these have functional data that clearly associates candidate variations driving the selection process. Fine-Mapping of Adaptive Variation (FineMAV) was developed to address this in a high-throughput manner using population based whole-genome sequences generated by the 1000 Genomes Project. It pinpoints positively selected genetic variants in sequencing data by prioritizing high frequency, population-specific and functional derived alleles.
We developed a stand-alone software that implements the FineMAV statistic. To graphically visualise the FineMAV scores, it outputs the statistics as bigWig files, which is a common file format supported by many genome browsers. It is available as a command-line and graphical user interface. The software was tested by replicating the FineMAV scores obtained using 1000 Genomes Project African, European, East and South Asian populations and subsequently applied to whole-genome sequencing datasets from Singapore and China to highlight population specific variants that can be subsequently modelled. The software tool is publicly available at https://github.com/fadilla-wahyudi/finemav .
The software tool described here determines genome-wide FineMAV scores, using low or high-coverage whole-genome sequencing datasets, that can be used to prioritize a list of population specific, highly differentiated candidate variants for in vitro or in vivo functional screens. The tool displays these scores on the human genome browsers for easy visualisation, annotation and comparison between different genomic regions in worldwide human populations.
在群体基因组学中,在地理上分离的人群之间高度分化的多态性通常表明达尔文正选择。基因组扫描已经在非洲和非非洲人群中突出了几个这样的区域,但其中只有少数几个具有明确关联候选变异驱动选择过程的功能数据。Fine-Mapping of Adaptive Variation (FineMAV) 是为了以高通量的方式解决这个问题而开发的,它使用由 1000 基因组计划生成的基于人群的全基因组序列。它通过优先考虑高频、群体特异性和功能衍生等位基因来确定测序数据中被正选择的遗传变异。
我们开发了一个独立的软件来实现 FineMAV 统计。为了直观地显示 FineMAV 得分,它将统计数据输出为 bigWig 文件,这是许多基因组浏览器支持的常见文件格式。它有命令行和图形用户界面两种形式。该软件通过复制使用 1000 基因组计划非洲、欧洲、东亚和南亚人群获得的 FineMAV 得分进行了测试,随后应用于来自新加坡和中国的全基因组测序数据集,以突出可以随后进行建模的特定于人群的变体。该软件工具可在 https://github.com/fadilla-wahyudi/finemav 上公开获取。
这里描述的软件工具使用低或高覆盖全基因组测序数据集确定全基因组 FineMAV 得分,可用于优先考虑一系列特定于人群的、高度分化的候选变体,以进行体外或体内功能筛选。该工具在人类基因组浏览器上显示这些得分,便于在全球人类群体的不同基因组区域之间进行直观、注释和比较。