Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, NY, 10461, USA.
Department of Genetics, Albert Einstein College of Medicine, Bronx, NY, 10461, USA.
BMC Bioinformatics. 2024 May 7;25(1):179. doi: 10.1186/s12859-024-05794-7.
As genomic studies continue to implicate non-coding sequences in disease, testing the roles of these variants requires insights into the cell type(s) in which they are likely to be mediating their effects. Prior methods for associating non-coding variants with cell types have involved approaches using linkage disequilibrium or ontological associations, incurring significant processing requirements. GaiaAssociation is a freely available, open-source software that enables thousands of genomic loci implicated in a phenotype to be tested for enrichment at regulatory loci of multiple cell types in minutes, permitting insights into the cell type(s) mediating the studied phenotype.
In this work, we present Regulatory Landscape Enrichment Analysis (RLEA) by GaiaAssociation and demonstrate its capability to test the enrichment of 12,133 variants across the cis-regulatory regions of 44 cell types. This analysis was completed in 134.0 ± 2.3 s, highlighting the efficient processing provided by GaiaAssociation. The intuitive interface requires only four inputs, offers a collection of customizable functions, and visualizes variant enrichment in cell-type regulatory regions through a heatmap matrix. GaiaAssociation is available on PyPi for download as a command line tool or Python package and the source code can also be installed from GitHub at https://github.com/GreallyLab/gaiaAssociation .
GaiaAssociation is a novel package that provides an intuitive and efficient resource to understand the enrichment of non-coding variants across the cis-regulatory regions of different cells, empowering studies seeking to identify disease-mediating cell types.
随着基因组研究不断揭示非编码序列在疾病中的作用,测试这些变体的作用需要深入了解它们可能在其中发挥作用的细胞类型。先前将非编码变体与细胞类型相关联的方法涉及使用连锁不平衡或本体关联的方法,需要大量的处理要求。GaiaAssociation 是一款免费的开源软件,可在几分钟内测试数千个与表型相关的基因组位点在多种细胞类型的调控位点是否富集,从而深入了解介导研究表型的细胞类型。
在这项工作中,我们提出了 GaiaAssociation 的调控景观富集分析(RLEA),并证明了它能够测试 12133 个变体在 44 种细胞类型的顺式调控区域中的富集情况。这项分析在 134.0±2.3 秒内完成,突出了 GaiaAssociation 提供的高效处理能力。直观的界面只需要四个输入,提供了一系列可定制的功能,并通过热图矩阵可视化变体在细胞类型调控区域中的富集情况。GaiaAssociation 可在 PyPi 上下载,作为命令行工具或 Python 包使用,也可以从 GitHub 上的 https://github.com/GreallyLab/gaiaAssociation 安装源代码。
GaiaAssociation 是一个新颖的软件包,提供了一种直观高效的资源,用于了解不同细胞的顺式调控区域中非编码变体的富集情况,为识别疾病介导的细胞类型的研究提供了支持。