Bryner Darshan, Criscione Stephen, Leith Andrew, Huynh Quyen, Huffer Fred, Neretti Nicola
Naval Surface Warfare Center Panama City Division, Panama City, Florida, United States of America.
Department of Molecular Biology, Cell Biology and Biochemistry, Brown University, Providence, Rhode Island, United States of America.
PLoS Comput Biol. 2017 Jun 15;13(6):e1005586. doi: 10.1371/journal.pcbi.1005586. eCollection 2017 Jun.
A common problem in genomics is to test for associations between two or more genomic features, typically represented as intervals interspersed across the genome. Existing methodologies can test for significant pairwise associations between two genomic intervals; however, they cannot test for associations involving multiple sets of intervals. This limits our ability to uncover more complex, yet biologically important associations between multiple sets of genomic features. We introduce GINOM (Genomic INterval Overlap Model), a new method that enables testing of significant associations between multiple genomic features. We demonstrate GINOM's ability to identify higher-order associations with both simulated and real data. In particular, we used GINOM to explore L1 retrotransposable element insertion bias in lung cancer and found a significant pairwise association between L1 insertions and heterochromatic marks. Unlike other methods, GINOM also detected an association between L1 insertions and gene bodies marked by a facultative heterochromatic mark, which could explain the observed bias for L1 insertions towards cancer-associated genes.
基因组学中的一个常见问题是测试两个或更多基因组特征之间的关联,这些特征通常表示为散布在基因组中的区间。现有方法可以测试两个基因组区间之间的显著成对关联;然而,它们无法测试涉及多组区间的关联。这限制了我们揭示多组基因组特征之间更复杂但生物学上重要的关联的能力。我们引入了GINOM(基因组区间重叠模型),这是一种能够测试多个基因组特征之间显著关联的新方法。我们通过模拟数据和真实数据展示了GINOM识别高阶关联的能力。特别是,我们使用GINOM探索了肺癌中L1逆转座子元件的插入偏向,并发现L1插入与异染色质标记之间存在显著的成对关联。与其他方法不同,GINOM还检测到L1插入与由兼性异染色质标记标记的基因体之间的关联,这可以解释观察到的L1插入对癌症相关基因的偏向。