Abinaya E, Narang Pankaj, Bhardwaj Anshu
Department of Bioinformatics, SASTRA University, Thanjavur, Tamil Nadu, India.
School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi, India.
PLoS One. 2015 Aug 5;10(8):e0134693. doi: 10.1371/journal.pone.0134693. eCollection 2015.
Genetic variations play a crucial role in differential phenotypic outcomes. Given the complexity in establishing this correlation and the enormous data available today, it is imperative to design machine-readable, efficient methods to store, label, search and analyze this data. A semantic approach, FROG: "FingeRprinting Ontology of Genomic variations" is implemented to label variation data, based on its location, function and interactions. FROG has six levels to describe the variation annotation, namely, chromosome, DNA, RNA, protein, variations and interactions. Each level is a conceptual aggregation of logically connected attributes each of which comprises of various properties for the variant. For example, in chromosome level, one of the attributes is location of variation and which has two properties, allosomes or autosomes. Another attribute is variation kind which has four properties, namely, indel, deletion, insertion, substitution. Likewise, there are 48 attributes and 278 properties to capture the variation annotation across six levels. Each property is then assigned a bit score which in turn leads to generation of a binary fingerprint based on the combination of these properties (mostly taken from existing variation ontologies). FROG is a novel and unique method designed for the purpose of labeling the entire variation data generated till date for efficient storage, search and analysis. A web-based platform is designed as a test case for users to navigate sample datasets and generate fingerprints. The platform is available at http://ab-openlab.csir.res.in/frog.
基因变异在不同的表型结果中起着至关重要的作用。鉴于建立这种关联的复杂性以及当今可用的海量数据,设计机器可读、高效的方法来存储、标记、搜索和分析这些数据势在必行。一种语义方法FROG(“基因组变异指纹本体”)被用于根据变异数据的位置、功能和相互作用来标记变异数据。FROG有六个层次来描述变异注释,即染色体、DNA、RNA、蛋白质、变异和相互作用。每个层次都是逻辑相连属性的概念性集合,每个属性都包含变异的各种特性。例如,在染色体层次,其中一个属性是变异位置,它有两个特性,性染色体或常染色体。另一个属性是变异类型,它有四个特性,即插入缺失、缺失、插入、替换。同样,有48个属性和278个特性来捕获六个层次的变异注释。然后为每个特性分配一个比特分数,进而基于这些特性(大多取自现有的变异本体)的组合生成一个二进制指纹。FROG是一种新颖独特的方法,旨在标记迄今为止生成的所有变异数据,以便进行高效存储、搜索和分析。设计了一个基于网络的平台作为测试案例,供用户浏览样本数据集并生成指纹。该平台可在http://ab-openlab.csir.res.in/frog上获取。