Iourov Ivan Y, Vorsanova Svetlana G, Yurov Yuri B
Mental Health Research Center, Russian Academy of Medical Sciences, 117152 Moscow, Russia ; Russian National Research Medical University named after N.I. Pirogov, Separated Structural Unit "Clinical Research Institute of Pediatrics", Ministry of Health of Russian Federation, 125412 Moscow, Russia ; Department of Medical Genetics, Russian Medical Academy of Postgraduate Education, Moscow, 123995 Russia.
Mental Health Research Center, Russian Academy of Medical Sciences, 117152 Moscow, Russia ; Russian National Research Medical University named after N.I. Pirogov, Separated Structural Unit "Clinical Research Institute of Pediatrics", Ministry of Health of Russian Federation, 125412 Moscow, Russia.
Mol Cytogenet. 2014 Dec 9;7(1):98. doi: 10.1186/s13039-014-0098-z. eCollection 2014.
The availability of multiple in silico tools for prioritizing genetic variants widens the possibilities for converting genomic data into biological knowledge. However, in molecular cytogenetics, bioinformatic analyses are generally limited to result visualization or database mining for finding similar cytogenetic data. Obviously, the potential of bioinformatics might go beyond these applications. On the other hand, the requirements for performing successful in silico analyses (i.e. deep knowledge of computer science, statistics etc.) can hinder the implementation of bioinformatics in clinical and basic molecular cytogenetic research. Here, we propose a bioinformatic approach to prioritization of genomic variations that is able to solve these problems.
Selecting gene expression as an initial criterion, we have proposed a bioinformatic approach combining filtering and ranking prioritization strategies, which includes analyzing metabolome and interactome data on proteins encoded by candidate genes. To finalize the prioritization of genetic variants, genomic, epigenomic, interactomic and metabolomic data fusion has been made. Structural abnormalities and aneuploidy revealed by array CGH and FISH have been evaluated to test the approach through determining genotype-phenotype correlations, which have been found similar to those of previous studies. Additionally, we have been able to prioritize copy number variations (CNV) (i.e. differentiate between benign CNV and CNV with phenotypic outcome). Finally, the approach has been applied to prioritize genetic variants in cases of somatic mosaicism (including tissue-specific mosaicism).
In order to provide for an in silico evaluation of molecular cytogenetic data, we have proposed a bioinformatic approach to prioritization of candidate genes and CNV. While having the disadvantage of possible unavailability of gene expression data or lack of expression variability between genes of interest, the approach provides several advantages. These are (i) the versatility due to independence from specific databases/tools or software, (ii) relative algorithm simplicity (possibility to avoid sophisticated computational/statistical methodology) and (iii) applicability to molecular cytogenetic data because of the chromosome-centric nature. In conclusion, the approach is able to become useful for increasing the yield of molecular cytogenetic techniques.
多种用于对基因变异进行优先级排序的计算机工具的出现,拓宽了将基因组数据转化为生物学知识的可能性。然而,在分子细胞遗传学中,生物信息学分析通常仅限于结果可视化或数据库挖掘以寻找相似的细胞遗传学数据。显然,生物信息学的潜力可能不止于这些应用。另一方面,进行成功的计算机分析的要求(即对计算机科学、统计学等的深入了解)可能会阻碍生物信息学在临床和基础分子细胞遗传学研究中的应用。在此,我们提出一种能够解决这些问题的生物信息学方法来对基因组变异进行优先级排序。
以基因表达作为初始标准,我们提出了一种结合过滤和排序优先级策略的生物信息学方法,该方法包括分析候选基因编码蛋白质的代谢组和相互作用组数据。为了确定基因变异的优先级,已进行了基因组、表观基因组、相互作用组和代谢组数据融合。通过确定基因型 - 表型相关性,对阵列比较基因组杂交(array CGH)和荧光原位杂交(FISH)揭示的结构异常和非整倍体进行了评估,以测试该方法,发现其与先前研究的结果相似。此外,我们能够对拷贝数变异(CNV)进行优先级排序(即区分良性CNV和具有表型结果的CNV)。最后,该方法已应用于对体细胞镶嵌现象(包括组织特异性镶嵌现象)病例中的基因变异进行优先级排序。
为了对分子细胞遗传学数据进行计算机评估,我们提出了一种对候选基因和CNV进行优先级排序的生物信息学方法。尽管该方法存在可能无法获得基因表达数据或感兴趣基因之间缺乏表达变异性的缺点,但它具有几个优点。这些优点包括:(i)由于不依赖特定数据库/工具或软件而具有通用性;(ii)相对算法简单(有可能避免复杂的计算/统计方法);(iii)由于以染色体为中心的性质而适用于分子细胞遗传学数据。总之,该方法对于提高分子细胞遗传学技术的效率可能会很有用。