Rutgers Institute for Health, Health Care Policy and Aging Research, Rutgers University, 112 Paterson Street, New Brunswick, NJ, USA.
Department of Medicine, Robert Wood Johnson Medical School, Rutgers Biomedical and Health Sciences, 125 Paterson Street, New Brunswick, NJ, USA.
Hum Genomics. 2021 Jun 26;15(1):37. doi: 10.1186/s40246-021-00336-1.
Genetic disposition is considered critical for identifying subjects at high risk for disease development. Investigating disease-causing and high and low expressed genes can support finding the root causes of uncertainties in patient care. However, independent and timely high-throughput next-generation sequencing data analysis is still a challenge for non-computational biologists and geneticists.
In this manuscript, we present a findable, accessible, interactive, and reusable (FAIR) bioinformatics platform, i.e., GVViZ (visualizing genes with disease-causing variants). GVViZ is a user-friendly, cross-platform, and database application for RNA-seq-driven variable and complex gene-disease data annotation and expression analysis with a dynamic heat map visualization. GVViZ has the potential to find patterns across millions of features and extract actionable information, which can support the early detection of complex disorders and the development of new therapies for personalized patient care. The execution of GVViZ is based on a set of simple instructions that users without a computational background can follow to design and perform customized data analysis. It can assimilate patients' transcriptomics data with the public, proprietary, and our in-house developed gene-disease databases to query, easily explore, and access information on gene annotation and classified disease phenotypes with greater visibility and customization. To test its performance and understand the clinical and scientific impact of GVViZ, we present GVViZ analysis for different chronic diseases and conditions, including Alzheimer's disease, arthritis, asthma, diabetes mellitus, heart failure, hypertension, obesity, osteoporosis, and multiple cancer disorders. The results are visualized using GVViZ and can be exported as image (PNF/TIFF) and text (CSV) files that include gene names, Ensembl (ENSG) IDs, quantified abundances, expressed transcript lengths, and annotated oncology and non-oncology diseases.
We emphasize that automated and interactive visualization should be an indispensable component of modern RNA-seq analysis, which is currently not the case. However, experts in clinics and researchers in life sciences can use GVViZ to visualize and interpret the transcriptomics data, making it a powerful tool to study the dynamics of gene expression and regulation. Furthermore, with successful deployment in clinical settings, GVViZ has the potential to enable high-throughput correlations between patient diagnoses based on clinical and transcriptomics data.
遗传倾向被认为是识别疾病高发人群的关键因素。研究致病基因和高表达及低表达基因可以帮助找到患者护理不确定性的根本原因。然而,独立且及时的高通量下一代测序数据分析对于非计算生物学家和遗传学家来说仍然是一个挑战。
在本文中,我们提出了一个可发现、可访问、可交互和可重复使用(FAIR)的生物信息学平台,即 GVViZ(可视化具有致病变异的基因)。GVViZ 是一个用户友好、跨平台的数据库应用程序,用于 RNA-seq 驱动的可变和复杂基因-疾病数据注释和表达分析,并具有动态热图可视化功能。GVViZ 有可能发现数百万个特征之间的模式,并提取可操作的信息,从而支持对复杂疾病的早期检测和为个性化患者护理开发新疗法。GVViZ 的执行基于一组简单的指令,没有计算背景的用户可以按照这些指令来设计和执行定制化数据分析。它可以将患者的转录组学数据与公共、专有的和我们内部开发的基因-疾病数据库集成,以便查询、轻松探索和访问基因注释和分类疾病表型的信息,具有更高的可见性和可定制性。为了测试其性能并了解 GVViZ 的临床和科学影响,我们展示了 GVViZ 对不同慢性疾病和病症的分析,包括阿尔茨海默病、关节炎、哮喘、糖尿病、心力衰竭、高血压、肥胖症、骨质疏松症和多种癌症疾病。结果使用 GVViZ 进行可视化,并可以作为图像(PNF/TIFF)和文本(CSV)文件导出,其中包括基因名称、Ensembl(ENSG)ID、量化丰度、表达转录长度以及注释的肿瘤学和非肿瘤学疾病。
我们强调,自动化和交互式可视化应该是现代 RNA-seq 分析不可或缺的组成部分,但目前并非如此。然而,临床专家和生命科学研究人员可以使用 GVViZ 来可视化和解释转录组学数据,使其成为研究基因表达和调控动态的强大工具。此外,通过在临床环境中的成功部署,GVViZ 有可能基于临床和转录组学数据实现患者诊断之间的高通量相关性。