Ullah Shahid, Ullah Farhan, Rahman Wajeeha, Karras Dimitrios A, Ullah Anees, Ahmad Gulzar, Ijaz Muhammad, Gao Tianshun
S-Khan Lab Mardan, Khyber Pakhtunkhwa, Pakistan.
Department General, Faculty of Science, National and Kapodistrian University of Athens, Athens, Greece.
JMIR Cancer. 2022 Jun 10;8(2):e35020. doi: 10.2196/35020.
The advancement of cancer research has been facilitated through freely available cancer literature, databases, and tools. The age of genomics and big data has given rise to the need for cooperation and data sharing in order to make efficient use of this new information in the COVID-19 pandemic. Although there are many databases for cancer research, their access is not easy owing to different ways of processing and managing the data. There is an absence of a unified platform to manage all of them in a transparent and more comprehensible way.
In this study, an improved integrated cancer research database and platform is provided to facilitate a deeper statistical insight into the correlation between cancer and the COVID-19 pandemic, unifying the collection of almost all previous published cancer databases and defining a model web database for cancer research, and scoring databases on the basis of the variety types of cancer, sample size, completeness of omics results, and user interface.
Databases examined and integrated include the Data Portal database, Genomic database, Proteomic database, Expression database, Gene database, and Mutation database; and it is expected that this launch will sort, save, advance the understanding and encourage the use of these resources in the cancer research environment.
To make it easy to search valuable information, 85 cancer databases are provided in the form of a table, and a database of databases named the Cancer Research Database (CRDB) has been built and presented herein. Furthermore, the CRDB has been herein equipped with unique navigation tools in order to be explored by three methods; that is, any single database can be browsed by typing the name in the given search bar, while all categories can be browsed by clicking on the name of the category or image expression icon, thus serving as a facility that could provide all the category databases on a single click.
The computational platform (PHP, HTML, CSS, and MySQL) used to build CRDB for the cancer scientific community can be freely investigated and browsed on the internet and is planned to be updated in a timely manner. In addition, based on the proposed platform, the status and diagnoses statistics of cancer during the COVID-19 pandemic have been thoroughly investigated herein using CRDB, thus providing an easy-to-manage, understandable framework that mines knowledge for future researchers.
免费获取的癌症文献、数据库和工具推动了癌症研究的进展。基因组学和大数据时代使得为在新冠疫情中有效利用这些新信息而开展合作和数据共享成为必要。尽管有许多用于癌症研究的数据库,但由于数据处理和管理方式不同,获取这些数据库并不容易。缺乏一个以透明且更易于理解的方式管理所有这些数据库的统一平台。
在本研究中,提供了一个改进的综合癌症研究数据库和平台,以促进对癌症与新冠疫情之间相关性的更深入统计洞察,统一几乎所有先前发表的癌症数据库的收集,定义一个用于癌症研究的模型网络数据库,并根据癌症类型、样本量、组学结果完整性和用户界面等多种类型对数据库进行评分。
所检查和整合的数据库包括数据门户数据库、基因组数据库、蛋白质组数据库、表达数据库、基因数据库和突变数据库;预计此次发布将对这些资源进行分类、保存、增进理解并鼓励在癌症研究环境中使用。
为便于搜索有价值的信息,以表格形式提供了85个癌症数据库,并在此构建并展示了一个名为癌症研究数据库(CRDB)的数据库之数据库。此外,CRDB在此配备了独特的导航工具,以便通过三种方法进行探索;也就是说,可以通过在给定搜索栏中输入名称来浏览任何单个数据库,而通过点击类别名称或图像表达图标可以浏览所有类别,从而成为一个点击一下就能提供所有类别数据库的工具。
用于为癌症科学界构建CRDB的计算平台(PHP、HTML、CSS和MySQL)可在互联网上免费研究和浏览,并计划及时更新。此外,基于所提出的平台,本文使用CRDB对新冠疫情期间癌症的状况和诊断统计进行了全面调查,从而提供了一个易于管理、易于理解的框架,为未来的研究人员挖掘知识。