Suppr超能文献

geoCancerPrognosticDatasetsRetriever:一个生物信息学工具,可轻松在基因表达综合数据库(GEO)上识别癌症预后数据集。

geoCancerPrognosticDatasetsRetriever: a bioinformatics tool to easily identify cancer prognostic datasets on Gene Expression Omnibus (GEO).

机构信息

Department of Biological Sciences, Kuwait University, 13060 Kuwait City, Kuwait.

Institute of Health Policy Management and Evaluation, University of Toronto, Toronto M5T 1P8, Ontario, Canada.

出版信息

Bioinformatics. 2022 Mar 4;38(6):1761-1763. doi: 10.1093/bioinformatics/btab852.

Abstract

SUMMARY

Having multiple datasets is a key aspect of robust bioinformatics analyses, because it allows researchers to find possible confirmation of the discoveries made on multiple cohorts. For this purpose, Gene Expression Omnibus (GEO) can be a useful database, since it provides hundreds of thousands of microarray gene expression datasets freely available for download and usage. Despite this large availability, collecting prognostic datasets of a specific cancer type from GEO can be a long, time-consuming and energy-consuming activity for any bioinformatician, who needs to execute it manually by first performing a search on the GEO website and then by checking all the datasets found one by one. To solve this problem, we present here geoCancerPrognosticDatasetsRetriever, a Perl 5 application which reads a cancer type and a list of microarray platforms, searches for prognostic gene expression datasets of that cancer type and based on those platforms available on GEO, and returns the GEO accession codes of those datasets, if found. Our bioinformatics tool can easily generate in a few minutes a list of cancer prognostic datasets that otherwise would require numerous hours of manual work to any bioinformatician. geoCancerPrognosticDatasetsRetriever can handily retrieve multiple prognostic datasets of gene expression of any cancer type, laying the foundations for numerous bioinformatics studies and meta-analyses that can have a strong impact on oncology research.

AVAILABILITY AND IMPLEMENTATION

geoCancerPrognosticDatasetsRetriever is freely available under the GPLv2 license on the Comprehensive Perl Archive Network (CPAN) at https://metacpan.org/pod/App::geoCancerPrognosticDatasetsRetriever and on GitHub at https://github.com/AbbasAlameer/geoCancerPrognosticDatasetsRetriever.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

摘要

拥有多个数据集是稳健的生物信息学分析的一个关键方面,因为它允许研究人员在多个队列中找到发现的可能证实。为此,基因表达综合(GEO)可以是一个有用的数据库,因为它提供了数十万的微阵列基因表达数据集,可免费下载和使用。尽管有大量的可用性,从 GEO 收集特定癌症类型的预后数据集对于任何生物信息学家来说可能是一项漫长、耗时和耗力的活动,他需要通过首先在 GEO 网站上进行搜索,然后逐个检查找到的所有数据集来手动执行此操作。为了解决这个问题,我们在这里提出了 geoCancerPrognosticDatasetsRetriever,这是一个 Perl 5 应用程序,它读取癌症类型和微阵列平台列表,搜索该癌症类型的预后基因表达数据集,并根据 GEO 上可用的那些平台,返回这些数据集的 GEO 访问码,如果找到的话。我们的生物信息学工具可以在几分钟内轻松生成一份癌症预后数据集列表,如果没有我们的工具,这将需要生物信息学家数小时的手动工作。geoCancerPrognosticDatasetsRetriever 可以方便地检索任何癌症类型的多个基因表达预后数据集,为众多生物信息学研究和荟萃分析奠定基础,这些研究和荟萃分析可能对肿瘤学研究产生重大影响。

可用性和实现

geoCancerPrognosticDatasetsRetriever 可在 Comprehensive Perl Archive Network(CPAN)上以 GPLv2 许可证免费获得,网址为 https://metacpan.org/pod/App::geoCancerPrognosticDatasetsRetriever,也可在 GitHub 上获得,网址为 https://github.com/AbbasAlameer/geoCancerPrognosticDatasetsRetriever。

补充信息

补充数据可在 Bioinformatics 在线获得。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验