Department of Pharmacological Sciences, Mount Sinai Center for Bioinformatics, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place Box 1215, New York, NY 10029, USA.
Department of Cell, Developmental and Regenerative Biology, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place Box 1020, New York, NY 10029, USA.
Nucleic Acids Res. 2019 Jul 2;47(W1):W183-W190. doi: 10.1093/nar/gkz347.
High-throughput experiments produce increasingly large datasets that are difficult to analyze and integrate. While most data integration approaches focus on aligning metadata, data integration can also be achieved by abstracting experimental results into gene sets. Such gene sets can be made available for reuse through gene set enrichment analysis tools such as Enrichr. Enrichr currently supports only gene sets compiled from human and mouse, limiting accessibility for investigators who study other model organisms. modEnrichr is an expansion of Enrichr for four model organisms: fish, fly, worm and yeast. The gene set libraries within FishEnrichr, FlyEnrichr, WormEnrichr and YeastEnrichr are created from the Gene Ontology, mRNA expression profiles, GeneRIF, pathway databases, protein domain databases and other organism-specific resources. Additionally, libraries were created by predicting gene function from RNA-seq co-expression data processed uniformly from the Gene Expression Omnibus (GEO) for each organism. The modEnrichr suite of tools can convert gene lists across species using an ortholog conversion tool that automatically detects the species. For complex analyses, modEnrichr provides API access that enables submitting batch queries. In summary, modEnrichr leverages existing model organism databases and other resources to facilitate comprehensive hypothesis generation through data integration.
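To make the idea of testing a gene list against gene set libraries concrete, the following minimal sketch ranks the terms of a toy library by a one-sided hypergeometric over-representation p-value. This statistic is a common choice for this kind of analysis and is used here as an assumption; the abstract does not specify the scoring that modEnrichr applies. The function names, gene identifiers, library terms and background size are all hypothetical.

```python
from math import comb


def hypergeom_enrichment_p(overlap, list_size, set_size, background_size):
    """One-sided p-value P(X >= overlap), where X is the number of gene-set
    members among list_size genes drawn without replacement from a background
    of background_size genes that contains set_size gene-set members."""
    max_overlap = min(list_size, set_size)
    total = comb(background_size, list_size)
    tail = sum(
        comb(set_size, k) * comb(background_size - set_size, list_size - k)
        for k in range(overlap, max_overlap + 1)
    )
    return tail / total


def rank_library(query_genes, library, background_size):
    """Score every term in a library (dict: term -> list of genes) against
    the query list and return (term, overlapping genes, p-value) tuples
    sorted from most to least significant."""
    query = set(query_genes)
    results = []
    for term, genes in library.items():
        gene_set = set(genes)
        hits = query & gene_set
        p = hypergeom_enrichment_p(
            len(hits), len(query), len(gene_set), background_size
        )
        results.append((term, sorted(hits), p))
    return sorted(results, key=lambda row: row[2])


# Hypothetical toy data: identifiers and terms are placeholders, not real annotations.
toy_library = {
    "term_1": ["gene_a", "gene_b", "gene_c"],
    "term_2": ["gene_x", "gene_y", "gene_z"],
}
ranking = rank_library(["gene_a", "gene_b", "gene_c"], toy_library, background_size=10)
for term, hits, p in ranking:
    print(term, hits, p)
```

In practice the background size would be the number of genes annotated for the organism in question, and p-values across many terms would need correction for multiple testing; both are left out of this sketch.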