• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

大众对基因表达综合数据库中的特征进行提取和分析。

Extraction and analysis of signatures from the Gene Expression Omnibus by the crowd.

作者信息

Wang Zichen, Monteiro Caroline D, Jagodnik Kathleen M, Fernandez Nicolas F, Gundersen Gregory W, Rouillard Andrew D, Jenkins Sherry L, Feldmann Axel S, Hu Kevin S, McDermott Michael G, Duan Qiaonan, Clark Neil R, Jones Matthew R, Kou Yan, Goff Troy, Woodland Holly, Amaral Fabio M R, Szeto Gregory L, Fuchs Oliver, Schüssler-Fiorenza Rose Sophia M, Sharma Shvetank, Schwartz Uwe, Bausela Xabier Bengoetxea, Szymkiewicz Maciej, Maroulis Vasileios, Salykin Anton, Barra Carolina M, Kruth Candice D, Bongio Nicholas J, Mathur Vaibhav, Todoric Radmila D, Rubin Udi E, Malatras Apostolos, Fulp Carl T, Galindo John A, Motiejunaite Ruta, Jüschke Christoph, Dishuck Philip C, Lahl Katharina, Jafari Mohieddin, Aibar Sara, Zaravinos Apostolos, Steenhuizen Linda H, Allison Lindsey R, Gamallo Pablo, de Andres Segura Fernando, Dae Devlin Tyler, Pérez-García Vicente, Ma'ayan Avi

机构信息

Department of Pharmacological Sciences, BD2K-LINCS Data Coordination and Integration Center, Illuminating the Druggable Genome Knowledge Management Center, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place Box 1215, New York, New York 10029, USA.

Fluid Physics and Transport Processes Branch, NASA Glenn Research Center, 21000 Brookpark Rd, Cleveland, Ohio 44135, USA.

出版信息

Nat Commun. 2016 Sep 26;7:12846. doi: 10.1038/ncomms12846.

DOI:10.1038/ncomms12846
PMID:27667448
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5052684/
Abstract

Gene expression data are accumulating exponentially in public repositories. Reanalysis and integration of themed collections from these studies may provide new insights, but requires further human curation. Here we report a crowdsourcing project to annotate and reanalyse a large number of gene expression profiles from Gene Expression Omnibus (GEO). Through a massive open online course on Coursera, over 70 participants from over 25 countries identify and annotate 2,460 single-gene perturbation signatures, 839 disease versus normal signatures, and 906 drug perturbation signatures. All these signatures are unique and are manually validated for quality. Global analysis of these signatures confirms known associations and identifies novel associations between genes, diseases and drugs. The manually curated signatures are used as a training set to develop classifiers for extracting similar signatures from the entire GEO repository. We develop a web portal to serve these signatures for query, download and visualization.

摘要

公共数据库中的基因表达数据正在呈指数级积累。对这些研究中的主题数据集进行重新分析和整合可能会提供新的见解,但需要进一步的人工整理。在此,我们报告一项众包项目,该项目对来自基因表达综合数据库(GEO)的大量基因表达谱进行注释和重新分析。通过在Coursera上开展的大规模在线开放课程,来自25个以上国家的70多名参与者识别并注释了2460个单基因扰动特征、839个疾病与正常对照特征以及906个药物扰动特征。所有这些特征都是独一无二的,并经过人工质量验证。对这些特征的全局分析证实了已知的关联,并识别出基因、疾病和药物之间的新关联。经过人工整理的特征被用作训练集,以开发用于从整个GEO数据库中提取相似特征的分类器。我们开发了一个门户网站来提供这些特征以供查询、下载和可视化。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e70/5052684/74598e97d7a0/ncomms12846-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e70/5052684/9383ba9753ad/ncomms12846-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e70/5052684/d037ada280e0/ncomms12846-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e70/5052684/2db9783ede64/ncomms12846-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e70/5052684/ba5021aafc17/ncomms12846-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e70/5052684/74598e97d7a0/ncomms12846-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e70/5052684/9383ba9753ad/ncomms12846-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e70/5052684/d037ada280e0/ncomms12846-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e70/5052684/2db9783ede64/ncomms12846-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e70/5052684/ba5021aafc17/ncomms12846-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e70/5052684/74598e97d7a0/ncomms12846-f5.jpg

相似文献

1
Extraction and analysis of signatures from the Gene Expression Omnibus by the crowd.大众对基因表达综合数据库中的特征进行提取和分析。
Nat Commun. 2016 Sep 26;7:12846. doi: 10.1038/ncomms12846.
2
GEN3VA: aggregation and analysis of gene expression signatures from related studies.GEN3VA:相关研究中基因表达特征的汇总与分析
BMC Bioinformatics. 2016 Nov 15;17(1):461. doi: 10.1186/s12859-016-1321-1.
3
GESgnExt: Gene Expression Signature Extraction and Meta-Analysis on Gene Expression Omnibus.GESgnExt:基于基因表达综合数据库的基因表达特征提取和荟萃分析。
IEEE J Biomed Health Inform. 2020 Jan;24(1):311-318. doi: 10.1109/JBHI.2019.2896144. Epub 2019 Jan 30.
4
GEM-TREND: a web tool for gene expression data mining toward relevant network discovery.GEM-TREND:一个用于挖掘基因表达数据以发现相关网络的网络工具。
BMC Genomics. 2009 Sep 3;10:411. doi: 10.1186/1471-2164-10-411.
5
Meta-analysis of crowdsourced data compendia suggests pan-disease transcriptional signatures of autoimmunity.众包数据汇编的荟萃分析表明自身免疫的泛疾病转录特征。
F1000Res. 2016 Dec 20;5:2884. doi: 10.12688/f1000research.10465.1. eCollection 2016.
6
Precision annotation of digital samples in NCBI's gene expression omnibus.NCBI 基因表达综合数据库中数字样本的精确注释。
Sci Data. 2017 Sep 19;4:170125. doi: 10.1038/sdata.2017.125.
7
Mining data and metadata from the gene expression omnibus.从基因表达综合数据库挖掘数据和元数据。
Biophys Rev. 2019 Feb;11(1):103-110. doi: 10.1007/s12551-018-0490-8. Epub 2018 Dec 29.
8
LINCS Data Portal 2.0: next generation access point for perturbation-response signatures.LINCS 数据门户 2.0:扰动-响应特征的新一代接入点。
Nucleic Acids Res. 2020 Jan 8;48(D1):D431-D439. doi: 10.1093/nar/gkz1023.
9
ADAGE signature analysis: differential expression analysis with data-defined gene sets.ADAGE特征分析:使用数据定义的基因集进行差异表达分析。
BMC Bioinformatics. 2017 Nov 22;18(1):512. doi: 10.1186/s12859-017-1905-4.
10
MARQ: an online tool to mine GEO for experiments with similar or opposite gene expression signatures.MARQ:一个在线工具,用于挖掘 GEO 中具有相似或相反基因表达特征的实验。
Nucleic Acids Res. 2010 Jul;38(Web Server issue):W228-32. doi: 10.1093/nar/gkq476. Epub 2010 May 31.

引用本文的文献

1
Application of perturbation gene expression profiles in drug discovery-From mechanism of action to quantitative modelling.扰动基因表达谱在药物发现中的应用——从作用机制到定量建模
Front Syst Biol. 2023 Feb 9;3:1126044. doi: 10.3389/fsysb.2023.1126044. eCollection 2023.
2
The novel diagnostic markers for systemic lupus erythematosus and periodontal disease.系统性红斑狼疮和牙周病的新型诊断标志物。
Front Immunol. 2025 Jul 22;16:1614044. doi: 10.3389/fimmu.2025.1614044. eCollection 2025.
3
Drug Search and Design Considering Cell Specificity of Chemically Induced Gene Expression Profiles for Disease-Associated Tissues.

本文引用的文献

1
Crowdsourcing the General Public for Large Scale Molecular Pathology Studies in Cancer.众包公众参与癌症大规模分子病理学研究
EBioMedicine. 2015 May 9;2(7):681-9. doi: 10.1016/j.ebiom.2015.05.009. eCollection 2015 Jul.
2
Dynamics of the discovery process of protein-protein interactions from low content studies.基于低含量研究的蛋白质-蛋白质相互作用发现过程的动力学
BMC Syst Biol. 2015 Jun 6;9:26. doi: 10.1186/s12918-015-0173-z.
3
GEO2Enrichr: browser extension and server app to extract gene sets from GEO and analyze them for biological functions.
基于化学诱导基因表达谱的细胞特异性对疾病相关组织进行药物搜索与设计
Mol Inform. 2025 Jun;44(5-6):e2444. doi: 10.1002/minf.2444.
4
CORESH: a gene signature-based search engine for public gene expression datasets.CORESH:一种基于基因特征的公共基因表达数据集搜索引擎。
Nucleic Acids Res. 2025 May 5. doi: 10.1093/nar/gkaf372.
5
Therapeutic target prediction for orphan diseases integrating genome-wide and transcriptome-wide association studies.整合全基因组和全转录组关联研究的罕见病治疗靶点预测
Nat Commun. 2025 Apr 18;16(1):3355. doi: 10.1038/s41467-025-58464-4.
6
Playbook workflow builder: Interactive construction of bioinformatics workflows.剧本工作流程构建器:生物信息学工作流程的交互式构建
PLoS Comput Biol. 2025 Apr 3;21(4):e1012901. doi: 10.1371/journal.pcbi.1012901. eCollection 2025 Apr.
7
Using semantic search to find publicly available gene-expression datasets.使用语义搜索来查找公开可用的基因表达数据集。
bioRxiv. 2025 Mar 15:2025.03.13.643153. doi: 10.1101/2025.03.13.643153.
8
Identification of as a Key Biomarker Linking Iron Metabolism and Dendritic Cell Activation in Systemic Lupus Erythematosus Through Bioinformatics and Experimental Validation.通过生物信息学和实验验证鉴定[具体物质名称未给出]作为系统性红斑狼疮中铁代谢与树突状细胞活化之间联系的关键生物标志物
J Inflamm Res. 2025 Mar 14;18:3859-3878. doi: 10.2147/JIR.S500115. eCollection 2025.
9
Identification biomarkers and therapeutic targets of disulfidptosis-related in rheumatoid arthritis via bioinformatics, molecular dynamics simulation, and experimental validation.通过生物信息学、分子动力学模拟和实验验证确定类风湿关节炎中与二硫键连接的细胞程序性坏死相关的生物标志物和治疗靶点。
Sci Rep. 2025 Mar 13;15(1):8779. doi: 10.1038/s41598-025-93656-4.
10
A Computational Recognition Analysis of Promising Prognostic Biomarkers in Breast, Colon and Lung Cancer Patients.乳腺癌、结肠癌和肺癌患者中有前景的预后生物标志物的计算识别分析
Int J Mol Sci. 2025 Jan 25;26(3):1017. doi: 10.3390/ijms26031017.
GEO2Enrichr:用于从基因表达综合数据库(GEO)中提取基因集并分析其生物学功能的浏览器扩展程序和服务器应用程序。
Bioinformatics. 2015 Sep 15;31(18):3060-2. doi: 10.1093/bioinformatics/btv297. Epub 2015 May 13.
4
Crowdsourcing in biomedicine: challenges and opportunities.生物医学中的众包:挑战与机遇。
Brief Bioinform. 2016 Jan;17(1):23-32. doi: 10.1093/bib/bbv021. Epub 2015 Apr 17.
5
Ranking adverse drug reactions with crowdsourcing.通过众包对药物不良反应进行排名。
J Med Internet Res. 2015 Mar 23;17(3):e80. doi: 10.2196/jmir.3962.
6
Scaling drug indication curation through crowdsourcing.通过众包扩大药物适应症整理规模。
Database (Oxford). 2015 Mar 22;2015. doi: 10.1093/database/bav016. Print 2015.
7
limma powers differential expression analyses for RNA-sequencing and microarray studies.limma为RNA测序和微阵列研究提供差异表达分析的动力。
Nucleic Acids Res. 2015 Apr 20;43(7):e47. doi: 10.1093/nar/gkv007. Epub 2015 Jan 20.
8
Targeted exploration and analysis of large cross-platform human transcriptomic compendia.对大型跨平台人类转录组数据集进行靶向探索和分析。
Nat Methods. 2015 Mar;12(3):211-4, 3 p following 214. doi: 10.1038/nmeth.3249. Epub 2015 Jan 12.
9
mirPub: a database for searching microRNA publications.mirPub:一个用于搜索微小RNA出版物的数据库。
Bioinformatics. 2015 May 1;31(9):1502-4. doi: 10.1093/bioinformatics/btu819. Epub 2014 Dec 20.
10
DISEASES: text mining and data integration of disease-gene associations.疾病:疾病-基因关联的文本挖掘与数据整合
Methods. 2015 Mar;74:83-9. doi: 10.1016/j.ymeth.2014.11.020. Epub 2014 Dec 5.