• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于评估简档强度和相似度的通用信息检索框架。

A versatile information retrieval framework for evaluating profile strength and similarity.

作者信息

Kalinin Alexandr A, Arevalo John, Serrano Erik, Vulliard Loan, Tsang Hillary, Bornholdt Michael, Muñoz Alán F, Sivagurunathan Suganya, Rajwa Bartek, Carpenter Anne E, Way Gregory P, Singh Shantanu

机构信息

Imaging Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA.

Department of Biomedical Informatics, University of Colorado School of Medicine, Aurora, CO, USA.

出版信息

Nat Commun. 2025 Jun 4;16(1):5181. doi: 10.1038/s41467-025-60306-2.

DOI:10.1038/s41467-025-60306-2
PMID:40467541
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12137819/
Abstract

Large-scale profiling assays capture a cell population's state by measuring thousands of biological properties per cell or sample. However, evaluating profile strength and similarity remains challenging due to the high dimensionality and non-linear, heterogeneous nature of measurements. Here, we develop a statistical framework using mean average precision (mAP) as a single, data-driven metric to address this challenge. We validate the mAP framework against established metrics through simulations and real-world data, revealing its ability to capture subtle and meaningful biological differences in cell state. Specifically, we use mAP to assess a sample's phenotypic activity relative to controls, as well as the phenotypic consistency of groups of perturbations (or samples). We evaluate the framework across diverse datasets and on different profile types (image, protein, mRNA), perturbations (CRISPR, gene overexpression, small molecules), and resolutions (single-cell, bulk). The mAP framework, together with our open-source software package copairs, is useful for evaluating high-dimensional profiling data in biological research and drug discovery.

摘要

大规模分析检测通过测量每个细胞或样本的数千种生物学特性来捕捉细胞群体的状态。然而,由于测量的高维度以及非线性、异质性,评估图谱强度和相似性仍然具有挑战性。在这里,我们开发了一个统计框架,使用平均精度均值(mAP)作为单一的数据驱动指标来应对这一挑战。我们通过模拟和实际数据,对照既定指标验证了mAP框架,揭示了其捕捉细胞状态中细微且有意义的生物学差异的能力。具体而言,我们使用mAP来评估样本相对于对照的表型活性,以及扰动(或样本)组的表型一致性。我们在不同的数据集以及不同的图谱类型(图像、蛋白质、mRNA)、扰动(CRISPR、基因过表达、小分子)和分辨率(单细胞、批量)上评估了该框架。mAP框架与我们的开源软件包copairs一起,可用于评估生物学研究和药物发现中的高维分析数据。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9a3/12137819/95946c8e9f10/41467_2025_60306_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9a3/12137819/d02214b315d1/41467_2025_60306_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9a3/12137819/d7bfe213c8c5/41467_2025_60306_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9a3/12137819/4079a554ff08/41467_2025_60306_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9a3/12137819/a8738ad8d729/41467_2025_60306_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9a3/12137819/95946c8e9f10/41467_2025_60306_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9a3/12137819/d02214b315d1/41467_2025_60306_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9a3/12137819/d7bfe213c8c5/41467_2025_60306_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9a3/12137819/4079a554ff08/41467_2025_60306_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9a3/12137819/a8738ad8d729/41467_2025_60306_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9a3/12137819/95946c8e9f10/41467_2025_60306_Fig5_HTML.jpg

相似文献

1
A versatile information retrieval framework for evaluating profile strength and similarity.一种用于评估简档强度和相似度的通用信息检索框架。
Nat Commun. 2025 Jun 4;16(1):5181. doi: 10.1038/s41467-025-60306-2.
2
A versatile information retrieval framework for evaluating profile strength and similarity.一个用于评估轮廓强度和相似度的通用信息检索框架。
bioRxiv. 2025 Mar 13:2024.04.01.587631. doi: 10.1101/2024.04.01.587631.
3
Pheno-Ranker: a toolkit for comparison of phenotypic data stored in GA4GH standards and beyond.Pheno-Ranker:用于比较存储在GA4GH标准及其他标准中的表型数据的工具包。
BMC Bioinformatics. 2024 Dec 4;25(1):373. doi: 10.1186/s12859-024-05993-2.
4
scBOL: a universal cell type identification framework for single-cell and spatial transcriptomics data.scBOL:单细胞和空间转录组学数据的通用细胞类型识别框架。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae188.
5
Data mining in protein interactomics. Six computational research challenges and opportunities.蛋白质相互作用组学中的数据挖掘。六大计算研究挑战与机遇。
IEEE Eng Med Biol Mag. 2005 May-Jun;24(3):95-102. doi: 10.1109/memb.2005.1436466.
6
Improving information retrieval in functional analysis.提高功能分析中的信息检索。
Comput Biol Med. 2016 Dec 1;79:10-20. doi: 10.1016/j.compbiomed.2016.09.017. Epub 2016 Sep 24.
7
Potential bias in GO::TermFinder.基因本体术语查找器中的潜在偏差。
Brief Bioinform. 2009 May;10(3):289-94. doi: 10.1093/bib/bbn054. Epub 2009 Mar 11.
8
Booly: a new data integration platform.Booly:一个新的数据集成平台。
BMC Bioinformatics. 2010 Oct 13;11:513. doi: 10.1186/1471-2105-11-513.
9
DrSim: Similarity Learning for Transcriptional Phenotypic Drug Discovery.Sim 博士:转录表型药物发现的相似性学习。
Genomics Proteomics Bioinformatics. 2022 Oct;20(5):1028-1036. doi: 10.1016/j.gpb.2022.09.006. Epub 2022 Sep 29.
10
Morphology and gene expression profiling provide complementary information for mapping cell state.形态学和基因表达谱分析为细胞状态的描绘提供了互补信息。
Cell Syst. 2022 Nov 16;13(11):911-923.e9. doi: 10.1016/j.cels.2022.10.001. Epub 2022 Oct 28.

引用本文的文献

1
Prediction of cellular morphology changes under perturbations with a transcriptome-guided diffusion model.基于转录组引导的扩散模型预测扰动下的细胞形态变化。
Nat Commun. 2025 Sep 2;16(1):8210. doi: 10.1038/s41467-025-63478-z.
2
Morphological map of under- and overexpression of genes in human cells.人类细胞中基因表达不足和过度表达的形态学图谱。
Nat Methods. 2025 Aug;22(8):1742-1752. doi: 10.1038/s41592-025-02753-9. Epub 2025 Aug 7.
3
Triple-effect correction for Cell Painting data with contrastive and domain-adversarial learning.

本文引用的文献

1
Reproducible image-based profiling with Pycytominer.使用Pycytominer进行基于图像的可重复分析。
Nat Methods. 2025 Apr;22(4):677-680. doi: 10.1038/s41592-025-02611-8. Epub 2025 Mar 3.
2
Building, benchmarking, and exploring perturbative maps of transcriptional and morphological data.构建、基准测试和探索转录和形态数据的微扰图。
PLoS Comput Biol. 2024 Oct 1;20(10):e1012463. doi: 10.1371/journal.pcbi.1012463. eCollection 2024 Oct.
3
Evaluating batch correction methods for image-based cell profiling.评估基于图像的细胞分析中的批量校正方法。
基于对比学习和域对抗学习的细胞绘画数据三重效应校正
Nat Commun. 2025 Jul 25;16(1):6886. doi: 10.1038/s41467-025-62193-z.
4
A morphology and secretome map of pyroptosis.细胞焦亡的形态学和分泌组图谱。
Mol Biol Cell. 2025 Jun 1;36(6):ar63. doi: 10.1091/mbc.E25-03-0119. Epub 2025 Apr 9.
5
Capturing cell heterogeneity in representations of cell populations for image-based profiling using contrastive learning.利用对比学习在基于图像分析的细胞群体表征中捕捉细胞异质性。
bioRxiv. 2024 Jul 31:2023.11.14.567038. doi: 10.1101/2023.11.14.567038.
Nat Commun. 2024 Aug 2;15(1):6516. doi: 10.1038/s41467-024-50613-5.
4
Differential gene expression analysis pipelines and bioinformatic tools for the identification of specific biomarkers: A review.用于鉴定特定生物标志物的差异基因表达分析流程和生物信息学工具:综述
Comput Struct Biotechnol J. 2024 Mar 1;23:1154-1168. doi: 10.1016/j.csbj.2024.02.018. eCollection 2024 Dec.
5
Learning representations for image-based profiling of perturbations.基于图像的扰动分析的表示学习。
Nat Commun. 2024 Feb 21;15(1):1594. doi: 10.1038/s41467-024-45999-1.
6
Segmentation metric misinterpretations in bioimage analysis.生物影像分析中的分割度量误读。
Nat Methods. 2024 Feb;21(2):213-216. doi: 10.1038/s41592-023-01942-8. Epub 2023 Jul 27.
7
Morphology and gene expression profiling provide complementary information for mapping cell state.形态学和基因表达谱分析为细胞状态的描绘提供了互补信息。
Cell Syst. 2022 Nov 16;13(11):911-923.e9. doi: 10.1016/j.cels.2022.10.001. Epub 2022 Oct 28.
8
CORUM: the comprehensive resource of mammalian protein complexes-2022.CORUM:哺乳动物蛋白质复合物综合资源-2022 年版。
Nucleic Acids Res. 2023 Jan 6;51(D1):D539-D545. doi: 10.1093/nar/gkac1015.
9
Ten quick tips for deep learning in biology.生物学深度学习的十条快速提示。
PLoS Comput Biol. 2022 Mar 24;18(3):e1009803. doi: 10.1371/journal.pcbi.1009803. eCollection 2022 Mar.
10
Benchmarking atlas-level data integration in single-cell genomics.单细胞基因组学中图谱级数据整合的基准测试。
Nat Methods. 2022 Jan;19(1):41-50. doi: 10.1038/s41592-021-01336-8. Epub 2021 Dec 23.