• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。

Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.

作者信息

Kalweit Gabriel, Klett Anusha, Silvestrini Paula, Rahnfeld Jens, Naouar Mehdi, Vogt Yannick, Infante Diana, Berger Rebecca, Duque-Afonso Jesús, Hartmann Tanja Nicole, Follo Marie, Bodurova-Spassova Elitsa, Lübbert Michael, Mertelsmann Roland, Boedecker Joschka, Ullrich Evelyn, Kalweit Maria

机构信息

Collaborative Research Institute Intelligent Oncology (CRIION), Freiburg, Germany.

Neurorobotics Lab, Department of Computer Science, University of Freiburg, Freiburg, Germany.

出版信息

Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.

DOI:10.3389/fonc.2025.1480384
PMID:40606969
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12213826/
Abstract

BACKGROUND

Cellular imaging analysis using the traditional retrospective approach is extremely time-consuming and labor-intensive. Although AI-based solutions are available, these approaches rely heavily on supervised learning techniques that require high quality, large labeled datasets from the same microscope to be reliable. In addition, primary patient samples are often heterogeneous cell populations and need to be stained to distinguish the cellular subsets. The resulting imaging data is analyzed and labeled manually by experts. Therefore, a method to distinguish cell populations across imaging devices without the need for staining and extensive manual labeling would help immensely to gain real-time insights into cell population dynamics. This especially holds true for recognizing specific cell types and states in response to treatments.

OBJECTIVE

We aim to develop an unsupervised approach using general vision foundation models trained on diverse and extensive imaging datasets to extract rich visual features for cell-analysis across devices, including both stained and unstained live cells. Our method, Entropy-guided Weighted Combinational FAISS (EWC-FAISS), uses these models purely in an inference-only mode without task-specific retraining on the cellular data. Combining the generated embeddings in an efficient and adaptive k-nearest neighbor search allows for automated, cross device identification of cell types and states, providing a strong basis for AI-assisted cancer therapy.

METHODS

We utilized two publicly available datasets. The WBC dataset includes 14,424 images of stained white blood cell samples from patients with acute myeloid and lymphoid leukemia, as well as those without leukemic pathology. The LISC dataset comprises 257 images of white blood cell samples from healthy individuals. We generated four in-house datasets utilizing the JIMT-1 breast cancer cell line, as well as Jurkat and K562 (leukemic cell lines). These datasets were acquired using the Nanolive 3D Cell Explorer-fluo (CX-A) holotomographic microscope and the BioTek Lionheart FX automated brightfield microscope. The images from the in-house datasets were manually annotated using Roboflow software. To generate the embeddings, we used and optimized a concatenated combination of SAM, DINO, ConvNeXT, SWIN, CLIP and ViTMAE. The combined embeddings were used as input for the adaptive k-nearest neighbor search, building an approximate Hierarchical Navigable Small World FAISS index. We compared EWC-FAISS to fully fined-tuned ViT-Classifiers with DINO-, and SWIN-backbones, a ConvNeXT architecture, as well as to NMTune as a lightweight domain-adaptation method with frozen backbone.

RESULTS

EWC-FAISS performed competitively with the baselines on the original datasets in terms of macro accuracy. Macro accuracy is the average of class-specific accuracies, treating all classes equally by averaging their individual accuracies. EWC-FAISS ranked second for the WBC dataset (macro accuracy: 97.6 ± 0.2), first for cell state classification from Nanolive (macro accuracy: 90 ± 0), and performed comparably for cell type classification from Lionheart (macro accuracy: 87 ± 0). For the transfer to out-of-distribution (OOD) datasets, which the model had not seen during training, EWC-FAISS consistently outperformed the other baselines. For the LISC dataset, EWC-FAISS achieved a macro accuracy of 78.5 ± 0.3, compared to DINO FT's 17 ± 1, SWIN FT's 44 ± 14, ConvNeXT FT's 45 ± 9, and NMTune's 52 ± 10. For the cell state classification from Lionheart, EWC-FAISS had a macro accuracy of 86 ± 1, while DINO FT, SWIN FT, and ConvNeXT FT achieved 65 ± 11, 68 ± 16, and 81 ± 1, respectively, and NMTune 81 ± 7. For the transfer of cell type classification from Nanolive, EWC-FAISS attained a macro accuracy of 85 ± 0, compared to DINO FT's 24.5 ± 0.9, SWIN FT's 57 ± 6, ConvNeXT FT's 54 ± 4, and NMTune's 63 ± 4. Additionally, building EWC-FAISS after embedding generation was significantly faster than training DINO FT (∼ 6 minutes compared to 10 hours). Lastly, EWC-FAISS performed comparably in distinguishing cancerous cell lines from Peripheral Blood Mononuclear Cells with a mean accuracy of 80 ± 5, compared to CellMixer with a mean accuracy of 79.7.

CONCLUSION

We present a novel approach to identify various cell lines and primary cells based on their identity and state using images acquired across various imaging platforms which vary in resolution, magnification and image quality. Despite these differences, we could show that our efficient, adaptive k-nearest neighbor search pipeline can be applied on a large image dataset containing different cell types and effectively differentiate between the cells and their states such as live, apoptotic or necrotic. There are several applications, particularly in distinguishing various cell populations in patient samples or monitoring therapy.

摘要

背景

使用传统回顾性方法进行细胞成像分析极其耗时且费力。尽管有基于人工智能的解决方案,但这些方法严重依赖监督学习技术,而这需要来自同一显微镜的高质量、大量标记数据集才能可靠。此外,原发性患者样本通常是异质细胞群体,需要进行染色以区分细胞亚群。所得的成像数据由专家手动分析和标记。因此,一种无需染色和大量手动标记即可跨成像设备区分细胞群体的方法,将极大地有助于实时洞察细胞群体动态。这对于识别特定细胞类型以及对治疗作出反应的细胞状态尤其适用。

目的

我们旨在开发一种无监督方法,利用在多样且广泛的成像数据集上训练的通用视觉基础模型,为跨设备的细胞分析提取丰富的视觉特征,包括染色和未染色的活细胞。我们的方法,即熵引导加权组合FAISS(EWC - FAISS),仅在推理模式下使用这些模型,无需对细胞数据进行特定任务的再训练。在高效且自适应的k近邻搜索中组合生成的嵌入,能够自动跨设备识别细胞类型和状态,为人工智能辅助癌症治疗提供有力基础。

方法

我们利用了两个公开可用的数据集。白细胞数据集(WBC)包括来自急性髓系和淋巴细胞白血病患者以及无白血病病理患者的14424张染色白细胞样本图像。LISC数据集包含来自健康个体的257张白细胞样本图像。我们利用JIMT - 1乳腺癌细胞系以及Jurkat和K562(白血病细胞系)生成了四个内部数据集。这些数据集是使用Nanolive 3D细胞探索者 - 荧光(CX - A)全息显微镜和BioTek Lionheart FX自动明场显微镜获取的。来自内部数据集的图像使用Roboflow软件进行手动注释。为了生成嵌入,我们使用并优化了SAM、DINO、ConvNeXT、SWIN、CLIP和ViTMAE的串联组合。组合后的嵌入用作自适应k近邻搜索的输入,构建近似分层可导航小世界FAISS索引。我们将EWC - FAISS与具有DINO - 和SWIN - 主干的完全微调的ViT分类器、ConvNeXT架构以及作为具有冻结主干的轻量级域适应方法的NMTune进行了比较。

结果

在宏观准确率方面,EWC - FAISS在原始数据集上与基线方法表现相当。宏观准确率是特定类别准确率的平均值,通过平均各个类别的准确率来平等对待所有类别。在WBC数据集上,EWC - FAISS排名第二(宏观准确率:97.6 ± 0.2),在Nanolive的细胞状态分类中排名第一(宏观准确率:90 ± 0),在Lionheart的细胞类型分类中表现相当(宏观准确率:87 ± 0)。对于转移到训练期间模型未见过的分布外(OOD)数据集,EWC - FAISS始终优于其他基线方法。对于LISC数据集,EWC - FAISS的宏观准确率达到78.5 ± 0.3,而DINO FT为17 ± 1,SWIN FT为44 ± 14,ConvNeXT FT为45 ± 9,NMTune为52 ± 10。对于Lionheart的细胞状态分类,EWC - FAISS的宏观准确率为86 ± 1,而DINO FT、SWIN FT和ConvNeXT FT分别为65 ± 11、68 ± 16和81 ± 1,NMTune为81 ± 7。对于从Nanolive转移的细胞类型分类,EWC - FAISS的宏观准确率达到85 ± 0,而DINO FT为24.5 ± 0.9,SWIN FT为57 ± 6,ConvNeXT FT为54 ± 4,NMTune为63 ± 4。此外,在嵌入生成后构建EWC - FAISS明显比训练DINO FT更快(约6分钟,而训练DINO FT需要10小时)。最后,在区分癌细胞系与外周血单个核细胞方面,EWC - FAISS的平均准确率为80 ± 5,与CellMixer的平均准确率79.7相当。

结论

我们提出了一种新颖的方法,基于在分辨率、放大倍数和图像质量各不相同的各种成像平台上获取的图像,根据细胞系和原代细胞的身份及状态对其进行识别。尽管存在这些差异,但我们能够证明,我们高效、自适应的k近邻搜索管道可应用于包含不同细胞类型的大型图像数据集,并有效区分细胞及其状态,如活细胞、凋亡细胞或坏死细胞。该方法有多种应用,特别是在区分患者样本中的各种细胞群体或监测治疗方面。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0e5/12213826/0ce6755f5b4d/fonc-15-1480384-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0e5/12213826/3559366a005e/fonc-15-1480384-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0e5/12213826/9722282d35e2/fonc-15-1480384-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0e5/12213826/ecce66877fb8/fonc-15-1480384-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0e5/12213826/337f598b26d5/fonc-15-1480384-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0e5/12213826/b8e5896a23e5/fonc-15-1480384-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0e5/12213826/afe71e66b546/fonc-15-1480384-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0e5/12213826/17e8f0530bac/fonc-15-1480384-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0e5/12213826/8f3958321338/fonc-15-1480384-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0e5/12213826/9b4390676590/fonc-15-1480384-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0e5/12213826/643616c24046/fonc-15-1480384-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0e5/12213826/0ce6755f5b4d/fonc-15-1480384-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0e5/12213826/3559366a005e/fonc-15-1480384-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0e5/12213826/9722282d35e2/fonc-15-1480384-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0e5/12213826/ecce66877fb8/fonc-15-1480384-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0e5/12213826/337f598b26d5/fonc-15-1480384-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0e5/12213826/b8e5896a23e5/fonc-15-1480384-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0e5/12213826/afe71e66b546/fonc-15-1480384-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0e5/12213826/17e8f0530bac/fonc-15-1480384-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0e5/12213826/8f3958321338/fonc-15-1480384-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0e5/12213826/9b4390676590/fonc-15-1480384-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0e5/12213826/643616c24046/fonc-15-1480384-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0e5/12213826/0ce6755f5b4d/fonc-15-1480384-g011.jpg

相似文献

1
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.
2
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
3
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
4
Carbon dioxide detection for diagnosis of inadvertent respiratory tract placement of enterogastric tubes in children.用于诊断儿童肠胃管意外置入呼吸道的二氧化碳检测
Cochrane Database Syst Rev. 2025 Feb 19;2(2):CD011196. doi: 10.1002/14651858.CD011196.pub2.
5
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.
6
A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.深度学习方法在自身免疫性大疱性疾病中的直接免疫荧光模式识别。
Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142.
7
Magnetic resonance perfusion for differentiating low-grade from high-grade gliomas at first presentation.首次就诊时磁共振灌注成像用于鉴别低级别与高级别胶质瘤
Cochrane Database Syst Rev. 2018 Jan 22;1(1):CD011551. doi: 10.1002/14651858.CD011551.pub2.
8
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
9
Systemic treatments for metastatic cutaneous melanoma.转移性皮肤黑色素瘤的全身治疗
Cochrane Database Syst Rev. 2018 Feb 6;2(2):CD011123. doi: 10.1002/14651858.CD011123.pub2.
10
Intraoperative frozen section analysis for the diagnosis of early stage ovarian cancer in suspicious pelvic masses.术中冰冻切片分析用于诊断可疑盆腔肿块中的早期卵巢癌。
Cochrane Database Syst Rev. 2016 Mar 1;3(3):CD010360. doi: 10.1002/14651858.CD010360.pub2.

本文引用的文献

1
Towards a general-purpose foundation model for computational pathology.迈向计算病理学的通用基础模型。
Nat Med. 2024 Mar;30(3):850-862. doi: 10.1038/s41591-024-02857-3. Epub 2024 Mar 19.
2
Artificial Intelligence-Based Treatment Decisions: A New Era for NSCLC.基于人工智能的治疗决策:非小细胞肺癌的新时代。
Cancers (Basel). 2024 Feb 19;16(4):831. doi: 10.3390/cancers16040831.
3
Segment anything in medical images.在医学图像中分割任何内容。
Nat Commun. 2024 Jan 22;15(1):654. doi: 10.1038/s41467-024-44824-z.
4
A high-resolution large-scale dataset of pathological and normal white blood cells.一个高分辨率的大规模病理性和正常白细胞数据集。
Sci Data. 2023 Jul 19;10(1):466. doi: 10.1038/s41597-023-02378-7.
5
Applying artificial intelligence for cancer immunotherapy.将人工智能应用于癌症免疫治疗。
Acta Pharm Sin B. 2021 Nov;11(11):3393-3405. doi: 10.1016/j.apsb.2021.02.007. Epub 2021 Feb 11.
6
The Emergence of Artificial Intelligence within Radiation Oncology Treatment Planning.人工智能在放射肿瘤治疗计划中的出现。
Oncology. 2021;99(2):124-134. doi: 10.1159/000512172. Epub 2020 Dec 22.
7
Efficient and Robust Approximate Nearest Neighbor Search Using Hierarchical Navigable Small World Graphs.使用分层可导航小世界图进行高效且鲁棒的近似最近邻搜索
IEEE Trans Pattern Anal Mach Intell. 2020 Apr;42(4):824-836. doi: 10.1109/TPAMI.2018.2889473. Epub 2018 Dec 28.
8
Automatic recognition of five types of white blood cells in peripheral blood.外周血中五种白细胞的自动识别。
Comput Med Imaging Graph. 2011 Jun;35(4):333-43. doi: 10.1016/j.compmedimag.2011.01.003.