Kedzierska Kasia Z, Crawford Lorin, Amini Ava P, Lu Alex X
University of Oxford, Oxford, UK.
Microsoft Research, Cambridge, MA, USA.
Genome Biol. 2025 Apr 18;26(1):101. doi: 10.1186/s13059-025-03574-x.
Foundation models such as scGPT and Geneformer have not been rigorously evaluated in a setting where they are used without any further training (i.e., zero-shot). Understanding the performance of models in zero-shot settings is critical to applications that exclude the ability to fine-tune, such as discovery settings where labels are unknown. Our evaluation of the zero-shot performance of Geneformer and scGPT suggests that, in some cases, these models may face reliability challenges and could be outperformed by simpler methods. Our findings underscore the importance of zero-shot evaluations in development and deployment of foundation models in single-cell research.
诸如scGPT和Geneformer等基础模型,在未经过任何进一步训练(即零样本)的情况下使用时,尚未得到严格评估。了解模型在零样本设置下的性能对于那些排除了微调能力的应用至关重要,比如标签未知的发现设置。我们对Geneformer和scGPT的零样本性能评估表明,在某些情况下,这些模型可能面临可靠性挑战,并且可能被更简单的方法超越。我们的研究结果强调了零样本评估在单细胞研究中基础模型开发和部署中的重要性。