Suppr超能文献

SpaDiT:基于 scRNA-seq 的空间基因表达预测扩散转换器。

SpaDiT: diffusion transformer for spatial gene expression prediction using scRNA-seq.

机构信息

School of Information Science and Engineering, Yunnan University, 650500, Kunming, Yunnan, China.

School of Health and Nursing, Yunnan Open University, 650599, Kunming, China.

出版信息

Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae571.

Abstract

The rapid development of spatially resolved transcriptomics (SRT) technologies has provided unprecedented opportunities for exploring the structure of specific organs or tissues. However, these techniques (such as image-based SRT) can achieve single-cell resolution, but can only capture the expression levels of tens to hundreds of genes. Such spatial transcriptomics (ST) data, carrying a large number of undetected genes, have limited its application value. To address the challenge, we develop SpaDiT, a deep learning framework for spatial reconstruction and gene expression prediction using scRNA-seq data. SpaDiT employs scRNA-seq data as an a priori condition and utilizes shared genes between ST and scRNA-seq data as latent representations to construct inputs, thereby facilitating the accurate prediction of gene expression in ST data. SpaDiT enhances the accuracy of spatial gene expression predictions over a variety of spatial transcriptomics datasets. We have demonstrated the effectiveness of SpaDiT by conducting extensive experiments on both seq-based and image-based ST data. We compared SpaDiT with eight highly effective baseline methods and found that our proposed method achieved an 8%-12% improvement in performance across multiple metrics. Source code and all datasets used in this paper are available at https://github.com/wenwenmin/SpaDiT and https://zenodo.org/records/12792074.

摘要

空间分辨转录组学(SRT)技术的快速发展为探索特定器官或组织的结构提供了前所未有的机会。然而,这些技术(如基于图像的 SRT)可以实现单细胞分辨率,但只能捕获数十到数百个基因的表达水平。这种空间转录组学(ST)数据,携带大量未检测到的基因,限制了其应用价值。为了解决这个挑战,我们开发了 SpaDiT,这是一个使用 scRNA-seq 数据进行空间重建和基因表达预测的深度学习框架。SpaDiT 将 scRNA-seq 数据作为先验条件,并利用 ST 和 scRNA-seq 数据之间的共享基因作为潜在表示来构建输入,从而促进 ST 数据中基因表达的准确预测。SpaDiT 提高了多种空间转录组数据集上空间基因表达预测的准确性。我们通过对基于序列和基于图像的 ST 数据进行广泛的实验,证明了 SpaDiT 的有效性。我们将 SpaDiT 与八种高效的基线方法进行了比较,发现我们的方法在多个指标上的性能提高了 8%-12%。本文中使用的源代码和所有数据集均可在 https://github.com/wenwenmin/SpaDiThttps://zenodo.org/records/12792074 获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce7e/11541600/90eac7d252f0/bbae571f1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验