Complex Analysis Group, Translational Health Science and Technology Institute, NCR Biotech Science Cluster, Faridabad-Gurgaon Expressway, Faridabad, 121001, India.
Complex Analysis Group, Translational Health Science and Technology Institute, NCR Biotech Science Cluster, Faridabad-Gurgaon Expressway, Faridabad, 121001, India.
Methods. 2024 Dec;232:107-114. doi: 10.1016/j.ymeth.2024.11.002. Epub 2024 Nov 7.
Spatial transcriptomics (ST) enables the visualization of gene expression within the context of tissue morphology. This emerging discipline has the potential to serve as a foundation for developing tools to design precision medicines. However, due to the higher costs and expertise required for such experiments, its translation into a regular clinical practice might be challenging. Despite implementing modern deep learning to enhance information obtained from histological images using AI, efforts have been constrained by limitations in the diversity of information. In this paper, we developed a model, HistoSPACE, that explores the diversity of histological images available with ST data to extract molecular insights from tissue images. Further, our approach allows us to link the predicted expression with disease pathology. Our proposed study built an image encoder derived from a universal image autoencoder. This image encoder was connected to convolution blocks to build the final model. It was further fine-tuned with the help of ST-Data. The number of model parameters is small and requires lesser system memory and relatively lesser training time. Making it lightweight in comparison to traditional histological models. Our developed model demonstrates significant efficiency compared to contemporary algorithms, revealing a correlation of 0.56 in leave-one-out cross-validation. Finally, its robustness was validated through an independent dataset, showing similar prediction with predefined disease pathology. Our code is available at https://github.com/samrat-lab/HistoSPACE.
空间转录组学 (ST) 能够在组织形态学背景下可视化基因表达。这一新兴学科有可能成为开发精准医学工具的基础。然而,由于此类实验所需的成本和专业知识较高,将其转化为常规临床实践可能具有挑战性。尽管已经实施了现代深度学习技术,利用人工智能增强从组织学图像中获取的信息,但由于信息多样性的限制,这些努力受到了限制。在本文中,我们开发了一种名为 HistoSPACE 的模型,该模型利用 ST 数据探索了可供探索的组织学图像的多样性,以从组织图像中提取分子见解。此外,我们的方法允许我们将预测的表达与疾病病理学联系起来。我们提出的研究构建了一个源自通用图像自动编码器的图像编码器。该图像编码器与卷积块相连,构成最终模型。它进一步在 ST 数据的帮助下进行了微调。该模型的参数数量较少,需要较少的系统内存和相对较少的训练时间。与传统的组织学模型相比,它更加轻量级。与当代算法相比,我们开发的模型显示出显著的效率,在留一交叉验证中相关性为 0.56。最后,通过独立数据集验证了其稳健性,显示与预定义疾病病理学具有相似的预测。我们的代码可在 https://github.com/samrat-lab/HistoSPACE 上获得。