稀疏表示学习从艾伦老鼠大脑图谱中获得具有明确基因权重的生物学特征。

Sparse representation learning derives biological features with explicit gene weights from the Allen Mouse Brain Atlas.

机构信息

School for Biological and Health Systems Engineering, Arizona State University, Tempe, Arizona, United States of America.

Department of Mathematics, Tufts University, Medford, Massachusetts, United States of America.

出版信息

PLoS One. 2023 Mar 6;18(3):e0282171. doi: 10.1371/journal.pone.0282171. eCollection 2023.

DOI:10.1371/journal.pone.0282171

PMID:36877707

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9987823/

Abstract

Unsupervised learning methods are commonly used to detect features within transcriptomic data and ultimately derive meaningful representations of biology. Contributions of individual genes to any feature however becomes convolved with each learning step, requiring follow up analysis and validation to understand what biology might be represented by a cluster on a low dimensional plot. We sought learning methods that could preserve the gene information of detected features, using the spatial transcriptomic data and anatomical labels of the Allen Mouse Brain Atlas as a test dataset with verifiable ground truth. We established metrics for accurate representation of molecular anatomy to find sparse learning approaches were uniquely capable of generating anatomical representations and gene weights in a single learning step. Fit to labeled anatomy was highly correlated with intrinsic properties of the data, offering a means to optimize parameters without established ground truth. Once representations were derived, complementary gene lists could be further compressed to generate a low complexity dataset, or to probe for individual features with >95% accuracy. We demonstrate the utility of sparse learning as a means to derive biologically meaningful representations from transcriptomic data and reduce the complexity of large datasets while preserving intelligible gene information throughout the analysis.

摘要

无监督学习方法常用于检测转录组数据中的特征，并最终得出生物学的有意义表示。然而，单个基因对任何特征的贡献都与每个学习步骤交织在一起，需要进行后续分析和验证，以了解低维图谱上的聚类代表什么生物学。我们寻求能够保留检测到的特征的基因信息的学习方法，使用 Allen 小鼠大脑图谱的空间转录组数据和解剖标签作为测试数据集，具有可验证的真实信息。我们建立了用于准确表示分子解剖结构的指标，发现稀疏学习方法能够在单个学习步骤中生成独特的解剖表示和基因权重。与有标签的解剖结构的拟合与数据的内在特性高度相关，为在没有既定真实信息的情况下优化参数提供了一种方法。一旦得到表示，就可以进一步压缩补充的基因列表，以生成一个低复杂度的数据集，或者以 95%以上的准确率探测单个特征。我们展示了稀疏学习作为一种从转录组数据中提取有生物学意义的表示并降低大型数据集复杂性的方法的实用性，同时在整个分析过程中保留可理解的基因信息。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e13d/9987823/90761782f4fc/pone.0282171.g001.jpg

相似文献

Sparse representation learning derives biological features with explicit gene weights from the Allen Mouse Brain Atlas.

PLoS One. 2023 Mar 6;18(3):e0282171. doi: 10.1371/journal.pone.0282171. eCollection 2023.

Assessing the replicability of spatial gene expression using atlas data from the adult mouse brain.

PLoS Biol. 2021 Jul 19;19(7):e3001341. doi: 10.1371/journal.pbio.3001341. eCollection 2021 Jul.

Spage2vec: Unsupervised representation of localized spatial gene expression signatures.

FEBS J. 2021 Mar;288(6):1859-1870. doi: 10.1111/febs.15572. Epub 2020 Oct 11.

Discover mouse gene coexpression landscapes using dictionary learning and sparse coding.

Brain Struct Funct. 2017 Dec;222(9):4253-4270. doi: 10.1007/s00429-017-1460-9. Epub 2017 Jun 29.

Unsupervised pattern identification in spatial gene expression atlas reveals mouse brain regions beyond established ontology.

Proc Natl Acad Sci U S A. 2024 Sep 10;121(37):e2319804121. doi: 10.1073/pnas.2319804121. Epub 2024 Sep 3.

Transcriptome Architecture of Adult Mouse Brain Revealed by Sparse Coding of Genome-Wide In Situ Hybridization Images.

Neuroinformatics. 2017 Jul;15(3):285-295. doi: 10.1007/s12021-017-9333-1.

A high-resolution transcriptomic and spatial atlas of cell types in the whole mouse brain.

Nature. 2023 Dec;624(7991):317-332. doi: 10.1038/s41586-023-06812-z. Epub 2023 Dec 13.

Subspace learning using low-rank latent representation learning and perturbation theorem: Unsupervised gene selection.

Comput Biol Med. 2025 Feb;185:109567. doi: 10.1016/j.compbiomed.2024.109567. Epub 2024 Dec 14.

Automated gene expression pattern annotation in the mouse brain.

Pac Symp Biocomput. 2015;20:144-55.

Deep learning in single-cell and spatial transcriptomics data analysis: advances and challenges from a data science perspective.

Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf136.

本文引用的文献

High-resolution alignment of single-cell and spatial transcriptomes with CytoSPACE.

Nat Biotechnol. 2023 Nov;41(11):1543-1548. doi: 10.1038/s41587-023-01697-9. Epub 2023 Mar 6.

Spatial omics and multiplexed imaging to explore cancer biology.

Nat Methods. 2021 Sep;18(9):997-1012. doi: 10.1038/s41592-021-01203-6. Epub 2021 Aug 2.

Contrastive self-supervised clustering of scRNA-seq data.

BMC Bioinformatics. 2021 May 27;22(1):280. doi: 10.1186/s12859-021-04210-8.

Single-Cell RNA Sequencing in Parkinson's Disease.

Biomedicines. 2021 Apr 1;9(4):368. doi: 10.3390/biomedicines9040368.

A Comparison for Dimensionality Reduction Methods of Single-Cell RNA-seq Data.

Front Genet. 2021 Mar 23;12:646936. doi: 10.3389/fgene.2021.646936. eCollection 2021.

scREAD: A Single-Cell RNA-Seq Database for Alzheimer's Disease.

iScience. 2020 Nov 5;23(11):101769. doi: 10.1016/j.isci.2020.101769. eCollection 2020 Nov 20.

Visualizing Single-Cell RNA-seq Data with Semisupervised Principal Component Analysis.

Int J Mol Sci. 2020 Aug 12;21(16):5797. doi: 10.3390/ijms21165797.

Molecular atlas of the adult mouse brain.

Sci Adv. 2020 Jun 26;6(26):eabb3446. doi: 10.1126/sciadv.abb3446. eCollection 2020 Jun.

The art of using t-SNE for single-cell transcriptomics.

Nat Commun. 2019 Nov 28;10(1):5416. doi: 10.1038/s41467-019-13056-x.

Towards understanding sparse filtering: A theoretical perspective.

Neural Netw. 2018 Feb;98:154-177. doi: 10.1016/j.neunet.2017.11.010. Epub 2017 Dec 9.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

稀疏表示学习从艾伦老鼠大脑图谱中获得具有明确基因权重的生物学特征。

Sparse representation learning derives biological features with explicit gene weights from the Allen Mouse Brain Atlas.

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献