• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

谐波对齐

Harmonic Alignment.

作者信息

Stanley Jay S, Gigante Scott, Wolf Guy, Krishnaswamy Smita

机构信息

Yale University, Appl. Math. Prog.

Yale University, Comp. Bio. & Bioinf. Prog.

出版信息

Proc SIAM Int Conf Data Min. 2020;2020:316-324. doi: 10.1137/1.9781611976236.36.

DOI:10.1137/1.9781611976236.36
PMID:33723496
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7956229/
Abstract

We propose a novel framework for combining datasets via alignment of their intrinsic geometry. This alignment can be used to fuse data originating from disparate modalities, or to correct batch effects while preserving intrinsic data structure. Importantly, we do not assume any pointwise correspondence between datasets, but instead rely on correspondence between a (possibly unknown) subset of data features. We leverage this assumption to construct an isometric alignment between the data. This alignment is obtained by relating the expansion of data features in harmonics derived from diffusion operators defined over each dataset. These expansions encode each feature as a function of the data geometry. We use this to relate the diffusion coordinates of each dataset through our assumption of partial feature correspondence. Then, a unified diffusion geometry is constructed over the aligned data, which can also be used to correct the original data measurements. We demonstrate our method on several datasets, showing in particular its effectiveness in biological applications including fusion of single-cell RNA sequencing (scRNA-seq) and single-cell ATAC sequencing (scATAC-seq) data measured on the same population of cells, and removal of batch effect between biological samples.

摘要

我们提出了一种通过对齐数据集的内在几何结构来组合数据集的新颖框架。这种对齐可用于融合来自不同模态的数据,或在保留内在数据结构的同时校正批次效应。重要的是,我们不假设数据集之间存在任何逐点对应关系,而是依赖于数据特征的(可能未知的)子集之间的对应关系。我们利用这一假设来构建数据之间的等距对齐。这种对齐是通过关联在每个数据集上定义的扩散算子导出的谐波中数据特征的扩展来获得的。这些扩展将每个特征编码为数据几何的函数。我们利用这一点,通过我们的部分特征对应假设来关联每个数据集的扩散坐标。然后,在对齐的数据上构建统一的扩散几何结构,这也可用于校正原始数据测量。我们在几个数据集上展示了我们的方法,特别展示了其在生物学应用中的有效性,包括融合在同一细胞群体上测量的单细胞RNA测序(scRNA-seq)和单细胞ATAC测序(scATAC-seq)数据,以及消除生物样本之间的批次效应。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/73df/7956229/884d2f60d707/nihms-1591275-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/73df/7956229/5e3cefb8d280/nihms-1591275-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/73df/7956229/0f134adee38f/nihms-1591275-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/73df/7956229/f1102f54955f/nihms-1591275-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/73df/7956229/884d2f60d707/nihms-1591275-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/73df/7956229/5e3cefb8d280/nihms-1591275-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/73df/7956229/0f134adee38f/nihms-1591275-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/73df/7956229/f1102f54955f/nihms-1591275-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/73df/7956229/884d2f60d707/nihms-1591275-f0003.jpg

相似文献

1
Harmonic Alignment.谐波对齐
Proc SIAM Int Conf Data Min. 2020;2020:316-324. doi: 10.1137/1.9781611976236.36.
2
CBA: Cluster-Guided Batch Alignment for Single Cell RNA-seq.CBA:单细胞RNA测序的聚类引导批量比对
Front Genet. 2021 Apr 13;12:644211. doi: 10.3389/fgene.2021.644211. eCollection 2021.
3
BATMAN: Fast and Accurate Integration of Single-Cell RNA-Seq Datasets via Minimum-Weight Matching.蝙蝠侠:通过最小权重匹配实现单细胞RNA测序数据集的快速准确整合
iScience. 2020 Jun 26;23(6):101185. doi: 10.1016/j.isci.2020.101185. Epub 2020 May 20.
4
ChimPipe: accurate detection of fusion genes and transcription-induced chimeras from RNA-seq data.ChimPipe:从RNA测序数据中准确检测融合基因和转录诱导嵌合体。
BMC Genomics. 2017 Jan 3;18(1):7. doi: 10.1186/s12864-016-3404-9.
5
bayNorm: Bayesian gene expression recovery, imputation and normalization for single-cell RNA-sequencing data.bayNorm:用于单细胞 RNA-seq 数据的贝叶斯基因表达恢复、插补和标准化。
Bioinformatics. 2020 Feb 15;36(4):1174-1181. doi: 10.1093/bioinformatics/btz726.
6
GLOBE: a contrastive learning-based framework for integrating single-cell transcriptome datasets.GLOBE:基于对比学习的整合单细胞转录组数据集的框架。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac311.
7
JingleBells: A Repository of Immune-Related Single-Cell RNA-Sequencing Datasets.《铃儿响叮当》:一个免疫相关单细胞RNA测序数据集的储存库。
J Immunol. 2017 May 1;198(9):3375-3379. doi: 10.4049/jimmunol.1700272.
8
Combining white matter diffusion and geometry for tract-specific alignment and variability analysis.结合白质扩散和几何形态学进行束特异性配准和变异性分析。
Neuroimage. 2019 Oct 15;200:674-689. doi: 10.1016/j.neuroimage.2019.05.003. Epub 2019 May 13.
9
Impact of similarity metrics on single-cell RNA-seq data clustering.相似度度量对单细胞 RNA-seq 数据聚类的影响。
Brief Bioinform. 2019 Nov 27;20(6):2316-2326. doi: 10.1093/bib/bby076.
10
A multitask clustering approach for single-cell RNA-seq analysis in Recessive Dystrophic Epidermolysis Bullosa.一种用于隐性营养不良型大疱性表皮松解症的单细胞 RNA-seq 分析的多任务聚类方法。
PLoS Comput Biol. 2018 Apr 9;14(4):e1006053. doi: 10.1371/journal.pcbi.1006053. eCollection 2018 Apr.

引用本文的文献

1
LSMMD-MA: scaling multimodal data integration for single-cell genomics data analysis.LSMMD-MA:用于单细胞基因组数据分析的多模态数据整合的尺度不变方法。
Bioinformatics. 2023 Jul 1;39(7). doi: 10.1093/bioinformatics/btad420.
2
Con-AAE: contrastive cycle adversarial autoencoders for single-cell multi-omics alignment and integration.Con-AAE:用于单细胞多组学比对和整合的对比循环对抗自动编码器。
Bioinformatics. 2023 Apr 3;39(4). doi: 10.1093/bioinformatics/btad162.
3
Linking cells across single-cell modalities by synergistic matching of neighborhood structure.

本文引用的文献

1
Exploring single-cell data with deep multitasking neural networks.用深度多任务神经网络探索单细胞数据。
Nat Methods. 2019 Nov;16(11):1139-1145. doi: 10.1038/s41592-019-0576-7. Epub 2019 Oct 7.
2
Optimal Shrinkage of Eigenvalues in the Spiked Covariance Model.尖峰协方差模型中特征值的最优收缩
Ann Stat. 2018 Aug;46(4):1742-1778. doi: 10.1214/17-AOS1601. Epub 2018 Jun 27.
3
Joint profiling of chromatin accessibility and gene expression in thousands of single cells.在数千个单细胞中进行染色质可及性和基因表达的联合分析。
通过协同匹配邻域结构将单细胞模态中的细胞连接起来。
Bioinformatics. 2022 Sep 16;38(Suppl_2):ii148-ii154. doi: 10.1093/bioinformatics/btac481.
4
Single-cell multi-modal GAN reveals spatial patterns in single-cell data from triple-negative breast cancer.单细胞多模态生成对抗网络揭示三阴性乳腺癌单细胞数据中的空间模式。
Patterns (N Y). 2022 Sep 1;3(9):100577. doi: 10.1016/j.patter.2022.100577. eCollection 2022 Sep 9.
5
Multi-domain translation between single-cell imaging and sequencing data using autoencoders.基于自动编码器的单细胞成像和测序数据的多领域转换。
Nat Commun. 2021 Jan 4;12(1):31. doi: 10.1038/s41467-020-20249-2.
Science. 2018 Sep 28;361(6409):1380-1385. doi: 10.1126/science.aau0730. Epub 2018 Aug 30.
4
Recovering Gene Interactions from Single-Cell Data Using Data Diffusion.利用数据扩散从单细胞数据中恢复基因相互作用。
Cell. 2018 Jul 26;174(3):716-729.e27. doi: 10.1016/j.cell.2018.05.061. Epub 2018 Jun 28.
5
Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors.通过匹配相互最近邻,纠正单细胞 RNA 测序数据中的批次效应。
Nat Biotechnol. 2018 Jun;36(5):421-427. doi: 10.1038/nbt.4091. Epub 2018 Apr 2.
6
Kernel Manifold Alignment for Domain Adaptation.用于域适应的核流形对齐
PLoS One. 2016 Feb 12;11(2):e0148655. doi: 10.1371/journal.pone.0148655. eCollection 2016.
7
Circulating levels of tumour necrosis factor-alpha & interferon-gamma in patients with dengue & dengue haemorrhagic fever during an outbreak.登革热和登革出血热暴发期间患者体内肿瘤坏死因子-α和干扰素-γ的循环水平。
Indian J Med Res. 2006 Jan;123(1):25-30.
8
Synergy between interferon-gamma and tumor necrosis factor-alpha in transcriptional activation is mediated by cooperation between signal transducer and activator of transcription 1 and nuclear factor kappaB.γ干扰素与肿瘤坏死因子-α在转录激活中的协同作用是由信号转导和转录激活因子1与核因子κB之间的合作介导的。
J Biol Chem. 1997 Jun 6;272(23):14899-907. doi: 10.1074/jbc.272.23.14899.