• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Doubly Stochastic Normalization of the Gaussian Kernel Is Robust to Heteroskedastic Noise.高斯核的双重随机归一化对异方差噪声具有鲁棒性。
SIAM J Math Data Sci. 2021;3(1):388-413. doi: 10.1137/20M1342124. Epub 2021 Mar 23.
2
Understanding Symmetric Smoothing Filters: A Gaussian Mixture Model Perspective.理解对称平滑滤波器:高斯混合模型视角。
IEEE Trans Image Process. 2017 Nov;26(11):5107-5121. doi: 10.1109/TIP.2017.2731208. Epub 2017 Jul 24.
3
Scalable Kernel Ordinal Regression via Doubly Stochastic Gradients.通过双重随机梯度实现可扩展内核序数回归
IEEE Trans Neural Netw Learn Syst. 2021 Aug;32(8):3677-3689. doi: 10.1109/TNNLS.2020.3015937. Epub 2021 Aug 3.
4
Accounting for outliers and heteroskedasticity in multibreed genetic evaluations of postweaning gain of Nelore-Hereford cattle.考虑在肉牛-赫里福德牛断奶后增重的多品种遗传评估中的异常值和异方差性。
J Anim Sci. 2007 Apr;85(4):909-18. doi: 10.2527/jas.2006-668. Epub 2006 Dec 18.
5
Human Motion Segmentation via Robust Kernel Sparse Subspace Clustering.基于鲁棒核稀疏子空间聚类的人体运动分割。
IEEE Trans Image Process. 2018;27(1):135-150. doi: 10.1109/TIP.2017.2738562.
6
Diffusion maps for high-dimensional single-cell analysis of differentiation data.用于分化数据高维单细胞分析的扩散映射
Bioinformatics. 2015 Sep 15;31(18):2989-98. doi: 10.1093/bioinformatics/btv325. Epub 2015 May 21.
7
Discovering transition phenomena from data of stochastic dynamical systems with Lévy noise.从具有 Lévy 噪声的随机动力系统数据中发现转变现象。
Chaos. 2020 Sep;30(9):093110. doi: 10.1063/5.0004450.
8
Introducing User-Prescribed Constraints in Markov Chains for Nonlinear Dimensionality Reduction.在马尔可夫链中引入用户规定的约束以进行非线性降维。
Neural Comput. 2019 May;31(5):980-997. doi: 10.1162/neco_a_01184. Epub 2019 Mar 18.
9
Efficient implementation of the gaussian kernel algorithm in estimating invariants and noise level from noisy time series data.高斯核算法在从噪声时间序列数据估计不变量和噪声水平中的高效实现。
Phys Rev E Stat Phys Plasmas Fluids Relat Interdiscip Topics. 2000 Apr;61(4 Pt A):3750-6. doi: 10.1103/physreve.61.3750.
10
A Doubly Stochastic Change Point Detection Algorithm for Noisy Biological Signals.一种用于有噪声生物信号的双重随机变化点检测算法
Front Physiol. 2018 Jan 5;8:1112. doi: 10.3389/fphys.2017.01112. eCollection 2017.

引用本文的文献

1
Bi-stochastically normalized graph Laplacian: convergence to manifold Laplacian and robustness to outlier noise.双随机归一化图拉普拉斯算子:向流形拉普拉斯算子的收敛性及对离群噪声的鲁棒性
Inf inference. 2024 Sep 20;13(4):iaae026. doi: 10.1093/imaiai/iaae026. eCollection 2024 Dec.
2
Feature Optimization Method of Material Identification for Loose Particles Inside Sealed Relays.密封继电器内部松散颗粒的材料鉴别特征优化方法。
Sensors (Basel). 2022 May 7;22(9):3566. doi: 10.3390/s22093566.

本文引用的文献

1
Demystifying "drop-outs" in single-cell UMI data.破解单细胞 UMI 数据中的“dropout”现象。
Genome Biol. 2020 Aug 6;21(1):196. doi: 10.1186/s13059-020-02096-y.
2
Determining sequencing depth in a single-cell RNA-seq experiment.确定单细胞 RNA-seq 实验中的测序深度。
Nat Commun. 2020 Feb 7;11(1):774. doi: 10.1038/s41467-020-14482-y.
3
Normalization and variance stabilization of single-cell RNA-seq data using regularized negative binomial regression.使用正则化负二项式回归进行单细胞 RNA-seq 数据的归一化和方差稳定化。
Genome Biol. 2019 Dec 23;20(1):296. doi: 10.1186/s13059-019-1874-1.
4
Feature selection and dimension reduction for single-cell RNA-Seq based on a multinomial model.基于多项模型的单细胞 RNA-Seq 特征选择和降维。
Genome Biol. 2019 Dec 23;20(1):295. doi: 10.1186/s13059-019-1861-6.
5
Fast interpolation-based t-SNE for improved visualization of single-cell RNA-seq data.基于快速插值的 t-SNE 用于改善单细胞 RNA-seq 数据的可视化。
Nat Methods. 2019 Mar;16(3):243-245. doi: 10.1038/s41592-018-0308-4. Epub 2019 Feb 11.
6
Single-cell RNA-seq reveals new types of human blood dendritic cells, monocytes, and progenitors.单细胞RNA测序揭示了人类血液中新型树突状细胞、单核细胞和祖细胞。
Science. 2017 Apr 21;356(6335). doi: 10.1126/science.aah4573.
7
Graph Laplacian Regularization for Image Denoising: Analysis in the Continuous Domain.图拉普拉斯正则化在图像去噪中的应用:连续域分析。
IEEE Trans Image Process. 2017 Apr;26(4):1770-1785. doi: 10.1109/TIP.2017.2651400. Epub 2017 Jan 11.
8
Massively parallel digital transcriptional profiling of single cells.大规模平行数字化单细胞转录组分析。
Nat Commun. 2017 Jan 16;8:14049. doi: 10.1038/ncomms14049.
9
Div-Seq: Single-nucleus RNA-Seq reveals dynamics of rare adult newborn neurons.Div-Seq:单核RNA测序揭示了成年期罕见新生神经元的动态变化。
Science. 2016 Aug 26;353(6302):925-8. doi: 10.1126/science.aad7038. Epub 2016 Jul 28.
10
Dissecting the multicellular ecosystem of metastatic melanoma by single-cell RNA-seq.通过单细胞RNA测序剖析转移性黑色素瘤的多细胞生态系统
Science. 2016 Apr 8;352(6282):189-96. doi: 10.1126/science.aad0501.

高斯核的双重随机归一化对异方差噪声具有鲁棒性。

Doubly Stochastic Normalization of the Gaussian Kernel Is Robust to Heteroskedastic Noise.

作者信息

Landa Boris, Coifman Ronald R, Kluger Yuval

机构信息

Program in Applied Mathematics, Yale University.

Interdepartmental Program in Computational Biology and Bioinformatics, Yale University.

出版信息

SIAM J Math Data Sci. 2021;3(1):388-413. doi: 10.1137/20M1342124. Epub 2021 Mar 23.

DOI:10.1137/20M1342124
PMID:34124607
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8194191/
Abstract

A fundamental step in many data-analysis techniques is the construction of an affinity matrix describing similarities between data points. When the data points reside in Euclidean space, a widespread approach is to from an affinity matrix by the Gaussian kernel with pairwise distances, and to follow with a certain normalization (e.g. the row-stochastic normalization or its symmetric variant). We demonstrate that the doubly-stochastic normalization of the Gaussian kernel with zero main diagonal (i.e., no self loops) is robust to heteroskedastic noise. That is, the doubly-stochastic normalization is advantageous in that it automatically accounts for observations with different noise variances. Specifically, we prove that in a suitable high-dimensional setting where heteroskedastic noise does not concentrate too much in any particular direction in space, the resulting (doubly-stochastic) noisy affinity matrix converges to its clean counterpart with rate , where is the ambient dimension. We demonstrate this result numerically, and show that in contrast, the popular row-stochastic and symmetric normalizations behave unfavorably under heteroskedastic noise. Furthermore, we provide examples of simulated and experimental single-cell RNA sequence data with intrinsic heteroskedasticity, where the advantage of the doubly-stochastic normalization for exploratory analysis is evident.

摘要

许多数据分析技术的一个基本步骤是构建一个描述数据点之间相似性的亲和矩阵。当数据点位于欧几里得空间时,一种广泛采用的方法是通过高斯核与成对距离来构建亲和矩阵,并随后进行某种归一化(例如行随机归一化或其对称变体)。我们证明,主对角线为零(即无自环)的高斯核的双随机归一化对异方差噪声具有鲁棒性。也就是说,双随机归一化的优势在于它能自动考虑具有不同噪声方差的观测值。具体而言,我们证明,在一个合适的高维环境中,当异方差噪声在空间的任何特定方向上都不会过度集中时,所得的(双随机)噪声亲和矩阵以速率 收敛到其无噪声对应矩阵,其中 是环境维度。我们通过数值演示了这一结果,并表明相比之下,流行的行随机归一化和对称归一化在异方差噪声下表现不佳。此外,我们提供了具有内在异方差性的模拟和实验单细胞RNA序列数据的示例,其中双随机归一化在探索性分析中的优势显而易见。