Suppr超能文献

细胞动力学最优传输中Sinkhorn散度的偏差校正

Debiasing Sinkhorn divergence in optimal transport of cellular dynamics.

作者信息

Cooper Jayshawn, Young Christina, Lee Pilhwa

机构信息

Department of Mathematics, Morgan State University, 1700 E. Cold Spring Lane, Baltimore, 21251, MD, USA.

出版信息

bioRxiv. 2025 Jan 14:2025.01.11.632566. doi: 10.1101/2025.01.11.632566.

Abstract

Single-cell RNA-seq analysis characterizes developmental mechanisms of cellular differentiation, lineage determination, and reprogramming with differential conditioning of the microenvironment. In this article, the underlying dynamics are formulated via optimal transport with algorithms that calculate the transition probability of the state of cell dynamics over time. The algorithmic biases of optimal transport (OT) due to entropic regularization are balanced by Sinkhorn divergence, which normally de-biases the regularized transport by centering them. In the case of reprogramming mouse embryonic fibroblasts [1] with dense time points, Sinkhorn divergence is shown to improve the trajectories of targeted cell fates depending on the specific cell types. When the time points are filtered out with sparser 9 and 5 time points, some cell phenotypes show better outcomes from strong entropic regularization. For 9 time points with 2 days interval, Sinkhorn divergence shows a clear advantage with broad bandwith of optimal entropic regularization. For these derived time points, when the cell population is scaled down from to , there comes no benefit from Sinkhorn divergence for some specific cell types. In the case of stratifying morphogenesis of the epidermis [2], the sparsity of time points makes it not significant to prescribe Sinkhorn divergence in the accuracy of transporting to the expected cell fates. Overall, whether to prescribe Sinkhorn divergence for the accurate prediction of lineages of single cells depends on temporal sparsity.

摘要

单细胞RNA测序分析通过微环境的差异调节来表征细胞分化、谱系确定和重编程的发育机制。在本文中,潜在动态通过最优传输来构建,所使用的算法可计算细胞动态状态随时间的转移概率。最优传输(OT)由于熵正则化产生的算法偏差由Sinkhorn散度平衡,Sinkhorn散度通常通过将正则化传输中心化来消除偏差。在用密集时间点对小鼠胚胎成纤维细胞进行重编程的情况下[1],Sinkhorn散度显示出根据特定细胞类型改善目标细胞命运轨迹的作用。当用更稀疏的9个和5个时间点过滤时间点时,一些细胞表型在强熵正则化下显示出更好的结果。对于间隔2天的9个时间点,Sinkhorn散度在最优熵正则化的宽带宽下显示出明显优势。对于这些导出的时间点,当细胞群体从 缩小到 时,对于某些特定细胞类型,Sinkhorn散度没有带来益处。在表皮分层形态发生的情况下[2],时间点的稀疏性使得在向预期细胞命运传输的准确性方面规定Sinkhorn散度并不重要。总体而言,是否规定Sinkhorn散度以准确预测单细胞谱系取决于时间稀疏性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e70e/11761365/b52fb936025d/nihpp-2025.01.11.632566v1-f0001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验