• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过残差阈值深度矩阵分解对噪声药物基因组数据集进行大规模信息检索和校正。

Large-scale information retrieval and correction of noisy pharmacogenomic datasets through residual thresholded deep matrix factorization.

作者信息

Tom Hu Zhiyue, Yu Yaodong, Chen Ruoqiao, Yeh Shan-Ju, Chen Bin, Huang Haiyan

机构信息

Division of Biostatistics, University of California Berkeley, Berkeley, CA 94720, United States.

Department of Electrical Engineer and Computer Science, University of California Berkeley, Berkeley, CA 94720, United States.

出版信息

Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf226.

DOI:10.1093/bib/bbaf226
PMID:40420482
Abstract

Pharmacogenomics studies are attracting an increasing amount of interest from researchers in precision medicine. The advances in high-throughput experiments and multiplexed approaches allow the large-scale quantification of drug sensitivities in molecularly characterized cancer cell lines (CCLs), resulting in a number of open drug sensitivity datasets for drug biomarker discovery. However, a significant inconsistency in drug sensitivity values among these datasets has been noted. Such inconsistency indicates the presence of substantial noise, subsequently hindering downstream analyses. To address the noise in drug sensitivity data, we introduce a robust and scalable deep learning framework, Residual Thresholded Deep Matrix Factorization (RT-DMF). This method takes a single drug sensitivity data matrix as its sole input and outputs a corrected and imputed matrix. Deep matrix factorization (DMF) excels at uncovering subtle patterns, due to its minimal reliance on data structure assumptions. This attribute significantly boosts DMF's ability to identify complex hidden patterns among nuisance effects in the data, thereby facilitating the detection of signals that are therapeutically relevant. Furthermore, RT-DMF incorporates an iterative residual thresholding procedure, which plays a crucial role in retaining signals more likely to hold therapeutic importance. Validation using simulated datasets and real pharmacogenomics datasets demonstrates the effectiveness of our approach in correcting noise and imputing missing data in drug sensitivity datasets (open-source package available at https://github.com/tomwhoooo/rtdmf).

摘要

药物基因组学研究正吸引着精准医学领域研究人员越来越多的关注。高通量实验和多重方法的进展使得在分子特征明确的癌细胞系(CCLs)中能够大规模定量药物敏感性,从而产生了许多用于药物生物标志物发现的公开药物敏感性数据集。然而,已注意到这些数据集之间药物敏感性值存在显著不一致。这种不一致表明存在大量噪声,进而阻碍了下游分析。为了解决药物敏感性数据中的噪声问题,我们引入了一个强大且可扩展的深度学习框架,即残差阈值深度矩阵分解(RT-DMF)。该方法以单个药物敏感性数据矩阵作为唯一输入,并输出一个经过校正和插补的矩阵。深度矩阵分解(DMF)在揭示微妙模式方面表现出色,因为它对数据结构假设的依赖最小。这一特性显著增强了DMF识别数据中干扰效应之间复杂隐藏模式的能力,从而便于检测具有治疗相关性的信号。此外,RT-DMF纳入了一个迭代残差阈值化程序,该程序在保留更可能具有治疗重要性的信号方面起着关键作用。使用模拟数据集和真实药物基因组学数据集进行的验证证明了我们的方法在纠正药物敏感性数据集中的噪声和插补缺失数据方面的有效性(开源软件包可在https://github.com/tomwhoooo/rtdmf获取)。

相似文献

1
Large-scale information retrieval and correction of noisy pharmacogenomic datasets through residual thresholded deep matrix factorization.通过残差阈值深度矩阵分解对噪声药物基因组数据集进行大规模信息检索和校正。
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf226.
2
Large-Scale Information Retrieval and Correction of Noisy Pharmacogenomic Datasets through Residual Thresholded Deep Matrix Factorization.通过残差阈值深度矩阵分解对有噪声的药物基因组数据集进行大规模信息检索与校正
bioRxiv. 2023 Dec 8:2023.12.07.570723. doi: 10.1101/2023.12.07.570723.
3
AICM: A Genuine Framework for Correcting Inconsistency Between Large Pharmacogenomics Datasets.AICM:纠正大型药物基因组学数据集之间不一致性的真正框架。
Pac Symp Biocomput. 2019;24:248-259.
4
Improved anticancer drug response prediction in cell lines using matrix factorization with similarity regularization.使用具有相似性正则化的矩阵分解改进细胞系中抗癌药物反应预测。
BMC Cancer. 2017 Aug 2;17(1):513. doi: 10.1186/s12885-017-3500-5.
5
DeepDRK: a deep learning framework for drug repurposing through kernel-based multi-omics integration.DeepDRK:一种基于核的多组学整合的药物重定位深度学习框架。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab048.
6
Bayesian semi-nonnegative matrix tri-factorization to identify pathways associated with cancer phenotypes.贝叶斯半非负矩阵三因子分解鉴定与癌症表型相关的途径。
Pac Symp Biocomput. 2020;25:427-438.
7
Hessian Regularized [Formula: see text]-Nonnegative Matrix Factorization and Deep Learning for miRNA-Disease Associations Prediction.Hessian 正则化[公式:见正文]-非负矩阵分解及其在 miRNA-疾病关联预测中的深度学习应用。
Interdiscip Sci. 2024 Mar;16(1):176-191. doi: 10.1007/s12539-023-00594-8. Epub 2023 Dec 15.
8
DeepMF: deciphering the latent patterns in omics profiles with a deep learning method.DeepMF:一种利用深度学习方法解析组学图谱中潜在模式的方法。
BMC Bioinformatics. 2019 Dec 27;20(Suppl 23):648. doi: 10.1186/s12859-019-3291-6.
9
Prediction of anticancer drug sensitivity using an interpretable model guided by deep learning.利用深度学习指导的可解释模型预测抗癌药物敏感性。
BMC Bioinformatics. 2024 May 9;25(1):182. doi: 10.1186/s12859-024-05669-x.
10
Cancer Drug Sensitivity Prediction Based on Deep Transfer Learning.基于深度迁移学习的癌症药物敏感性预测
Int J Mol Sci. 2025 Mar 10;26(6):2468. doi: 10.3390/ijms26062468.

本文引用的文献

1
TransCell: In Silico Characterization of Genomic Landscape and Cellular Responses by Deep Transfer Learning.TransCell:基于深度迁移学习的基因组景观和细胞反应的计算特征分析。
Genomics Proteomics Bioinformatics. 2024 Jul 3;22(2). doi: 10.1093/gpbjnl/qzad008.
2
DBDNMF: A Dual Branch Deep Neural Matrix Factorization method for drug response prediction.DBDNMF:一种用于药物反应预测的双分支深度神经矩阵分解方法。
PLoS Comput Biol. 2024 Apr 4;20(4):e1012012. doi: 10.1371/journal.pcbi.1012012. eCollection 2024 Apr.
3
Predicting Drug Response Based on Multi-Omics Fusion and Graph Convolution.
基于多组学融合和图卷积的药物反应预测。
IEEE J Biomed Health Inform. 2022 Mar;26(3):1384-1393. doi: 10.1109/JBHI.2021.3102186. Epub 2022 Mar 7.
4
Deep generative neural network for accurate drug response imputation.深度生成神经网络在药物反应预测中的应用
Nat Commun. 2021 Mar 19;12(1):1740. doi: 10.1038/s41467-021-21997-5.
5
Veridical data science.真实数据科学。
Proc Natl Acad Sci U S A. 2020 Feb 25;117(8):3920-3929. doi: 10.1073/pnas.1901326117. Epub 2020 Feb 13.
6
Identification of pathways associated with chemosensitivity through network embedding.通过网络嵌入识别与化疗敏感性相关的途径。
PLoS Comput Biol. 2019 Mar 20;15(3):e1006864. doi: 10.1371/journal.pcbi.1006864. eCollection 2019 Mar.
7
AICM: A Genuine Framework for Correcting Inconsistency Between Large Pharmacogenomics Datasets.AICM:纠正大型药物基因组学数据集之间不一致性的真正框架。
Pac Symp Biocomput. 2019;24:248-259.
8
Revisiting inconsistency in large pharmacogenomic studies.重新审视大型药物基因组学研究中的不一致性。
F1000Res. 2016 Sep 16;5:2333. doi: 10.12688/f1000research.9611.3. eCollection 2016.
9
Consistency in drug response profiling.药物反应谱的一致性。
Nature. 2016 Nov 30;540(7631):E5-E6. doi: 10.1038/nature20171.
10
Drug response consistency in CCLE and CGP.CCLE和CGP中的药物反应一致性。
Nature. 2016 Nov 30;540(7631):E9-E10. doi: 10.1038/nature20580.