• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

AICM:纠正大型药物基因组学数据集之间不一致性的真正框架。

AICM: A Genuine Framework for Correcting Inconsistency Between Large Pharmacogenomics Datasets.

作者信息

Hu Zhiyue Tom, Ye Yuting, Newbury Patrick A, Huang Haiyan, Chen Bin

机构信息

Department of Biostatistics, University of California, Berkeley, USA,

出版信息

Pac Symp Biocomput. 2019;24:248-259.

PMID:30864327
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6417811/
Abstract

The inconsistency of open pharmacogenomics datasets produced by different studies limits the usage of such datasets in many tasks, such as biomarker discovery. Investigation of multiple pharmacogenomics datasets confirmed that the pairwise sensitivity data correlation between drugs, or rows, across different studies (drug-wise) is relatively low, while the pairwise sensitivity data correlation between cell-lines, or columns, across different studies (cell-wise) is considerably strong. This common interesting observation across multiple pharmacogenomics datasets suggests the existence of subtle consistency among the different studies (i.e., strong cell-wise correlation). However, significant noises are also shown (i.e., weak drug-wise correlation) and have prevented researchers from comfortably using the data directly. Motivated by this observation, we propose a novel framework for addressing the inconsistency between large-scale pharmacogenomics data sets. Our method can significantly boost the drug-wise correlation and can be easily applied to re-summarized and normalized datasets proposed by others. We also investigate our algorithm based on many different criteria to demonstrate that the corrected datasets are not only consistent, but also biologically meaningful. Eventually, we propose to extend our main algorithm into a framework, so that in the future when more datasets become publicly available, our framework can hopefully offer a "ground-truth" guidance for references.

摘要

不同研究产生的开放药物基因组学数据集的不一致性限制了此类数据集在许多任务中的使用,比如生物标志物发现。对多个药物基因组学数据集的调查证实,不同研究之间(按药物)药物或行之间的成对敏感性数据相关性相对较低,而不同研究之间(按细胞系)细胞系或列之间的成对敏感性数据相关性相当强。多个药物基因组学数据集的这一常见有趣观察结果表明不同研究之间存在细微的一致性(即强细胞系相关性)。然而,也存在显著噪声(即弱药物相关性),这使得研究人员无法直接轻松地使用这些数据。受此观察结果的启发,我们提出了一种新颖的框架来解决大规模药物基因组学数据集之间的不一致性。我们的方法可以显著提高药物相关性,并且可以轻松应用于他人提出的重新汇总和标准化的数据集。我们还基于许多不同标准对我们的算法进行了研究,以证明校正后的数据集不仅一致,而且具有生物学意义。最终,我们提议将我们的主要算法扩展为一个框架,以便在未来当更多数据集公开可用时,我们的框架有望为参考提供“真实”指导。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/243a/6417811/180debf57060/nihms-999800-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/243a/6417811/07ab1c20d788/nihms-999800-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/243a/6417811/270c6ce17f72/nihms-999800-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/243a/6417811/eb4113c72f04/nihms-999800-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/243a/6417811/18141bb8ab77/nihms-999800-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/243a/6417811/180debf57060/nihms-999800-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/243a/6417811/07ab1c20d788/nihms-999800-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/243a/6417811/270c6ce17f72/nihms-999800-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/243a/6417811/eb4113c72f04/nihms-999800-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/243a/6417811/18141bb8ab77/nihms-999800-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/243a/6417811/180debf57060/nihms-999800-f0005.jpg

相似文献

1
AICM: A Genuine Framework for Correcting Inconsistency Between Large Pharmacogenomics Datasets.AICM:纠正大型药物基因组学数据集之间不一致性的真正框架。
Pac Symp Biocomput. 2019;24:248-259.
2
Systematic assessment of analytical methods for drug sensitivity prediction from cancer cell line data.基于癌细胞系数据预测药物敏感性的分析方法的系统评估。
Pac Symp Biocomput. 2014:63-74.
3
Large-scale integration of heterogeneous pharmacogenomic data for identifying drug mechanism of action.整合大规模异质药物基因组数据以确定药物作用机制。
Pac Symp Biocomput. 2018;23:44-55.
4
Improved anticancer drug response prediction in cell lines using matrix factorization with similarity regularization.使用具有相似性正则化的矩阵分解改进细胞系中抗癌药物反应预测。
BMC Cancer. 2017 Aug 2;17(1):513. doi: 10.1186/s12885-017-3500-5.
5
Stepwise group sparse regression (SGSR): gene-set-based pharmacogenomic predictive models with stepwise selection of functional priors.逐步分组稀疏回归(SGSR):基于基因集的药物基因组学预测模型,具有功能先验的逐步选择。
Pac Symp Biocomput. 2015;20:32-43.
6
PACdb: a database for cell-based pharmacogenomics.PACdb:一个基于细胞的药物基因组学数据库。
Pharmacogenet Genomics. 2010 Apr;20(4):269-73. doi: 10.1097/FPC.0b013e328337b8d6.
7
Drug Intervention Response Predictions with PARADIGM (DIRPP) identifies drug resistant cancer cell lines and pathway mechanisms of resistance.使用PARADIGM进行药物干预反应预测(DIRPP)可识别耐药癌细胞系及耐药的通路机制。
Pac Symp Biocomput. 2014:125-35.
8
Comparison and validation of genomic predictors for anticancer drug sensitivity.比较和验证预测抗癌药物敏感性的基因组标志物。
J Am Med Inform Assoc. 2013 Jul-Aug;20(4):597-602. doi: 10.1136/amiajnl-2012-001442. Epub 2013 Jan 26.
9
Integrating heterogeneous drug sensitivity data from cancer pharmacogenomic studies.整合癌症药物基因组学研究中的异质药物敏感性数据。
Oncotarget. 2016 Aug 9;7(32):51619-51625. doi: 10.18632/oncotarget.10010.
10
Evaluating the consistency of large-scale pharmacogenomic studies.评估大规模药物基因组学研究的一致性。
Brief Bioinform. 2019 Sep 27;20(5):1734-1753. doi: 10.1093/bib/bby046.

引用本文的文献

1
Large-scale information retrieval and correction of noisy pharmacogenomic datasets through residual thresholded deep matrix factorization.通过残差阈值深度矩阵分解对噪声药物基因组数据集进行大规模信息检索和校正。
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf226.
2
TransCell: In Silico Characterization of Genomic Landscape and Cellular Responses by Deep Transfer Learning.TransCell:基于深度迁移学习的基因组景观和细胞反应的计算特征分析。
Genomics Proteomics Bioinformatics. 2024 Jul 3;22(2). doi: 10.1093/gpbjnl/qzad008.
3
Large-Scale Information Retrieval and Correction of Noisy Pharmacogenomic Datasets through Residual Thresholded Deep Matrix Factorization.

本文引用的文献

1
The Cellosaurus, a Cell-Line Knowledge Resource.细胞osaurus,一个细胞系知识资源库。
J Biomol Tech. 2018 Jul;29(2):25-38. doi: 10.7171/jbt.18-2902-002. Epub 2018 May 10.
2
A machine learning approach to integrate big data for precision medicine in acute myeloid leukemia.一种用于整合大数据以实现急性髓系白血病精准医疗的机器学习方法。
Nat Commun. 2018 Jan 3;9(1):42. doi: 10.1038/s41467-017-02465-5.
3
Revisiting inconsistency in large pharmacogenomic studies.重新审视大型药物基因组学研究中的不一致性。
通过残差阈值深度矩阵分解对有噪声的药物基因组数据集进行大规模信息检索与校正
bioRxiv. 2023 Dec 8:2023.12.07.570723. doi: 10.1101/2023.12.07.570723.
4
Reassessing pharmacogenomic cell sensitivity with multilevel statistical models.用多层次统计模型重新评估药物基因组细胞敏感性。
Biostatistics. 2023 Oct 18;24(4):901-921. doi: 10.1093/biostatistics/kxac010.
5
Precision Medicine: Improving health through high-resolution analysis of personal data.精准医学:通过对个人数据的高分辨率分析改善健康状况。
Pac Symp Biocomput. 2019;24:220-223.
F1000Res. 2016 Sep 16;5:2333. doi: 10.12688/f1000research.9611.3. eCollection 2016.
4
Consistency in drug response profiling.药物反应谱的一致性。
Nature. 2016 Nov 30;540(7631):E5-E6. doi: 10.1038/nature20171.
5
Drug response consistency in CCLE and CGP.CCLE和CGP中的药物反应一致性。
Nature. 2016 Nov 30;540(7631):E9-E10. doi: 10.1038/nature20580.
6
A Landscape of Pharmacogenomic Interactions in Cancer.癌症中的药物基因组学相互作用全景
Cell. 2016 Jul 28;166(3):740-754. doi: 10.1016/j.cell.2016.06.017. Epub 2016 Jul 7.
7
Growth rate inhibition metrics correct for confounders in measuring sensitivity to cancer drugs.生长速率抑制指标可校正癌症药物敏感性测量中的混杂因素。
Nat Methods. 2016 Jun;13(6):521-7. doi: 10.1038/nmeth.3853. Epub 2016 May 2.
8
Aiming High--Changing the Trajectory for Cancer.志存高远——改变癌症发展轨迹
N Engl J Med. 2016 May 19;374(20):1901-4. doi: 10.1056/NEJMp1600894. Epub 2016 Apr 4.
9
Functional Genomic Landscape of Human Breast Cancer Drivers, Vulnerabilities, and Resistance.人类乳腺癌驱动因素、脆弱性和抗性的功能基因组图谱
Cell. 2016 Jan 14;164(1-2):293-309. doi: 10.1016/j.cell.2015.11.062.
10
Leveraging big data to transform target selection and drug discovery.利用大数据变革靶点选择与药物研发。
Clin Pharmacol Ther. 2016 Mar;99(3):285-97. doi: 10.1002/cpt.318.