• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

泥化:从基因表达数据中准确估计泛癌症肿瘤纯度。

PUREE: accurate pan-cancer tumor purity estimation from gene expression data.

机构信息

Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), 60 Biopolis Street, Singapore, 138672, Republic of Singapore.

School of Computing, National University of Singapore, Computing 1, 13 Computing Drive, Singapore, 117417, Republic of Singapore.

出版信息

Commun Biol. 2023 Apr 11;6(1):394. doi: 10.1038/s42003-023-04764-8.

DOI:10.1038/s42003-023-04764-8
PMID:37041233
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10090153/
Abstract

Tumors are complex masses composed of malignant and non-malignant cells. Variation in tumor purity (proportion of cancer cells in a sample) can both confound integrative analysis and enable studies of tumor heterogeneity. Here we developed PUREE, which uses a weakly supervised learning approach to infer tumor purity from a tumor gene expression profile. PUREE was trained on gene expression data and genomic consensus purity estimates from 7864 solid tumor samples. PUREE predicted purity with high accuracy across distinct solid tumor types and generalized to tumor samples from unseen tumor types and cohorts. Gene features of PUREE were further validated using single-cell RNA-seq data from distinct tumor types. In a comprehensive benchmark, PUREE outperformed existing transcriptome-based purity estimation approaches. Overall, PUREE is a highly accurate and versatile method for estimating tumor purity and interrogating tumor heterogeneity from bulk tumor gene expression data, which can complement genomics-based approaches or be used in settings where genomic data is unavailable.

摘要

肿瘤是由恶性和非恶性细胞组成的复杂肿块。肿瘤纯度(样本中癌细胞的比例)的变化既会混淆综合分析,也能使肿瘤异质性研究成为可能。在这里,我们开发了 PUREE,它使用一种弱监督学习方法从肿瘤基因表达谱中推断肿瘤纯度。PUREE 是在来自 7864 个实体瘤样本的基因表达数据和基因组共识纯度估计值上进行训练的。PUREE 在不同的实体瘤类型中具有高精度的预测纯度,并推广到来自未见肿瘤类型和队列的肿瘤样本。使用来自不同肿瘤类型的单细胞 RNA-seq 数据进一步验证了 PUREE 的基因特征。在全面的基准测试中,PUREE 优于现有的基于转录组的纯度估计方法。总的来说,PUREE 是一种从肿瘤基因表达数据中估计肿瘤纯度和研究肿瘤异质性的高度准确和通用的方法,可以补充基于基因组学的方法,或在没有基因组数据的情况下使用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6545/10090153/0a49b7fe1cf0/42003_2023_4764_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6545/10090153/327bdffa35f7/42003_2023_4764_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6545/10090153/f1dcabc780b7/42003_2023_4764_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6545/10090153/a09dde95a619/42003_2023_4764_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6545/10090153/c19af1aedb0a/42003_2023_4764_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6545/10090153/ec657f372985/42003_2023_4764_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6545/10090153/0a49b7fe1cf0/42003_2023_4764_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6545/10090153/327bdffa35f7/42003_2023_4764_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6545/10090153/f1dcabc780b7/42003_2023_4764_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6545/10090153/a09dde95a619/42003_2023_4764_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6545/10090153/c19af1aedb0a/42003_2023_4764_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6545/10090153/ec657f372985/42003_2023_4764_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6545/10090153/0a49b7fe1cf0/42003_2023_4764_Fig6_HTML.jpg

相似文献

1
PUREE: accurate pan-cancer tumor purity estimation from gene expression data.泥化:从基因表达数据中准确估计泛癌症肿瘤纯度。
Commun Biol. 2023 Apr 11;6(1):394. doi: 10.1038/s42003-023-04764-8.
2
Prediction of tumor purity from gene expression data using machine learning.利用机器学习从基因表达数据预测肿瘤纯度。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab163.
3
Comprehensive Benchmarking and Integration of Tumor Microenvironment Cell Estimation Methods.全面基准测试和肿瘤微环境细胞估计方法的整合。
Cancer Res. 2019 Dec 15;79(24):6238-6246. doi: 10.1158/0008-5472.CAN-18-3560. Epub 2019 Oct 22.
4
Weakly-supervised tumor purity prediction from frozen H&E stained slides.从冷冻 H&E 染色切片中进行弱监督的肿瘤纯度预测。
EBioMedicine. 2022 Jun;80:104067. doi: 10.1016/j.ebiom.2022.104067. Epub 2022 May 26.
5
An assessment of computational methods for estimating purity and clonality using genomic data derived from heterogeneous tumor tissue samples.使用来自异质性肿瘤组织样本的基因组数据评估估计纯度和克隆性的计算方法。
Brief Bioinform. 2015 Mar;16(2):232-41. doi: 10.1093/bib/bbu002. Epub 2014 Feb 20.
6
Systematic Assessment of Tumor Purity and Its Clinical Implications.肿瘤纯度的系统评估及其临床意义
JCO Precis Oncol. 2020 Sep 4;4. doi: 10.1200/PO.20.00016. eCollection 2020.
7
EPIC: A Tool to Estimate the Proportions of Different Cell Types from Bulk Gene Expression Data.EPIC:一种从批量基因表达数据估计不同细胞类型比例的工具。
Methods Mol Biol. 2020;2120:233-248. doi: 10.1007/978-1-0716-0327-7_17.
8
Putative biomarkers for predicting tumor sample purity based on gene expression data.基于基因表达数据预测肿瘤样本纯度的候选生物标志物。
BMC Genomics. 2019 Dec 27;20(1):1021. doi: 10.1186/s12864-019-6412-8.
9
Impact of Tumor Purity on Immune Gene Expression and Clustering Analyses across Multiple Cancer Types.肿瘤纯度对多种癌症类型免疫基因表达和聚类分析的影响。
Cancer Immunol Res. 2018 Jan;6(1):87-97. doi: 10.1158/2326-6066.CIR-17-0201. Epub 2017 Nov 15.
10
AdRoit is an accurate and robust method to infer complex transcriptome composition.AdRoit 是一种推断复杂转录组组成的准确而稳健的方法。
Commun Biol. 2021 Oct 22;4(1):1218. doi: 10.1038/s42003-021-02739-1.

引用本文的文献

1
Single-Cell Transcriptomic Analysis Unveils Key Regulators and Signaling Pathways in Lung Adenocarcinoma Progression.单细胞转录组分析揭示肺腺癌进展中的关键调节因子和信号通路。
Biomedicines. 2025 Jun 30;13(7):1606. doi: 10.3390/biomedicines13071606.
2
Deep learning based deconvolution methods: A systematic review.基于深度学习的反卷积方法:系统综述
Comput Struct Biotechnol J. 2025 Jun 11;27:2544-2565. doi: 10.1016/j.csbj.2025.05.038. eCollection 2025.
3
The 3D genome of plasma cells in multiple myeloma.多发性骨髓瘤中浆细胞的三维基因组

本文引用的文献

1
Single-cell and bulk transcriptome sequencing identifies two epithelial tumor cell states and refines the consensus molecular classification of colorectal cancer.单细胞和批量转录组测序确定了两种上皮肿瘤细胞状态,并完善了结直肠癌的共识分子分类。
Nat Genet. 2022 Jul;54(7):963-975. doi: 10.1038/s41588-022-01100-4. Epub 2022 Jun 30.
2
A pan-cancer metabolic atlas of the tumor microenvironment.肿瘤微环境的泛癌代谢图谱
Cell Rep. 2022 May 10;39(6):110800. doi: 10.1016/j.celrep.2022.110800.
3
Integrative Profiling of T790M-Negative EGFR-Mutated NSCLC Reveals Pervasive Lineage Transition and Therapeutic Opportunities.
Sci Rep. 2025 Jun 2;15(1):19331. doi: 10.1038/s41598-025-03132-2.
4
Human papillomavirus integration induces oncogenic host gene fusions in oropharyngeal cancers.人乳头瘤病毒整合在口咽癌中诱导致癌性宿主基因融合。
Cancer Discov. 2025 May 14. doi: 10.1158/2159-8290.CD-24-1535.
5
A coregulatory influence map of glioblastoma heterogeneity and plasticity.胶质母细胞瘤异质性和可塑性的共调节影响图谱。
NPJ Precis Oncol. 2025 Apr 15;9(1):110. doi: 10.1038/s41698-025-00890-0.
6
Identification of Fanconi anemia pathway genes as novel prognostic biomarkers and therapeutic targets for breast cancer.鉴定范可尼贫血通路基因作为乳腺癌新的预后生物标志物和治疗靶点。
Transl Cancer Res. 2025 Feb 28;14(2):843-864. doi: 10.21037/tcr-24-772. Epub 2025 Feb 26.
7
Real-world data analysis for factors influencing the quality check status in FoundationOne CDx cancer genomic profiling tests.影响FoundationOne CDx癌症基因组分析测试质量检查状态的因素的真实世界数据分析。
Sci Rep. 2025 Feb 12;15(1):5167. doi: 10.1038/s41598-025-85846-x.
8
GBMPurity: A Machine Learning Tool for Estimating Glioblastoma Tumour Purity from Bulk RNA-seq Data.GBMPurity:一种用于从批量RNA测序数据估计胶质母细胞瘤肿瘤纯度的机器学习工具。
Neuro Oncol. 2025 Feb 1. doi: 10.1093/neuonc/noaf026.
9
Prognostic value of residual disease (RD) biology and gene expression changes during the neoadjuvant treatment in patients with HER2-positive early breast cancer (EBC).HER2阳性早期乳腺癌(EBC)患者新辅助治疗期间残留病灶(RD)生物学及基因表达变化的预后价值
Ann Oncol. 2025 Apr;36(4):403-413. doi: 10.1016/j.annonc.2024.12.010. Epub 2024 Dec 18.
10
Tumor-associated neutrophils attenuate the immunosensitivity of hepatocellular carcinoma.肿瘤相关中性粒细胞减弱肝细胞癌的免疫敏感性。
J Exp Med. 2025 Jan 6;222(1). doi: 10.1084/jem.20241442. Epub 2024 Dec 5.
T790M 阴性 EGFR 突变 NSCLC 的综合分析揭示了广泛的谱系转化和治疗机会。
Clin Cancer Res. 2021 Nov 1;27(21):5939-5950. doi: 10.1158/1078-0432.CCR-20-4607. Epub 2021 Jul 14.
4
Gene Set Knowledge Discovery with Enrichr.基因集知识发现与 Enrichr
Curr Protoc. 2021 Mar;1(3):e90. doi: 10.1002/cpz1.90.
5
Pan-Cancer Analysis of Ligand-Receptor Cross-talk in the Tumor Microenvironment.泛癌症分析肿瘤微环境中的配体-受体相互作用。
Cancer Res. 2021 Apr 1;81(7):1802-1812. doi: 10.1158/0008-5472.CAN-20-2352. Epub 2021 Feb 5.
6
Systematic Assessment of Tumor Purity and Its Clinical Implications.肿瘤纯度的系统评估及其临床意义
JCO Precis Oncol. 2020 Sep 4;4. doi: 10.1200/PO.20.00016. eCollection 2020.
7
Multimodal genomic features predict outcome of immune checkpoint blockade in non-small-cell lung cancer.多模态基因组特征预测非小细胞肺癌免疫检查点阻断的疗效。
Nat Cancer. 2020 Jan;1(1):99-111. doi: 10.1038/s43018-019-0008-8. Epub 2020 Jan 13.
8
Visualizing and interpreting cancer genomics data via the Xena platform.通过Xena平台可视化和解读癌症基因组学数据。
Nat Biotechnol. 2020 Jun;38(6):675-678. doi: 10.1038/s41587-020-0546-8.
9
Changing Technologies of RNA Sequencing and Their Applications in Clinical Oncology.RNA测序技术的变革及其在临床肿瘤学中的应用
Front Oncol. 2020 Apr 9;10:447. doi: 10.3389/fonc.2020.00447. eCollection 2020.
10
EPIC: A Tool to Estimate the Proportions of Different Cell Types from Bulk Gene Expression Data.EPIC:一种从批量基因表达数据估计不同细胞类型比例的工具。
Methods Mol Biol. 2020;2120:233-248. doi: 10.1007/978-1-0716-0327-7_17.