• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

高维数据空间的特性:对探索基因和蛋白质表达数据的启示

The properties of high-dimensional data spaces: implications for exploring gene and protein expression data.

作者信息

Clarke Robert, Ressom Habtom W, Wang Antai, Xuan Jianhua, Liu Minetta C, Gehan Edmund A, Wang Yue

机构信息

Department of Oncology and Lombardi Comprehensive Cancer Center, Georgetown University School of Medicine, 3970 Reservoir Road NW, Washington, DC 20057, USA.

出版信息

Nat Rev Cancer. 2008 Jan;8(1):37-49. doi: 10.1038/nrc2294.

DOI:10.1038/nrc2294
PMID:18097463
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2238676/
Abstract

High-throughput genomic and proteomic technologies are widely used in cancer research to build better predictive models of diagnosis, prognosis and therapy, to identify and characterize key signalling networks and to find new targets for drug development. These technologies present investigators with the task of extracting meaningful statistical and biological information from high-dimensional data spaces, wherein each sample is defined by hundreds or thousands of measurements, usually concurrently obtained. The properties of high dimensionality are often poorly understood or overlooked in data modelling and analysis. From the perspective of translational science, this Review discusses the properties of high-dimensional data spaces that arise in genomic and proteomic studies and the challenges they can pose for data analysis and interpretation.

摘要

高通量基因组和蛋白质组技术在癌症研究中被广泛应用,以构建更好的诊断、预后和治疗预测模型,识别和表征关键信号网络,并寻找药物开发的新靶点。这些技术给研究人员带来了从高维数据空间中提取有意义的统计和生物学信息的任务,其中每个样本由成百上千次测量定义,这些测量通常是同时获得的。在数据建模和分析中,高维特性常常未被充分理解或被忽视。从转化科学的角度来看,本综述讨论了基因组和蛋白质组研究中出现的高维数据空间的特性,以及它们在数据分析和解释方面可能带来的挑战。

相似文献

1
The properties of high-dimensional data spaces: implications for exploring gene and protein expression data.高维数据空间的特性:对探索基因和蛋白质表达数据的启示
Nat Rev Cancer. 2008 Jan;8(1):37-49. doi: 10.1038/nrc2294.
2
Primary transcripts: From the discovery of RNA processing to current concepts of gene expression - Review.初级转录物:从 RNA 加工的发现到当前的基因表达概念 - 综述。
Exp Cell Res. 2018 Dec 15;373(1-2):1-33. doi: 10.1016/j.yexcr.2018.09.011. Epub 2018 Sep 26.
3
Approaches to working in high-dimensional data spaces: gene expression microarrays.高维数据空间中的工作方法:基因表达微阵列
Br J Cancer. 2008 Mar 25;98(6):1023-8. doi: 10.1038/sj.bjc.6604207. Epub 2008 Feb 19.
4
Introduction: Cancer Gene Networks.引言:癌症基因网络
Methods Mol Biol. 2017;1513:1-9. doi: 10.1007/978-1-4939-6539-7_1.
5
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学:基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍
6
[Proteomic analysis: why and how ?].
Bull Cancer. 2001 Jul;88(7):663-70.
7
Predicting censored survival data based on the interactions between meta-dimensional omics data in breast cancer.基于乳腺癌元维度组学数据间的相互作用预测删失生存数据。
J Biomed Inform. 2015 Aug;56:220-8. doi: 10.1016/j.jbi.2015.05.019. Epub 2015 Jun 3.
8
[Advances in high-throughput proteomic analysis].[高通量蛋白质组学分析的进展]
Se Pu. 2021 Feb;39(2):112-117. doi: 10.3724/SP.J.1123.2020.08023.
9
The future of cancer management: translating the genome, transcriptome, and proteome.癌症管理的未来:解读基因组、转录组和蛋白质组。
Ann Surg Oncol. 2003 Jan-Feb;10(1):7-14. doi: 10.1245/aso.2003.05.031.
10
Regularized binormal ROC method in disease classification using microarray data.使用微阵列数据进行疾病分类的正则化双法线ROC方法。
BMC Bioinformatics. 2006 May 9;7:253. doi: 10.1186/1471-2105-7-253.

引用本文的文献

1
A Hybrid Sequential Feature Selection Approach for Identifying New Potential mRNA Biomarkers for Usher Syndrome Using Machine Learning.一种使用机器学习识别Usher综合征新潜在mRNA生物标志物的混合序列特征选择方法。
Biomolecules. 2025 Jul 4;15(7):963. doi: 10.3390/biom15070963.
2
Integrating multi-omics and machine learning for disease resistance prediction in legumes.整合多组学和机器学习用于豆类抗病性预测
Theor Appl Genet. 2025 Jun 27;138(7):163. doi: 10.1007/s00122-025-04948-2.
3
A versatile information retrieval framework for evaluating profile strength and similarity.一种用于评估简档强度和相似度的通用信息检索框架。
Nat Commun. 2025 Jun 4;16(1):5181. doi: 10.1038/s41467-025-60306-2.
4
A dynamic and multimodal framework to define microglial states.一个用于定义小胶质细胞状态的动态多模态框架。
Nat Neurosci. 2025 May 20. doi: 10.1038/s41593-025-01978-3.
5
Multicollinearity and redundancy of the PET radiomic feature set.PET放射组学特征集的多重共线性和冗余性。
Eur Radiol. 2025 May 7. doi: 10.1007/s00330-025-11637-7.
6
Representation of high-dimensional cell morphology and morphodynamics in 2D latent space.二维潜在空间中高维细胞形态和形态动力学的表示。
Phys Biol. 2025 Apr 24;22(3). doi: 10.1088/1478-3975/adcd37.
7
Random k conditional nearest neighbor for high-dimensional data.用于高维数据的随机k条件最近邻
PeerJ Comput Sci. 2025 Jan 24;11:e2497. doi: 10.7717/peerj-cs.2497. eCollection 2025.
8
Integrating Model-Informed Drug Development With AI: A Synergistic Approach to Accelerating Pharmaceutical Innovation.将模型驱动的药物研发与人工智能相结合:加速药物创新的协同方法。
Clin Transl Sci. 2025 Jan;18(1):e70124. doi: 10.1111/cts.70124.
9
Embracing the informative missingness and silent gene in analyzing biologically diverse samples.分析具有生物多样性样本时要包容信息缺失和沉默基因。
Sci Rep. 2024 Nov 16;14(1):28265. doi: 10.1038/s41598-024-78076-0.
10
Preoperative prediction of CNS WHO grade and tumour aggressiveness in intracranial meningioma based on radiomics and structured semantics.基于放射组学和结构化语义学的颅内脑膜瘤 CNS WHO 分级和肿瘤侵袭性的术前预测。
Sci Rep. 2024 Sep 4;14(1):20586. doi: 10.1038/s41598-024-71200-0.

本文引用的文献

1
Ratio-based decisions and the quantitative analysis of cDNA microarray images.基于比率的决策与cDNA微阵列图像的定量分析
J Biomed Opt. 1997 Oct;2(4):364-74. doi: 10.1117/12.281504.
2
Novel approaches to smoothing and comparing SELDI TOF spectra.用于平滑和比较表面增强激光解吸电离飞行时间质谱(SELDI TOF)光谱的新方法。
Cancer Inform. 2005;1(1):78-85.
3
Classification algorithms for phenotype prediction in genomics and proteomics.基因组学和蛋白质组学中用于表型预测的分类算法。
Front Biosci. 2008 Jan 1;13:691-708. doi: 10.2741/2712.
4
Emergent unsupervised clustering paradigms with potential application to bioinformatics.具有生物信息学潜在应用的紧急无监督聚类范式。
Front Biosci. 2008 Jan 1;13:677-90. doi: 10.2741/2711.
5
Human X-box binding protein-1 confers both estrogen independence and antiestrogen resistance in breast cancer cell lines.人X盒结合蛋白1赋予乳腺癌细胞系雌激素非依赖性和抗雌激素抗性。
FASEB J. 2007 Dec;21(14):4013-27. doi: 10.1096/fj.06-7990com. Epub 2007 Jul 27.
6
Estrogen receptor alpha (ESR1) gene amplification is frequent in breast cancer.雌激素受体α(ESR1)基因扩增在乳腺癌中很常见。
Nat Genet. 2007 May;39(5):655-60. doi: 10.1038/ng2006. Epub 2007 Apr 8.
7
Identification of angiogenesis/metastases genes predicting chemoradiotherapy response in patients with laryngopharyngeal carcinoma.预测喉咽癌患者放化疗反应的血管生成/转移基因的鉴定
J Clin Oncol. 2007 Apr 10;25(11):1369-76. doi: 10.1200/JCO.2005.05.3397.
8
Microarray-based classification of a consecutive series of 121 childhood acute leukemias: prediction of leukemic and genetic subtype as well as of minimal residual disease status.基于微阵列技术对121例儿童急性白血病连续病例系列进行分类:白血病和基因亚型以及微小残留病状态的预测
Leukemia. 2007 Jun;21(6):1198-203. doi: 10.1038/sj.leu.2404688. Epub 2007 Apr 5.
9
Transcription regulation by mutant p53.突变型p53的转录调控
Oncogene. 2007 Apr 2;26(15):2202-11. doi: 10.1038/sj.onc.1210294.
10
Molecular definition of breast tumor heterogeneity.乳腺肿瘤异质性的分子定义。
Cancer Cell. 2007 Mar;11(3):259-73. doi: 10.1016/j.ccr.2007.01.013.