Sachs Michael C
National Cancer Institute, Biometric Research Branch, 9609 Medical Center Drive, Room 5W114, MSC 9735, Bethesda, MD 20892-9735, USA.
Chin Clin Oncol. 2015 Sep;4(3):29. doi: 10.3978/j.issn.2304-3865.2015.01.02.
High-throughput technologies enable the measurement of a large number of molecular characteristics from a small tissue specimen. High-dimensional molecular information (referred to as omics data) offers the possibility of predicting the future outcome of a patient (prognosis) and predicting the likely response to a specific treatment (prediction). Embedded in the vast amount of data is the hope that there exists some signal that will enable practitioners to deliver therapy personalized to the molecular profile of a tumor, thereby improving health outcomes. The challenges are to determine that the omics assays are valid and reproducible in a clinical setting, to develop a valid and optimal omics-based test that algorithmically determines the optimal treatment regime, to evaluate that test in a powerful and unbiased manner, and finally to demonstrate clinical utility: that the test under study improves clinical outcome as compared to not using the test. We review the statistical considerations involved in each of these stages, specifically dealing with the challenges of high-dimensional, omics data.
高通量技术能够从小的组织样本中测量大量分子特征。高维分子信息(称为组学数据)为预测患者未来的结果(预后)以及预测对特定治疗的可能反应(预测)提供了可能性。大量数据中蕴含着这样一种希望,即存在某种信号,使从业者能够根据肿瘤的分子特征提供个性化治疗,从而改善健康结果。面临的挑战包括确定组学检测在临床环境中是有效且可重复的,开发一种基于组学的有效且最优的检测方法,通过算法确定最佳治疗方案,以有力且无偏倚的方式评估该检测方法,最后证明其临床实用性:即与不使用该检测方法相比,所研究的检测方法能改善临床结果。我们回顾了每个阶段所涉及的统计考量,特别处理高维组学数据带来的挑战。