Department of Cancer Imaging and Metabolism, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, USA.
Magn Reson Imaging. 2012 Nov;30(9):1234-48. doi: 10.1016/j.mri.2012.06.010. Epub 2012 Aug 13.
"Radiomics" refers to the extraction and analysis of large amounts of advanced quantitative imaging features with high throughput from medical images obtained with computed tomography, positron emission tomography or magnetic resonance imaging. Importantly, these data are designed to be extracted from standard-of-care images, leading to a very large potential subject pool. Radiomics data are in a mineable form that can be used to build descriptive and predictive models relating image features to phenotypes or gene-protein signatures. The core hypothesis of radiomics is that these models, which can include biological or medical data, can provide valuable diagnostic, prognostic or predictive information. The radiomics enterprise can be divided into distinct processes, each with its own challenges that need to be overcome: (a) image acquisition and reconstruction, (b) image segmentation and rendering, (c) feature extraction and feature qualification and (d) databases and data sharing for eventual (e) ad hoc informatics analyses. Each of these individual processes poses unique challenges. For example, optimum protocols for image acquisition and reconstruction have to be identified and harmonized. Also, segmentations have to be robust and involve minimal operator input. Features have to be generated that robustly reflect the complexity of the individual volumes, but cannot be overly complex or redundant. Furthermore, informatics databases that allow incorporation of image features and image annotations, along with medical and genetic data, have to be generated. Finally, the statistical approaches to analyze these data have to be optimized, as radiomics is not a mature field of study. Each of these processes will be discussed in turn, as well as some of their unique challenges and proposed approaches to solve them. The focus of this article will be on images of non-small-cell lung cancer.
“放射组学”是指从计算机断层扫描、正电子发射断层扫描或磁共振成像获得的医学图像中提取和分析大量高通量的高级定量成像特征。重要的是,这些数据旨在从标准护理图像中提取,从而产生非常大的潜在研究对象库。放射组学数据是可开采的形式,可以用于构建描述性和预测性模型,将图像特征与表型或基因-蛋白质特征相关联。放射组学的核心假设是,这些模型可以包括生物或医学数据,可以提供有价值的诊断、预后或预测信息。放射组学的工作可以分为不同的过程,每个过程都有自己需要克服的挑战:(a) 图像采集和重建,(b) 图像分割和渲染,(c) 特征提取和特征定性,以及(d) 数据库和数据共享,最终用于(e)特定的信息学分析。这些单独的过程都提出了独特的挑战。例如,必须确定和协调最佳的图像采集和重建方案。此外,分割必须稳健且涉及最少的操作人员输入。必须生成能够稳健反映个体体积复杂性的特征,但不能过于复杂或冗余。此外,必须生成允许合并图像特征和图像注释以及医疗和遗传数据的信息学数据库。最后,必须优化分析这些数据的统计方法,因为放射组学不是一个成熟的研究领域。将依次讨论这些过程,以及它们的一些独特挑战和提出的解决方案。本文的重点将是非小细胞肺癌的图像。